The goal was to train the autonomous robot to navigate real-world environments. The client wanted to annotate 10 thousand images obtained from the robot's RBG camera to train the neural network. The quality of data labelling was paramount because the robot was going to navigate the streets of a metropolitan centre, and the cost of navigation error was high. It led to a large number of annotation classes.