Contact Us
LICENSE

Text from the Goods Dataset

This dataset is designed to assist with text recognition tasks in different languages
OCR Optical character recognition is a process that converts printed texts into digital image files
Data labelling The process of identifying objects in photos for training a system to recognize and interpret them
Bounding box A type of annotation used in computer vision that refers to a box drawn around an object in an image or video
Computer Vision Ability of a machine to interpret, analyze, and understand visual data
150+
languages
200+
contries
<10
M types of covers and goods

TECHNICAL SPECIFICATIONS

Two types of images with text:

Advertising:

  • names of organizations, posters, billboards, stickers and banners most often filmed on the street

Products:

  • food, cosmetics, personal hygiene items, book covers and video games filmed indoors

Two types of lighting:

Daylight:

  • filmed indoors or outdoors in daylight

Night:

  • filmed in the dark outdoors or indoors

Types of data labelling:

Bounding Box:

  • labelling for each sequence of letters or numbers

OCR:

  • labelling for the selected sequence, including punctuation

APPLICATION AREAS OF THE DATASET

Retail trade:

OCR for creating mobile applications for recognizing products on store shelves and obtaining real-time information about products and prices

Inventory management:

Classification and computer vision to optimize inventory management, prevent product shortages, and excess inventory

Marketing and analytics:

Computer vision and classification for evaluating the effectiveness of product placement and optimizing merchandising

Urban infrastructure:

Computer vision for optimizing the placement of advertising banners and improving the urban environment

FACTORS AFFECTING THE FINAL PROJECT COST INCLUDE

Our data quality guarantee is 95%. For annotation orders requiring quality above 95%, we offer enterprise solutions
How it works?
How is the data collected?
Is it possible to get a part of the data?
Do you provide additional labeling for the dataset?
What is the price of the dataset?

Why
Training Data

  • Quality Assurance:
  • Enhanced Data Accuracy
  • Consistency in Labels
  • Reliable Ground Truth
  • Mitigation of Annotation Biases
  • Cost and Time Efficiency
  • Data Security and Confidentiality:
  • GDPR Compliance
  • Non-disclosure agreement
  • Data Encryption
  • Multiple data storage options
  • Access Controls and Authentication
  • Expert Team:
  • 6 years in industry
  • 35 top project managers
  • 40+ languages
  • 100+ countries
  • 250k+ assessors
  • Flexible and Scalable Solutions:
  • 24/7 availability of customer service
  • 100% post payment
  • $550 minimum check
  • Variable Workload
  • Customized Solutions

Team leads project

Alexey Antushenya
Operations manager
Sergey Razumny
TeamLead Crowd Solutions
Maria Kuzmina
Project manager
woman

Tell us about your project!