Contact Us
USE CASE

Children Speech Dataset

Dataset for training a neural network to recognize children's speech for voice assistants and children's versions of applications
NLP The system's ability to understand, analyze, and interpret human languages
ASR Technology for converting human speech into text format
Machine Learning The system's ability to automatically interpret data and predict outcomes
Data Collection Gathering suitable data for subsequent labeling
1 000
audio recordings
8 weeks
project duration

CASE DESCRIPTION

The dataset consists of 5,000 audio materials, collected through crowdsourcing platforms and an internal team of AI trainers

Audio recordings of children’s voices for training a voice assistant. Each child should record 1 video, 6 audios from prepared sentences, and 3 improvisations

Data format:

mp3 and xml – a file with a transcript

APPLICATION AREAS OF THE DATASET

ASR

to develop a system for automatic recognition and transcription of children's speech recordings

NLP and data classification

for systems for automatically determining age or age category of users

Data collection

for the internal database of LLM services that work with children's audiences

FACTORS AFFECTING THE FINAL PROJECT COST INCLUDE

Our data quality guarantee is 95%. For annotation orders requiring quality above 95%, we offer enterprise solutions

Why
Training Data

  • Quality Assurance:
  • Enhanced Data Accuracy
  • Consistency in Labels
  • Reliable Ground Truth
  • Mitigation of Annotation Biases
  • Cost and Time Efficiency
  • Data Security and Confidentiality:
  • GDPR Compliance
  • Non-disclosure agreement
  • Data Encryption
  • Multiple data storage options
  • Access Controls and Authentication
  • Expert Team:
  • 6 years in industry
  • 35 top project managers
  • 40+ languages
  • 100+ countries
  • 250k+ assessors
  • Flexible and Scalable Solutions:
  • 24/7 availability of customer service
  • 100% post payment
  • $550 minimum check
  • Variable Workload
  • Customized Solutions

Team leads project

Wadim Starosotnikow
Senior quality control manager
Arthur Kazukevich
Python-developer
Maria Kuzmina
Project manager
woman

Tell us about your project!