Reviews Dataset
Case
A dataset for training a neural network in tasks of matching text descriptions of first impressions of a person with a photograph of that person

Кейс Сбор впечатлений о людях

Generation
The ability to automatically create plausible objects
Machine Learning
The system's ability to automatically interpret data and predict outcomes
NLP
The system's ability to understand, analyze, and interpret human languages
Data Collection
Gathering suitable data for subsequent labeling
100 000
text descriptions
5 weeks
duration
Case Description
The dataset was collected on the "Toloka" platform by Roman Kutsev as part of his personal project in 2019 - "Impressor"
User would send any photo in the chat, and within 5 minutes, a bot described its first impression of the person in the photo
The dataset contains 100,000 textual descriptions of people's photos. Each text fragment was checked for grammatical errors, insults, profanity, and other ethical violations
APPLICATION AREAS OF THE DATASET
  • NLP for analyzing opinions about people on social media.
Social Media Sentiment Analysis:
  • NLP for determining potential clients or buyers
Identifying Potential Customers:
  • NLP to improve the performance of recommender systems, for example, movie or product recommendation systems
Recommender Systems:

Didn't find the necessary information?

Leave a request for consultation with our Account manager to get the price of the dataset!
THE FINAL COST OF THE PROJECT IS INFLUENCED BY:
Scope of work
Markup complexity
Timing
Markup quality
Our data quality guarantee is 95%. When ordering markup with quality above 95%, we offer enterprise solutions
Tell us about your project!