Back to datasets
Dataset assetClassic DatasetSentiment AnalysisNatural Language Processing
Yelp Reviews Dataset
The dataset comprises Yelp review data for sentiment analysis, specifically comparing the effectiveness of BERT and RoBERTa models on Yelp review sentiment classification.
Source
github
Created
Nov 29, 2023
Updated
Dec 2, 2023
Signals
386 views
Availability
Linked source ready
Overview
Dataset description and usage context
Dataset Overview
This dataset is used for sentiment analysis, focusing on Yelp reviews. It compares two advanced models—Hugging Face's bert-base-multilingual-uncased and cardiffnlp/twitter-roberta-base-sentiment-latest—to analyze sentiment expressions in the reviews.
Model Usage
- BERT Multilingual Uncased: Suitable for understanding multiple languages, especially useful for the diverse linguistic characteristics of Yelp reviews.
- Twitter RoBERTa: Fine‑tuned for sentiment analysis, excels at capturing nuanced English sentiment.
Dataset Source
- Yelp-provided review dataset.
- Dataset link: Yelp Dataset
Need downstream help?
Pair the dataset with AI analysis and content workflows.
Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.