Back to datasets
Dataset assetClassic DatasetSentiment AnalysisNatural Language Processing

Yelp Reviews Dataset

The dataset comprises Yelp review data for sentiment analysis, specifically comparing the effectiveness of BERT and RoBERTa models on Yelp review sentiment classification.

Source
github
Created
Nov 29, 2023
Updated
Dec 2, 2023
Signals
386 views
Availability
Linked source ready
Overview

Dataset description and usage context

Dataset Overview

This dataset is used for sentiment analysis, focusing on Yelp reviews. It compares two advanced models—Hugging Face's bert-base-multilingual-uncased and cardiffnlp/twitter-roberta-base-sentiment-latest—to analyze sentiment expressions in the reviews.

Model Usage

  • BERT Multilingual Uncased: Suitable for understanding multiple languages, especially useful for the diverse linguistic characteristics of Yelp reviews.
  • Twitter RoBERTa: Fine‑tuned for sentiment analysis, excels at capturing nuanced English sentiment.

Dataset Source

Need downstream help?

Pair the dataset with AI analysis and content workflows.

Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.

Explore AI studio