jniimi/tripadvisor-review-rating
This dataset contains hotel reviews and ratings collected from TripAdvisor. After processing, only the review text and multiple aspect scores are retained. Originally released by Jiwei Li et al., the processed data is provided as a single pandas DataFrame. It is primarily intended for aspect‑based sentiment analysis (ABSA). The dataset includes columns such as hotel ID, user ID, review title, review text, overall rating, cleanliness rating, and others.
Description
Dataset Overview
Dataset Name
- Name: TripAdvisor Easy Dataset
- Alias: sentiment
Dataset Features
- Feature List:
hotel_id: Unique hotel identifier, typeint64user_id: Unique user identifier, typestringtitle: Review title submitted by the user, typestringtext: Full review body, typestringoverall: Overall rating given by the user, typefloat64cleanliness: Cleanliness rating, typefloat64value: Value rating, typefloat64location: Location rating, typefloat64rooms: Room rating, typefloat64sleep_quality: Sleep quality rating, typefloat64stay_year: Year of stay, typeint64post_date: Review publication date, typetimestamp[ns]freq: Frequency, typeint64review: Full review, typestringchar: Number of characters, typeint64lang: Language, typestring
Dataset Splits
- Training Set:
num_examples: 201295num_bytes: 368237342
Dataset Size
- Download Size: 220909380
- Dataset Size: 368237342
Dataset Configuration
- Default Configuration:
config_name: defaultdata_files:split: trainpath: data/train-*
Task Categories
- Task: text-classification
Language
- Language: en
Size Category
- Size: 10K<n<100K
License
- License: Apache-2.0
Intended Uses
- Direct Use: Suitable for Aspect‑based Sentiment Analysis (ABSA)
- Out‑of‑Scope Use: Follow the original data source policy
Dataset Structure
- Column Information:
hotel_id: Unique hotel identifieruser_id: Unique user identifiertitle: Review titletext: Review bodyreview: Combined title + bodyoverall: Overall ratingcleanliness: Cleanliness ratingvalue: Value ratinglocation: Location ratingrooms: Room ratingsleep_quality: Sleep quality ratingdate_stayed: Stay datedate: Review publication date
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Please login to view download links and access full dataset details.
Topics
Source
Organization: hugging_face
Created: Unknown
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.