DATASET
Open Source Community
ACM-1, ACM-2, ACM-3, MovieLens, Douban Movie, Douban Book, Amazon, LastFM
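A collection of heterogeneous graph datasets: the ACM series, MovieLens, Douban Movie, Douban Book, Amazon, and LastFM, along with the Yelp, DBLP, Aminer, IMDB, and SLAP datasets described below. Each entry lists its entity types and entity counts, and most also list relation counts, for use in heterogeneous graph analysis.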
Updated 5/6/2024
github
Description
ACM-1
- Entity: Paper, Author, Conf, Term (paper feature), Index (paper label)
- Statistics:
- Paper: 12,500
- Term: 300
- Index: 11
ACM-2
- Entity: Paper, Author, Subject, Term (paper feature), Research area (paper label)
- Statistics:
- Paper: 3,025
- Author: 5,835
- Subject: 56
- Term: 1,830
- Research area: 3
ACM-3
- Entity: Paper, Author, Affiliations, Term, Subjects
- Statistics:
- Paper: 12,000
- Author: 17,000
- Affiliations: 1,800
- Term: 1,500
- Subjects: 73
MovieLens
- Entity: User, Age, Occupation, Movie, Genre
- Statistics:
- User: 943
- Age: 8
- Occupation: 21
- Movie: 1,682
- Genre: 18
- Relation Statistics:
- User - Movie: 100,000
- User - User (KNN): 47,150
- User - Age: 943
- User - Occupation: 943
- Movie - Movie (KNN): 82,798
- Movie - Genre: 2,861
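As a quick illustration of how these counts can be used, the following minimal sketch computes the density of the User - Movie relation, that is, the fraction of all possible user-movie pairs that actually have an edge. The function name `bipartite_density` is hypothetical and chosen for this example only; the counts are copied from the statistics above, and the sketch assumes each edge is a distinct user-movie pair.

```python
# Counts copied from the MovieLens statistics listed above.
NUM_USERS = 943
NUM_MOVIES = 1_682
NUM_USER_MOVIE_EDGES = 100_000


def bipartite_density(num_src: int, num_dst: int, num_edges: int) -> float:
    """Fraction of possible source-destination pairs that have an edge.

    Assumes every counted edge is a distinct (source, destination) pair.
    """
    return num_edges / (num_src * num_dst)


density = bipartite_density(NUM_USERS, NUM_MOVIES, NUM_USER_MOVIE_EDGES)
print(f"User - Movie density: {density:.4f}")
```

Under that assumption, roughly 6% of the possible user-movie pairs are present, so the interaction matrix is sparse.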
Douban Movie
- Entity: User, Movie, Group, Actor, Director, Type
- Statistics:
- User: 13,367
- Movie: 12,677
- Group: 2,753
- Actor: 6,311
- Director: 2,449
- Type: 38
- Relation Statistics:
- User - Movie: 1,068,278
- User - Group: 570,047
- User - User: 4,085
- Movie - Actor: 33,587
- Movie - Director: 11,276
- Movie - Type: 27,668
Douban Book
- Entity: User, Book, Group, Location, Author, Publisher, Year
- Statistics:
- User: 13,024
- Book: 22,347
- Group: 2,936
- Location: 38
- Author: 10,805
- Publisher: 1,815
- Year: 64
- Relation Statistics:
- User - Book: 792,062
- User - Group: 1,189,271
- User - User: 169,150
- User - Location: 10,592
- Book - Author: 21,907
- Book - Publisher: 21,773
- Book - Year: 21,192
Amazon
- Entity: User, Item, View, Category, Brand
- Statistics:
- User: 6,170
- Item: 2,753
- View: 3,857
- Category: 22
- Brand: 334
- Relation Statistics:
- User - Item: 195,791
- Item - View: 5,694
- Item - Category: 5,508
- Item - Brand: 2,753
LastFM
- Entity: User, Artist, Tag
- Statistics:
- User: 1,892
- Artist: 17,632
- Tag: 11,945
- Relation Statistics:
- User - Artist: 92,834
- User - User (Original): 25,434
- User - User (KNN): 18,802
- Artist - Artist (KNN): 153,399
- Artist - Tag: 184,941
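One possible way to hold such a schema in code is a small table of node counts plus a list of typed relations. The sketch below encodes the LastFM numbers above; the `Relation` class and the `edges_per_source_node` helper are hypothetical names introduced for illustration and are not part of any dataset loader. Whether same-type relations such as User - User count each pair once or twice is not stated in the source, so the per-node averages for those relations are only indicative.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Relation:
    """One typed edge set in a heterogeneous graph (hypothetical helper)."""
    src: str
    dst: str
    num_edges: int
    note: str = ""


# Node and edge counts copied from the LastFM statistics listed above.
NODE_COUNTS = {"User": 1_892, "Artist": 17_632, "Tag": 11_945}

RELATIONS = [
    Relation("User", "Artist", 92_834),
    Relation("User", "User", 25_434, note="Original"),
    Relation("User", "User", 18_802, note="KNN"),
    Relation("Artist", "Artist", 153_399, note="KNN"),
    Relation("Artist", "Tag", 184_941),
]


def edges_per_source_node(rel: Relation) -> float:
    """Average number of edges of this relation per source-type node."""
    return rel.num_edges / NODE_COUNTS[rel.src]


for rel in RELATIONS:
    label = f"{rel.src} - {rel.dst}" + (f" ({rel.note})" if rel.note else "")
    print(f"{label}: {edges_per_source_node(rel):.2f} edges per {rel.src}")
```

Keeping the note ("Original", "KNN") on the relation rather than in the type names lets the two User - User edge sets coexist without inventing extra node types.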
Yelp
- Entity: User, Business, Compliment, Category, City
- Statistics:
- User: 16,239
- Business: 14,284
- Compliment: 11
- Category: 47
- City: 511
- Relation Statistics:
- User - Business: 198,397
- User - User: 158,590
- User - Compliment: 76,875
- Business - City: 14,267
- Business - Category: 40,009
Yelp-2
- Entity: User, Business, Service, Star level, Reservation, Category
- Statistics:
- User: 1,286
- Business: 2,614
- Service: 2
- Star level: 9
- Reservation: 2
- Category: 3
- Relation Statistics:
- User - Business: 30,838
- Business - Service: 2,614
- Business - Star level: 2,614
- Business - Reservation: 2,614
- Business - Category: 2,614
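In the Yelp-2 relation statistics, every Business - attribute relation has exactly as many edges as there are businesses. That is consistent with each business carrying a single value per attribute, although the counts alone do not prove it. A minimal sketch of that consistency check, using a hypothetical helper name `single_valued_candidates`, could look like this:

```python
# Counts copied from the Yelp-2 statistics listed above.
NUM_BUSINESSES = 2_614
BUSINESS_ATTRIBUTE_EDGES = {
    "Service": 2_614,
    "Star level": 2_614,
    "Reservation": 2_614,
    "Category": 2_614,
}


def single_valued_candidates(num_entities: int, edge_counts: dict[str, int]) -> list[str]:
    """Attributes whose edge count equals the entity count.

    A match only suggests one value per entity; the counts cannot rule out
    some entities having two values while others have none.
    """
    return [name for name, count in edge_counts.items() if count == num_entities]


candidates = single_valued_candidates(NUM_BUSINESSES, BUSINESS_ATTRIBUTE_EDGES)
print("Possibly single-valued business attributes:", candidates)
```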
DBLP-1
- Entity: Author, Paper, Author_label, Conference, Type
- Statistics:
- Author: 14,475
- Paper: 14,376
- Author_label: 4
- Conference: 20
- Type: 8,920
- Relation Statistics:
- Author - Label: 4,057
- Paper - Author: 41,794
- Paper - Conference: 14,376
- Paper - Type: 114,624
DBLP-2
- Entity: Paper, Author, Conf, Term, Profile (author feature), Research area (author label)
- Statistics:
- Paper: 14,328
- Author: 4,057
- Conf: 20
- Term: 8,789
- Profile: 334
- Research area: 4
Aminer
- Entity: Author, Paper, Paper_label, Conference, Reference
- Statistics:
- Author: 164,472
- Paper: 127,623
- Paper_label: 10
- Conference: 101
- Reference: 147,251
- Relation Statistics:
- Paper - Label: 127,623
- Paper - Author: 355,072
- Paper - Conference: 127,632
- Paper - Reference: 392,519
IMDB
- Entity: Movie, Actress, Actor, Director, Plot (movie feature), Genre (movie label)
- Statistics:
- Movie: 14,475
- Plot: 1,000
- Genre: 9
SLAP
- Entity: Gene, Ontology (gene feature), Tissue, Pathway, Disease, Chemical Compound, Family (gene label)
- Statistics:
- Gene: 20,419
- Ontology: 3,000
- Family: 15
Topics
Heterogeneous Graph Analysis
Multi-domain Data
Source
Organization: github
Created: 9/12/2019