Back to datasets
Dataset assetOpen Source CommunityHeterogeneous Graph AnalysisMulti-domain Data
ACM-1, ACM-2, ACM-3, MovieLens, Douban Movie, Douban Book, Amazon, LastFM
These datasets include the ACM series, MovieLens, Douban Movie, Douban Book, Amazon, and LastFM; each provides detailed entity and relationship statistics for heterogeneous graph analysis.
Source
github
Created
Sep 12, 2019
Updated
May 6, 2024
Signals
230 views
Availability
Linked source ready
Overview
Dataset description and usage context
ACM-1
- Entity: Paper, Author, Conf, Term (paper feature), Index(paper label)
- Statistics:
- Paper: 12,500
- Term: 300
- Index: 11
ACM-2
- Entity: Paper, Author, Subject, Term (paper feature), Research area(paper label)
- Statistics:
- Paper: 3,025
- Author: 5,835
- Subject: 56
- Term: 1,830
- Research area: 3
ACM-3
- Entity: Paper, Author, Affiliations, Term, Subjects
- Statistics:
- Paper: 12,000
- Author: 17,000
- Affiliations: 1,800
- Term: 1,500
- Subjects: 73
MovieLens
- Entity: User, Age, Occupation, Movie, Genre
- Statistics:
- User: 943
- Age: 8
- Occupation: 21
- Movie: 1,682
- Genre: 18
- Relation Statistics:
- User - Movie: 100,000
- User - User (KNN): 47,150
- User - Age: 943
- User - Occupation: 943
- Movie - Movie (KNN): 82,798
- Movie - Genre: 2,861
Douban Movie
- Entity: User, Movie, Group, Actor, Director, Type
- Statistics:
- User: 13,367
- Movie: 12,677
- Group: 2,753
- Actor: 6,311
- Director: 2,449
- Type: 38
- Relation Statistics:
- User - Movie: 1,068,278
- User - Group: 570,047
- User - User: 4,085
- Movie - Actor: 33,587
- Movie - Director: 11,276
- Movie - Type: 27,668
Douban Book
- Entity: User, Book, Group, Location, Author, Publisher, Year
- Statistics:
- User: 13,024
- Book: 22,347
- Group: 2,936
- Location: 38
- Author: 10,805
- Publisher: 1,815
- Year: 64
- Relation Statistics:
- User - Book: 792,062
- User - Group: 1,189,271
- User - User: 169,150
- User - Location: 10,592
- Book - Author: 21,907
- Book - Publisher: 21,773
- Book - Year: 21,192
Amazon
- Entity: User, Item, View, Category, Brand
- Statistics:
- User: 6,170
- Item: 2,753
- View: 3,857
- Category: 22
- Brand: 334
- Relation Statistics:
- User - Item: 195,791
- Item - View: 5,694
- Item - Category: 5,508
- Item - Brand: 2,753
LastFM
- Entity: User, Artist, Tag
- Statistics:
- User: 1,892
- Artist: 17,632
- Tag: 11,945
- Relation Statistics:
- User - Artist: 92,834
- User - User (Original): 25,434
- User - User (KNN): 18,802
- Artist - Artist (KNN): 153,399
- Artist - Tag: 184,941
Yelp
- Entity: User, Business, Compliment, Category, City
- Statistics:
- User: 16,239
- Business: 14,284
- Compliment: 11
- Category: 47
- City: 511
- Relation Statistics:
- User - Business: 198,397
- User - User: 158,590
- User - Compliment: 76,875
- Business - City: 14,267
- Business - Category: 40,009
Yelp-2
- Entity: User, Business, Service, Star level, Reservation, Category
- Statistics:
- User: 1,286
- Business: 2,614
- Service: 2
- Star level: 9
- Reservation: 2
- Category: 3
- Relation Statistics:
- User - Business: 30,838
- Business - Service: 2,614
- Business - Star level: 2,614
- Business - Reservation: 2,614
- Business - Category: 2,614
DBLP-1
- Entity: Author, Paper, Author_label, Conference, Type
- Statistics:
- Author: 14,475
- Paper: 14,376
- Author_label: 4
- Conference: 20
- Type: 8,920
- Relation Statistics:
- Author - Label: 4,057
- Paper - Author: 41,794
- Paper - Conference: 14,376
- Paper - Type: 114,624
DBLP-2
- Entity: Paper, Author, Conf, Term, Profile(author feature), Research area(author label)
- Statistics:
- Paper: 14,328
- Author: 4,057
- Conf: 20
- Term: 8,789
- Profile: 334
- Research area: 4
Aminer
- Entity: Author, Paper, Papel_label, Conference, Reference
- Statistics:
- Author: 164,472
- Paper: 127,623
- Papel_label: 10
- Conference: 101
- Reference: 147,251
- Relation Statistics:
- Paper - Label: 127,623
- Paper - Author: 355,072
- Paper - Conference: 127,632
- Paper - Reference: 392,519
IMDB
- Entity: Movie, Actress, Actor, Director, Plot(movie feature), Genre(movie label)
- Statistics:
- Movie: 14,475
- Plot: 1,000
- Genre: 9
SLAP
- Entity: Gene, Ontology(gene feature), Tissue, Pathway, Diease, Chemical Compound, Family(gene label)
- Statistics:
- Gene: 20,419
- Ontology: 3,000
- Family: 15
Need downstream help?
Pair the dataset with AI analysis and content workflows.
Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.