Back to datasets
Dataset assetOpen Source CommunityHeterogeneous Graph AnalysisMulti-domain Data

ACM-1, ACM-2, ACM-3, MovieLens, Douban Movie, Douban Book, Amazon, LastFM

These datasets include the ACM series, MovieLens, Douban Movie, Douban Book, Amazon, and LastFM; each provides detailed entity and relationship statistics for heterogeneous graph analysis.

Source
github
Created
Sep 12, 2019
Updated
May 6, 2024
Signals
230 views
Availability
Linked source ready
Overview

Dataset description and usage context

ACM-1

  • Entity: Paper, Author, Conf, Term (paper feature), Index(paper label)
  • Statistics:
    • Paper: 12,500
    • Term: 300
    • Index: 11

ACM-2

  • Entity: Paper, Author, Subject, Term (paper feature), Research area(paper label)
  • Statistics:
    • Paper: 3,025
    • Author: 5,835
    • Subject: 56
    • Term: 1,830
    • Research area: 3

ACM-3

  • Entity: Paper, Author, Affiliations, Term, Subjects
  • Statistics:
    • Paper: 12,000
    • Author: 17,000
    • Affiliations: 1,800
    • Term: 1,500
    • Subjects: 73

MovieLens

  • Entity: User, Age, Occupation, Movie, Genre
  • Statistics:
    • User: 943
    • Age: 8
    • Occupation: 21
    • Movie: 1,682
    • Genre: 18
  • Relation Statistics:
    • User - Movie: 100,000
    • User - User (KNN): 47,150
    • User - Age: 943
    • User - Occupation: 943
    • Movie - Movie (KNN): 82,798
    • Movie - Genre: 2,861

Douban Movie

  • Entity: User, Movie, Group, Actor, Director, Type
  • Statistics:
    • User: 13,367
    • Movie: 12,677
    • Group: 2,753
    • Actor: 6,311
    • Director: 2,449
    • Type: 38
  • Relation Statistics:
    • User - Movie: 1,068,278
    • User - Group: 570,047
    • User - User: 4,085
    • Movie - Actor: 33,587
    • Movie - Director: 11,276
    • Movie - Type: 27,668

Douban Book

  • Entity: User, Book, Group, Location, Author, Publisher, Year
  • Statistics:
    • User: 13,024
    • Book: 22,347
    • Group: 2,936
    • Location: 38
    • Author: 10,805
    • Publisher: 1,815
    • Year: 64
  • Relation Statistics:
    • User - Book: 792,062
    • User - Group: 1,189,271
    • User - User: 169,150
    • User - Location: 10,592
    • Book - Author: 21,907
    • Book - Publisher: 21,773
    • Book - Year: 21,192

Amazon

  • Entity: User, Item, View, Category, Brand
  • Statistics:
    • User: 6,170
    • Item: 2,753
    • View: 3,857
    • Category: 22
    • Brand: 334
  • Relation Statistics:
    • User - Item: 195,791
    • Item - View: 5,694
    • Item - Category: 5,508
    • Item - Brand: 2,753

LastFM

  • Entity: User, Artist, Tag
  • Statistics:
    • User: 1,892
    • Artist: 17,632
    • Tag: 11,945
  • Relation Statistics:
    • User - Artist: 92,834
    • User - User (Original): 25,434
    • User - User (KNN): 18,802
    • Artist - Artist (KNN): 153,399
    • Artist - Tag: 184,941

Yelp

  • Entity: User, Business, Compliment, Category, City
  • Statistics:
    • User: 16,239
    • Business: 14,284
    • Compliment: 11
    • Category: 47
    • City: 511
  • Relation Statistics:
    • User - Business: 198,397
    • User - User: 158,590
    • User - Compliment: 76,875
    • Business - City: 14,267
    • Business - Category: 40,009

Yelp-2

  • Entity: User, Business, Service, Star level, Reservation, Category
  • Statistics:
    • User: 1,286
    • Business: 2,614
    • Service: 2
    • Star level: 9
    • Reservation: 2
    • Category: 3
  • Relation Statistics:
    • User - Business: 30,838
    • Business - Service: 2,614
    • Business - Star level: 2,614
    • Business - Reservation: 2,614
    • Business - Category: 2,614

DBLP-1

  • Entity: Author, Paper, Author_label, Conference, Type
  • Statistics:
    • Author: 14,475
    • Paper: 14,376
    • Author_label: 4
    • Conference: 20
    • Type: 8,920
  • Relation Statistics:
    • Author - Label: 4,057
    • Paper - Author: 41,794
    • Paper - Conference: 14,376
    • Paper - Type: 114,624

DBLP-2

  • Entity: Paper, Author, Conf, Term, Profile(author feature), Research area(author label)
  • Statistics:
    • Paper: 14,328
    • Author: 4,057
    • Conf: 20
    • Term: 8,789
    • Profile: 334
    • Research area: 4

Aminer

  • Entity: Author, Paper, Papel_label, Conference, Reference
  • Statistics:
    • Author: 164,472
    • Paper: 127,623
    • Papel_label: 10
    • Conference: 101
    • Reference: 147,251
  • Relation Statistics:
    • Paper - Label: 127,623
    • Paper - Author: 355,072
    • Paper - Conference: 127,632
    • Paper - Reference: 392,519

IMDB

  • Entity: Movie, Actress, Actor, Director, Plot(movie feature), Genre(movie label)
  • Statistics:
    • Movie: 14,475
    • Plot: 1,000
    • Genre: 9

SLAP

  • Entity: Gene, Ontology(gene feature), Tissue, Pathway, Diease, Chemical Compound, Family(gene label)
  • Statistics:
    • Gene: 20,419
    • Ontology: 3,000
    • Family: 15
Need downstream help?

Pair the dataset with AI analysis and content workflows.

Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.

Explore AI studio