Back to datasets
Dataset assetOpen Source CommunityNetwork AnalysisCommunity Detection

Blogcatalog, Citeseer, Cora, Cornell, Flickr, Pubmed, Texas, UAI2010, Washington, Wisconsin, Email, Wiki, ACM, Amazon, DBLP, IMDB, Cellphone, DBLP, Dynamic_cora, highSchool, Java

The repository contains various types of network datasets, such as complex networks, topological networks, multilayer networks, and dynamic networks, for research in community detection, graph neural networks, and related fields.

Source
github
Created
Jun 23, 2022
Updated
Apr 11, 2024
Signals
127 views
Availability
Linked source ready
Overview

Dataset description and usage context

Dataset Overview

Dataset Categories

Complex Networks

  • Blogcatalog
  • Citeseer
  • Cora
  • Cornell
  • Flickr
  • Pubmed
  • Texas
  • UAI2010
  • Washington
  • Wisconsin

Topological Networks

  • Email
  • Wiki

Multilayer Networks

  • ACM
  • Amazon
  • DBLP
  • IMDB

Dynamic Networks

  • Cellphone
  • DBLP
  • Dynamic_cora
  • highSchool
  • Java

Overlapping Complex Networks

Dataset Format

  • name: dataset name
  • topo: topology structure, type csr_matrix
  • attr: attribute information, type csr_matrix
  • label: label information, type csr_matrix, an n×k matrix where 1 indicates membership in the community, otherwise 0

Example Dataset Filenames

  • Fb_X: Facebook X
  • mag_chem: Chemistry
  • mag_cs: Computer Science
  • mag_med: Medicine
  • mag_end: Engineering

Dataset Characteristics

  1. All datasets contain self‑loops for nodes.
  2. Apart from the label matrix, all other matrices contain only 1s or 0s.
Need downstream help?

Pair the dataset with AI analysis and content workflows.

Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.

Explore AI studio