Dataset asset, Open Source Community, Network Analysis, Community Detection
Blogcatalog, Citeseer, Cora, Cornell, Flickr, Pubmed, Texas, UAI2010, Washington, Wisconsin, Email, Wiki, ACM, Amazon, DBLP, IMDB, Cellphone, DBLP, Dynamic_cora, highSchool, Java
The repository contains various types of network datasets, such as complex networks, topological networks, multilayer networks, and dynamic networks, for research in community detection, graph neural networks, and related fields.
Source
GitHub
Created
Jun 23, 2022
Updated
Apr 11, 2024
Signals
127 views
Availability
Linked source ready
Dataset Overview
Dataset Categories
Complex Networks
- Blogcatalog
- Citeseer
- Cora
- Cornell
- Flickr
- Pubmed
- Texas
- UAI2010
- Washington
- Wisconsin
Topological Networks
- Wiki
Multilayer Networks
- ACM
- Amazon
- DBLP
- IMDB
Dynamic Networks
- Cellphone
- DBLP
- Dynamic_cora
- highSchool
- Java
Overlapping Complex Networks
Dataset Format
- name: dataset name
- topo: topology structure, type csr_matrix
- attr: attribute information, type csr_matrix
- label: label information, type csr_matrix, an n×k matrix in which entry (i, j) is 1 if node i belongs to community j and 0 otherwise
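The four fields above can be illustrated with a small in-memory record. This is a minimal sketch, not the repository's loader: the on-disk file format is not stated here, so the record is built by hand as a plain dict, and the toy values, the dict layout, and the helper `communities_from_label` are all assumptions made for illustration. It also assumes that the rows of `label` are nodes and the columns are communities.

```python
import numpy as np
from scipy.sparse import csr_matrix

# Hypothetical 4-node record using the documented field names.
dataset = {
    "name": "toy",
    # topo: n x n adjacency matrix (self-loops on the diagonal).
    "topo": csr_matrix(np.array([
        [1, 1, 0, 0],
        [1, 1, 1, 0],
        [0, 1, 1, 1],
        [0, 0, 1, 1],
    ])),
    # attr: n x d binary attribute matrix.
    "attr": csr_matrix(np.array([
        [1, 0, 1],
        [0, 1, 0],
        [1, 1, 0],
        [0, 0, 1],
    ])),
    # label: n x k membership matrix; a node may belong to several communities.
    "label": csr_matrix(np.array([
        [1, 0],
        [1, 1],
        [0, 1],
        [0, 1],
    ])),
}


def communities_from_label(label):
    """Return one sorted list of node indices per community (one per column)."""
    csc = label.tocsc()       # column-oriented copy, cheap column slicing
    csc.eliminate_zeros()     # drop any explicitly stored zeros
    return [
        sorted(csc.indices[csc.indptr[j]:csc.indptr[j + 1]].tolist())
        for j in range(csc.shape[1])
    ]


communities = communities_from_label(dataset["label"])
print(communities)
```

In this toy record node 1 appears in both lists, which is the overlapping case the n×k layout is meant to represent.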
Example Dataset Filenames
- Fb_X: Facebook X
- mag_chem: Chemistry
- mag_cs: Computer Science
- mag_med: Medicine
- mag_end: Engineering
Dataset Characteristics
- In every dataset, each node has a self-loop.
- Apart from the label matrix, all matrices (topo and attr) contain only 1s and 0s.
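These two characteristics matter in practice because many community detection methods expect a graph without self-loops. The sketch below shows one way to check both properties on a `topo` matrix and to strip the diagonal before further processing. The helper names `is_binary`, `has_all_self_loops`, and `without_self_loops` are hypothetical and not part of the repository, and the toy matrix is made up for illustration.

```python
import numpy as np
from scipy.sparse import csr_matrix


def is_binary(matrix):
    """True if every nonzero entry of the sparse matrix equals 1."""
    m = matrix.copy()
    m.eliminate_zeros()  # ignore explicitly stored zeros
    return bool(np.all(m.data == 1))


def has_all_self_loops(topo):
    """True if every diagonal entry of the adjacency matrix is nonzero."""
    return bool(np.all(topo.diagonal() != 0))


def without_self_loops(topo):
    """Return a CSR copy of topo with the diagonal cleared; topo is unchanged."""
    stripped = topo.tolil(copy=True)  # LIL allows cheap structural edits
    stripped.setdiag(0)
    stripped = stripped.tocsr()
    stripped.eliminate_zeros()
    return stripped


# Hypothetical 4-node path graph with a self-loop on every node.
topo = csr_matrix(np.array([
    [1, 1, 0, 0],
    [1, 1, 1, 0],
    [0, 1, 1, 1],
    [0, 0, 1, 1],
]))

stripped = without_self_loops(topo)
print(is_binary(topo), has_all_self_loops(topo), stripped.nnz)
```

Working on a copy keeps the original matrix intact, so the same record can still be passed to methods that rely on the self-loops being present.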