Twitter/TwitterFollowGraph

TwitterFollowGraph is a bipartite directed graph comprising user (consumer) nodes and author (producer) nodes, where edges represent a user's "follow" interaction with an author. Each edge is assigned to a predefined time chunk, denoted by a consecutive ordinal that respects the temporal order of interactions. TwitterFollowGraph contains a total of 261 million edges and 15.5 million vertices, with a maximum degree of 900 000 and a minimum degree of 5. Each record has three fields: user_index, author_index, and time_chunk.

Updated 10/31/2022
hugging_face

Description

Dataset Overview

Dataset Name

  • TwitterFollowGraph

Dataset Description

  • TwitterFollowGraph is a bipartite directed graph with user (consumer) nodes and author (producer) nodes, where each edge represents a user "following" an author. Each edge is assigned to a predefined time chunk, identified by an ordinal; the ordinals are consecutive and follow the chronological order of the interactions.
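
Because the time-chunk ordinals follow chronological order, one natural use is a temporal train/test split. The following is a minimal sketch under stated assumptions: the `FollowEdge` type, the helper names, and the toy edges are hypothetical and not part of the dataset or its tooling.

```python
from collections import defaultdict
from typing import NamedTuple


class FollowEdge(NamedTuple):
    """One directed edge: user -> author, tagged with its time chunk (hypothetical type)."""
    user_index: int
    author_index: int
    time_chunk: int


def group_by_chunk(edges):
    """Group edges by time chunk, with chunks returned in ascending (chronological) order."""
    by_chunk = defaultdict(list)
    for edge in edges:
        by_chunk[edge.time_chunk].append(edge)
    return dict(sorted(by_chunk.items()))


def temporal_split(edges, first_test_chunk):
    """Put edges from chunks before `first_test_chunk` in train, the rest in test."""
    train = [e for e in edges if e.time_chunk < first_test_chunk]
    test = [e for e in edges if e.time_chunk >= first_test_chunk]
    return train, test


# Toy edges, invented for illustration only.
edges = [
    FollowEdge(0, 10, 0),
    FollowEdge(1, 10, 0),
    FollowEdge(0, 11, 1),
    FollowEdge(2, 12, 2),
]
chunks = group_by_chunk(edges)
train, test = temporal_split(edges, first_test_chunk=2)
```

Splitting on the chunk ordinal rather than at random keeps every test edge later in time than every training edge, which avoids leaking future follows into training.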

Dataset Scale

  • The graph contains 261 million edges and 15.5 million nodes.
  • Maximum degree: 900 000, minimum degree: 5.
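
Scale figures like these can be recomputed from an edge list. The sketch below assumes that user and author indices live in separate index spaces (as the bipartite structure suggests) and that a node's degree is its number of incident edges; both are assumptions, and `degree_stats` is a hypothetical helper.

```python
from collections import Counter


def degree_stats(edges):
    """Compute edge count, node count, and min/max degree from (user, author, chunk) triples.

    Assumption: user and author indices are independent index spaces,
    so user 0 and author 0 are counted as two distinct nodes.
    """
    user_degree = Counter()    # out-degree of each user node
    author_degree = Counter()  # in-degree of each author node
    num_edges = 0
    for user, author, _chunk in edges:
        user_degree[user] += 1
        author_degree[author] += 1
        num_edges += 1
    degrees = list(user_degree.values()) + list(author_degree.values())
    return {
        "num_edges": num_edges,
        "num_nodes": len(user_degree) + len(author_degree),
        "max_degree": max(degrees, default=0),
        "min_degree": min(degrees, default=0),
    }


# Toy edges, invented for illustration only.
stats = degree_stats([(0, 0, 0), (0, 1, 0), (1, 0, 1)])
```

Nodes with no edges never appear in an edge list, so the minimum computed this way is over nodes that have at least one edge.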

Data Format

| Field | Description |
| --- | --- |
| user_index | User index |
| author_index | Author index |
| time_chunk | Time-chunk ordinal |
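
As a rough illustration of reading records with these three fields, the sketch below parses comma-separated text with a header row. The on-disk format of the dataset is not stated here, so the CSV layout, the `read_edges` helper, and the sample rows are all assumptions.

```python
import csv
import io

FIELDS = ("user_index", "author_index", "time_chunk")


def read_edges(handle):
    """Yield (user_index, author_index, time_chunk) integer triples from a CSV stream.

    Assumption: the stream is comma-separated and starts with a header row
    naming the three fields; the real files may be laid out differently.
    """
    reader = csv.DictReader(handle)
    missing = set(FIELDS) - set(reader.fieldnames or [])
    if missing:
        raise ValueError(f"missing columns: {sorted(missing)}")
    for row in reader:
        yield tuple(int(row[name]) for name in FIELDS)


# Sample rows, invented for illustration only.
sample = io.StringIO(
    "user_index,author_index,time_chunk\n"
    "0,10,0\n"
    "1,10,0\n"
    "0,11,1\n"
)
rows = list(read_edges(sample))
```

Reading lazily with a generator matters at this scale: 261 million edges will not fit comfortably in a Python list of tuples, so a real pipeline would stream or use a columnar reader instead.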

License

  • This dataset is released under the Creative Commons Attribution 4.0 International License.

Citation Information

@article{el2022knn,
  title={kNN-Embed: Locally Smoothed Embedding Mixtures For Multi-interest Candidate Retrieval},
  author={El-Kishky, Ahmed and Markovich, Thomas and Leung, Kenny and Portman, Frank and Haghighi, Aria and Xiao, Ying},
  journal={arXiv preprint arXiv:2205.06205},
  year={2022}
}

Topics

Social Media Analysis
User Behavior

Source

Organization: hugging_face

Created: Unknown
