Back to datasets
Dataset assetOpen Source CommunitySocial Media AnalysisRumor Detection

bigIR/AuFIN

This is an Arabic dataset for authoritative user search on Twitter. The dataset provides the top five users retrieved by the BM25 lexical retrieval model, where the query is a rumor text and the document collection consists of user documents. Each user document is constructed by concatenating the translated profile name and description, along with all translated Twitter list names and descriptions.

Source
hugging_face
Created
Nov 28, 2025
Updated
Mar 8, 2024
Signals
83 views
Availability
Linked source ready
Overview

Dataset description and usage context

Dataset Overview

Dataset Name

AuFIN

Language

Arabic

Dataset Description

AuFIN is an Arabic dataset for authoritative user discovery on Twitter. The dataset includes the top five users retrieved by the BM25 lexical retrieval model, where the query is a rumor text and the document collection consists of user documents. Each user document is formed by concatenating the translated profile name and description, as well as all translated Twitter list names and descriptions.

Dataset Links

Related Paper

The related work for this dataset was published in the journal Information Processing & Management under the title “Who can verify this? Finding authorities for rumor verification in Twitter”.

Need downstream help?

Pair the dataset with AI analysis and content workflows.

Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.

Explore AI studio