bigIR/AuFIN
This is an Arabic dataset for authoritative user search on Twitter. The dataset provides the top five users retrieved by the BM25 lexical retrieval model, where the query is a rumor text and the document collection consists of user documents. Each user document is constructed by concatenating the translated profile name and description, along with all translated Twitter list names and descriptions.
Dataset description and usage context
Dataset Overview
Dataset Name
AuFIN
Language
Arabic
Dataset Description
AuFIN is an Arabic dataset for authoritative user discovery on Twitter. The dataset includes the top five users retrieved by the BM25 lexical retrieval model, where the query is a rumor text and the document collection consists of user documents. Each user document is formed by concatenating the translated profile name and description, as well as all translated Twitter list names and descriptions.
Dataset Links
Related Paper
The related work for this dataset was published in the journal Information Processing & Management under the title “Who can verify this? Finding authorities for rumor verification in Twitter”.
Pair the dataset with AI analysis and content workflows.
Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.