difficult_retrieval
This dataset is used for the paper “Hyper‑multi‑step: The Truth Behind Difficult Long‑context Tasks”. It contains a series of long‑context retrieval tasks, divided into simple and difficult categories. Simple tasks include direct key‑to‑value and multi‑step key‑value retrieval; difficult tasks involve logic‑based retrieval, multi‑match retrieval, etc. Each task has a filename indicating the task type and context size. Columns include the full prompt, gold keys (correct answer keys), and gold values (correct answer values).
Description
Difficult Long‑context Retrieval Tasks Dataset
Dataset Overview
The dataset supports the paper "Hyper‑multi‑step: The Truth Behind Difficult Long‑context Tasks" and contains a series of long‑context retrieval tasks.
Task Types
Simple Tasks
- simple_k2v: Direct key‑to‑value retrieval. Given a key, the model must retrieve the corresponding value.
- simple_v2k: Direct value‑to‑key retrieval. Given a value, the model must retrieve the corresponding key.
- multi_step(kv): Multi‑step (formal) KV retrieval. The model retrieves multiple values through several queries, concatenates them to form a new key, and finally retrieves the associated value.
Difficult Tasks
- logic(kv): Logic‑based KV retrieval. All values range from 0‑9. Given a value range, the model must retrieve the matching keys.
- logic(resume): Logic‑based resume retrieval. Given a GPA range, the model must retrieve students whose GPA falls within that range.
- multi_match(kv): Multi‑match KV retrieval. Given a value, the model must retrieve multiple corresponding keys.
- multi_match(resume): Multi‑match resume retrieval. Given a university name, the model must retrieve multiple students from that university.
- multi_match_last(kv): Multi‑match KV retrieval where the last key is omitted; all other gold keys are provided in the prompt.
File Naming Conventions
- logic_kv_10: Logic‑based KV retrieval task with a context containing 10 KV pairs.
- 3_match_resume_100: Multi‑match resume retrieval with a context of 100 students, requiring the model to retrieve 3 students.
- concat_3_kv_100_cot: Multi‑step KV retrieval with a context of 100 KV pairs; the model must retrieve 3 values via 3 queries and concatenate them. The prompt follows a Chain‑of‑Thought (CoT) style.
Dataset Columns
- prompt: The complete task prompt.
- gold_keys: Gold keys for KV retrieval tasks. A single string if only one gold key; otherwise a list of strings. For resume tasks, this corresponds to student names (or a list).
- gold_values: Gold values for KV retrieval tasks. A single string if only one gold value; otherwise a list. For resume tasks, this corresponds to GPAs or university names (or a list).
Note: In logic‑based and multi‑match tasks, gold_keys actually represent the answer contained in the prompt.
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Please login to view download links and access full dataset details.
Topics
Source
Organization: huggingface
Created: 10/14/2024
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.