Explore high-quality datasets for your AI and machine learning projects.
WinoGrande is a dataset containing 44,000 problems, inspired by the Winograd Schema Challenge, but scaled up and adjusted to improve robustness against dataset‑specific biases. The task is cloze, providing two options; the goal is to select the correct option for the given sentence, requiring common‑sense reasoning.
The LAMBADA dataset is used to evaluate computational models' text‑understanding ability, specifically testing whether a model can handle long‑range dependencies via a word‑prediction task. The dataset consists of narrative passages extracted from BookCorpus, split into development and test sets, with training data covering the full text of 2,662 novels. Its structure includes text and label fields, and it is partitioned into training, development, and test sets. The dataset was created to assess whether language models can retain long‑term contextual memory. Annotation involved paid crowdworkers ensuring that the target word could only be guessed by reading the entire passage. The language is English and the license is CC BY 4.0.