chiayewken/bamboogle

The Bamboogle dataset contains data for studying the compositionality gap in language models. It includes two features—question and answer—and consists of a test split with 125 examples, totalling 10,747 bytes. The dataset is associated with the paper "Measuring and Narrowing the Compositionality Gap in Language Models" and is released under the MIT License.

Updated 10/27/2023

hugging_face

Description

Dataset Overview

Dataset Information

Features:
- Question: Data type is string.
- Answer: Data type is string.
Splits:
- test: Contains 125 samples, total size 10,747 bytes.
Download Size: 8,383 bytes.
Dataset Size: 10,747 bytes.

Configuration

Config Name: default
- Data Files:
  - Split: test
  - Path: data/test-*

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Please login to view download links and access full dataset details.

Topics

Language Models

Natural Language Processing

Source

Organization: hugging_face

Created: Unknown

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.

Check Prices →