Dataset asset, Open Source Community, Computer Vision, Visual Question Answering

HuggingFaceM4/A-OKVQA

---
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
  - split: validation
    path: data/validation-*
  - split: test
    path: data/test-*
dataset_info:
  features:
  - name: image
    dtype: image
  - name: question_id
    dtype: string
  - name: question
    dtype: string
  - name: choices
    list: string
  - name: correct_choice_idx
    dtype: int8
  - name: direct_answers
    dtype: string
  - name: difficult_direct_answer
    dtype: bool
  - name: rationales
    list: string
  splits:
  - name: train
    num_bytes: 929295572.0
    num_examples: 17056
  - name: validation
    num_bytes: 60797340.875
    num_examples: 1145
  - name: test
    num_bytes: 338535925.25
    num_examples: 6702
  download_size: 1323807326
  dataset_size: 1328628838.125
---
# Dataset Card for "A-OKVQA"

[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)

Source

hugging_face

Created

Nov 28, 2025

Updated

Feb 8, 2024

Overview

Dataset description and usage context

Dataset Overview

Configuration

Default configuration:
- Data files:
  - Training set: path data/train-*
  - Validation set: path data/validation-*
  - Test set: path data/test-*
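
The mapping from glob patterns to splits can be sketched as follows. This is a minimal illustration that approximates the matching with Python's standard `fnmatch` module; the pattern resolver used by the Hugging Face `datasets` library may differ in detail. The shard filenames and the helper `split_for_path` are hypothetical and serve only to exercise the three patterns listed above.

```python
from fnmatch import fnmatch
from typing import Optional

# Glob patterns copied from the default configuration.
SPLIT_PATTERNS = {
    "train": "data/train-*",
    "validation": "data/validation-*",
    "test": "data/test-*",
}


def split_for_path(path: str) -> Optional[str]:
    """Return the split whose pattern matches `path`, or None if none matches."""
    for split, pattern in SPLIT_PATTERNS.items():
        if fnmatch(path, pattern):
            return split
    return None


# Hypothetical shard names (assumption: the real filenames are not given
# in the card, only the patterns are).
example_paths = [
    "data/train-00000-of-00002.parquet",
    "data/validation-00000-of-00001.parquet",
    "data/test-00000-of-00001.parquet",
    "README.md",
]
assignments = {path: split_for_path(path) for path in example_paths}

for path, split in assignments.items():
    print(f"{path} -> {split}")
```

Files that match none of the patterns, such as the card's own README, are not assigned to any split.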

Dataset Information

Features:
- image: data type image
- question_id: data type string
- question: data type string
- choices: data type list of strings
- correct_choice_idx: data type int8
- direct_answers: data type string
- difficult_direct_answer: data type bool
- rationales: data type list of strings
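
To show how these fields fit together, here is a small sketch that turns one record into a multiple-choice prompt and looks up the correct answer. It uses the field names from the card's `dataset_info` metadata (`question`, `choices`, `correct_choice_idx`, and so on). The sample record is invented for illustration and is not taken from the dataset; the helpers `format_multiple_choice` and `correct_choice` are hypothetical names, and the image field is omitted to keep the example self-contained.

```python
from typing import Any, Dict

# Hypothetical record following the card's schema. All values are invented.
sample: Dict[str, Any] = {
    "question_id": "example-0001",
    "question": "What is the person holding?",
    "choices": ["umbrella", "kite", "bat", "phone"],
    "correct_choice_idx": 0,
    # The card declares this field as a single string; how multiple
    # answers are encoded inside it is not described there.
    "direct_answers": "umbrella",
    "difficult_direct_answer": False,
    "rationales": ["The person is shielding themselves from the rain."],
}


def format_multiple_choice(record: Dict[str, Any]) -> str:
    """Render the question followed by lettered answer choices (A, B, C, ...)."""
    lines = [f"Question: {record['question']}"]
    for i, choice in enumerate(record["choices"]):
        lines.append(f"{chr(ord('A') + i)}. {choice}")
    return "\n".join(lines)


def correct_choice(record: Dict[str, Any]) -> str:
    """Look up the answer text that `correct_choice_idx` points at."""
    return record["choices"][record["correct_choice_idx"]]


print(format_multiple_choice(sample))
print("Answer:", correct_choice(sample))
```

With the Hugging Face `datasets` library installed, records of this shape would be expected from a call such as `load_dataset("HuggingFaceM4/A-OKVQA", split="validation")`, with the addition of the `image` field.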
Splits:
- Training set:
  - Size (bytes): 929,295,572.0
  - Samples: 17,056
- Validation set:
  - Size (bytes): 60,797,340.875
  - Samples: 1,145
- Test set:
  - Size (bytes): 338,535,925.25
  - Samples: 6,702
Download size: 1,323,807,326 bytes
Dataset size: 1,328,628,838.125 bytes
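
As a quick consistency check, the per-split figures can be summed and compared against the reported totals. The sketch below uses only the numbers listed above; the variable names are illustrative. The three splits add up to 24,903 samples, and their byte sizes add up to exactly the reported dataset size.

```python
# Per-split figures copied from the card metadata.
SPLITS = {
    "train": {"num_bytes": 929295572.0, "num_examples": 17056},
    "validation": {"num_bytes": 60797340.875, "num_examples": 1145},
    "test": {"num_bytes": 338535925.25, "num_examples": 6702},
}
DATASET_SIZE = 1328628838.125  # reported dataset size in bytes
DOWNLOAD_SIZE = 1323807326     # reported download size in bytes

total_examples = sum(s["num_examples"] for s in SPLITS.values())
total_bytes = sum(s["num_bytes"] for s in SPLITS.values())

# Fraction of all samples that falls into each split.
shares = {
    name: s["num_examples"] / total_examples for name, s in SPLITS.items()
}

print("total samples:", total_examples)
print("split bytes match dataset size:", total_bytes == DATASET_SIZE)
for name, share in shares.items():
    print(f"{name}: {share:.1%} of samples")
```

The download size is slightly smaller than the dataset size, which is what one would expect if the hosted files are stored in a compressed format; the card itself does not state the reason.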
