Back to datasets
Dataset assetOpen Source CommunityComputer VisionVisual Question Answering

HuggingFaceM4/A-OKVQA

--- configs: - config_name: default data_files: - split: train path: data/train-* - split: validation path: data/validation-* - split: test path: data/test-* dataset_info: features: - name: image dtype: image - name: question_id dtype: string - name: question dtype: string - name: choices list: string - name: correct_choice_idx dtype: int8 - name: direct_answers dtype: string - name: difficult_direct_answer dtype: bool - name: rationales list: string splits: - name: train num_bytes: 929295572.0 num_examples: 17056 - name: validation num_bytes: 60797340.875 num_examples: 1145 - name: test num_bytes: 338535925.25 num_examples: 6702 download_size: 1323807326 dataset_size: 1328628838.125 --- # Dataset Card for "A-OKVQA" [More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)

Source
hugging_face
Created
Nov 28, 2025
Updated
Feb 8, 2024
Signals
232 views
Availability
Linked source ready
Overview

Dataset description and usage context

Dataset Overview

Configuration

  • Default configuration:
    • Data files:
      • Training set: path data/train-*
      • Validation set: path data/validation-*
      • Test set: path data/test-*

Dataset Information

  • Features:

    • image: data type image
    • question_id: data type string
    • question: data type string
    • options: data type list of strings
    • correct_option_index: data type int8
    • direct_answer: data type string
    • hard_direct_answer: data type bool
    • reasoning: data type list of strings
  • Splits:

    • Training set:
      • Size (bytes): 929,295,572.0
      • Samples: 17,056
    • Validation set:
      • Size (bytes): 60,797,340.875
      • Samples: 1,145
    • Test set:
      • Size (bytes): 338,535,925.25
      • Samples: 6,702
  • Download size: 1,323,807,326 bytes

  • Dataset size: 1,328,628,838.125 bytes

Need downstream help?

Pair the dataset with AI analysis and content workflows.

Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.

Explore AI studio