ASAG2024

ASAG2024 is a short‑answer grading benchmark dataset created by the Zurich University of Applied Sciences. It combines seven commonly used short‑answer grading datasets into one standardized format, totaling 19,000 question‑answer‑score triples across multiple subjects and education levels. Scores are normalized to the range 0 to 1 so that results can be compared across the source datasets. The dataset is primarily used to evaluate and compare automated grading systems, with the aim of addressing automation and generalizability challenges in short‑answer grading.
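The listing does not say how scores were mapped to the 0 to 1 range. As a minimal sketch, assuming simple min-max scaling over each source dataset's grading scale, the normalization could look like this. The function name and its parameters are hypothetical and are not taken from the ASAG2024 code.

```python
def normalize_score(raw: float, min_score: float, max_score: float) -> float:
    """Map a raw grade onto [0, 1] with min-max scaling.

    Assumption: each source dataset has a known grading scale
    [min_score, max_score]. The real ASAG2024 procedure may differ.
    """
    if max_score <= min_score:
        raise ValueError("max_score must be greater than min_score")
    if not min_score <= raw <= max_score:
        raise ValueError("raw score lies outside the grading scale")
    return (raw - min_score) / (max_score - min_score)


# Example: a grade of 3.5 on a 0-5 scale and a grade of 2 on a 0-2 scale
print(normalize_score(3.5, 0, 5))
print(normalize_score(2, 0, 2))
```

With a shared 0 to 1 scale, a half-credit answer has the same numeric meaning whether the source dataset graded out of 2 or out of 5.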

Updated 9/27/2024
arXiv

Description

ASAG2024 Dataset Overview

Dataset Description

  • Name: ASAG2024
  • Tags: ASAG, Grading
  • Size: 10K < n < 100K
  • Language: English
  • Creator: Gérôme Meyer
  • License: Data source license applies (see below)

Data Source

The dataset was collected from the following source:

Dataset Content

The dataset contains the following elements:

  • Questions
  • Reference answers
  • Student answers
  • Human scores
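The four elements listed above can be sketched as a single record type, together with one way to compare an automated grader against the human scores. The field names are hypothetical (the listing does not give the actual column names), and root-mean-square error is an assumed metric chosen for illustration, not necessarily the one used by the dataset authors.

```python
from dataclasses import dataclass
from math import sqrt
from typing import Sequence


@dataclass(frozen=True)
class GradedAnswer:
    # Hypothetical field names; the real dataset columns may differ.
    question: str
    reference_answer: str
    student_answer: str
    human_score: float  # normalized to [0, 1]


def rmse(records: Sequence[GradedAnswer], predicted: Sequence[float]) -> float:
    """Root-mean-square error between predicted and human scores."""
    if not records or len(records) != len(predicted):
        raise ValueError("records and predicted must be non-empty and equal in length")
    squared = [(r.human_score - p) ** 2 for r, p in zip(records, predicted)]
    return sqrt(sum(squared) / len(squared))


# Two made-up records, used only to show the call shape.
sample = [
    GradedAnswer("What is 2 + 2?", "4", "4", 1.0),
    GradedAnswer("What is 2 + 2?", "4", "5", 0.0),
]
print(rmse(sample, [0.5, 0.5]))
```

Because human scores share the 0 to 1 range, an error value computed this way is comparable across the source datasets.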

Dataset Authors

  • Gérôme Meyer
  • Philip Breuer

Contact Information

News

  • [May 12 2024] ⏰ Reorganizing code, materials and dataset.
  • [Nov 26 2024] 🎉 Paper published on arXiv.

Topics

Automated Scoring
Educational Assessment

Source

Organization: arXiv

Created: 9/27/2024
