Back to datasets
Dataset assetOpen Source CommunityEducational TechnologyHandwritten Text Erasure

SCUT-EnsExam

SCUT-EnsExam is a real‑world handwritten text erasure dataset designed for exam paper scenarios, containing 545 exam images. The dataset is randomly split into a training set of 430 images and a test set of 115 images.

Source
github
Created
Apr 27, 2023
Updated
Dec 5, 2023
Signals
427 views
Availability
Linked source ready
Overview

Dataset description and usage context

SCUT-EnsExam Dataset Overview

Dataset Description

SCUT-EnsExam is a real‑world handwritten text erasure dataset for exam paper scenarios, containing 545 exam images. The dataset is randomly split into a training set of 430 images and a test set of 115 images.

Dataset Download

The dataset can be downloaded via the following links:

Usage License

The SCUT-EnsExam dataset may be used only for non‑commercial research purposes and must comply with the Creative Attribution‑NonCommercial‑NoDerivatives 4.0 International (CC BY‑NC‑ND 4.0) License.

Directory Structure

The dataset directory structure is as follows:

├── SCUT-EnsExam ├── train │ ├── all_images │ ├── all_labels │ └── quad_annotation ├── test ├── all_images ├── all_labels └── quad_annotation

Citation and Contact

When using this dataset, please cite the following paper:

@InProceedings{ author = {Huang, Liufeng and Chen, Bangdong and Liu, Chongyu and Peng, Dezhi and Zhou, Weiying and Wu, Yaqiang and Li, Hui and Ni, Hao and Jin, Lianwen}, title = {EnsExam: A Dataset for Handwritten Text Erasure on Examination Papers.}, booktitle = {Document Analysis and Recognition – ICDAR 2023}, month = {August}, year = {2023}, pages = {470–485} }

For any questions, please contact the authors:

Need downstream help?

Pair the dataset with AI analysis and content workflows.

Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.

Explore AI studio