Back to datasets
Dataset assetOpen Source CommunityNatural Language ProcessingLinguistics
CreativeLang/vua20_metaphor
VUA20 is a metaphor detection dataset, likely the largest used in the FigLang2020 workshop. The dataset comprises 200 k instances and was created in 2020. Annotation methodology is detailed in the MIP paper.
Source
hugging_face
Created
Nov 28, 2025
Updated
Jun 27, 2023
Signals
189 views
Availability
Linked source ready
Overview
Dataset description and usage context
VUA20 Dataset Overview
Dataset Description
Dataset Summary
- Type: Metaphor
- Task Type: Detection
- Size: 200 k
- Creation Year: 2020
VUA20 is likely the largest metaphor detection dataset used in the FigLang2020 workshop.
Citation Information
If you find this dataset useful, please cite:
@inproceedings{Leong2020ARO,
title={A Report on the 2020 VUA and TOEFL Metaphor Detection Shared Task},
author={Chee Wee Leong and Beata Beigman Klebanov and Chris Hamill and Egon W. Stemle and Rutuja Ubale and Xianyang Chen},
booktitle={FIGLANG},
year={2020}
}
Need downstream help?
Pair the dataset with AI analysis and content workflows.
Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.