Multi30k Dataset
Machine Learning, Multilingual Image Description
Multi30k is a multilingual image description dataset, originally English-German, that covers English, German, French, and Czech. It is divided into training, validation, and test sets. The dataset reports statistics including the number of sentences, the word count, and the average number of words per sentence. It also provides download links for visual features and the original images.
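The statistics mentioned above (sentence count, word count, average words per sentence) can be sketched for any list of captions as follows. This is a minimal illustration, assuming whitespace tokenization, which may differ from how the dataset's own figures were computed. The function name `corpus_stats` and the sample captions are hypothetical and are not taken from Multi30k.

```python
def corpus_stats(sentences):
    """Return sentence count, word count, and average words per sentence.

    Assumption: words are whitespace-separated tokens; the dataset's
    published statistics may use a different tokenization.
    """
    # Drop blank lines so they are not counted as sentences.
    cleaned = [s.strip() for s in sentences if s.strip()]
    n_sentences = len(cleaned)
    n_words = sum(len(s.split()) for s in cleaned)
    # Guard against division by zero for an empty input.
    avg = n_words / n_sentences if n_sentences else 0.0
    return {
        "sentences": n_sentences,
        "words": n_words,
        "avg_words_per_sentence": avg,
    }


# Made-up captions for illustration only (not from Multi30k).
sample = [
    "A man rides a bicycle down the street .",
    "Two dogs play in the snow .",
]
stats = corpus_stats(sample)
print(stats)
```

To apply this to a downloaded split, one could read a caption file line by line and pass the lines to `corpus_stats`; the file layout would need to be checked against the actual download.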
Source: GitHub. Updated Nov 22, 2019.