Dataset asset | Open Source Community | Natural Language Processing | Image Difference Recognition

spot-the-diff

This dataset is used for learning to describe the differences between pairs of similar images. Each sample has a string identifier (img_id), three image features (img_a, img_b, img_diff), and one sentence sequence feature (sentences). The dataset is split into training, test, and validation sets with 9,524, 1,404, and 1,634 samples respectively.

Source: Hugging Face
Created: Dec 19, 2024
Updated: Dec 19, 2024
Dataset Overview

Dataset Information

  • Features:

    • img_id: String type, unique identifier for the image.
    • img_a: Image type, first image.
    • img_b: Image type, second image.
    • img_diff: Image type, difference image.
    • sentences: Sequence of strings, sentences describing the differences.
  • Dataset Splits:

    • train: Training set, 9,524 samples, size 1,904,363,199.892 bytes.
    • test: Test set, 1,404 samples, size 268,451,640.804 bytes.
    • val: Validation set, 1,634 samples, size 308,229,248.356 bytes.
  • Dataset Size:

    • Download size: 2,292,419,742 bytes
    • Total size: 2,481,044,089.052 bytes
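
As a quick consistency check on the figures above, the following minimal sketch models one record and sums the per-split counts and sizes. The class name DiffExample and the helper functions are hypothetical names introduced for illustration, and the image fields are typed loosely because the concrete image type depends on whichever loader is used.

```python
from dataclasses import dataclass
from decimal import Decimal
from typing import Any, Dict, List


@dataclass
class DiffExample:
    """One record, mirroring the five listed features (hypothetical class)."""
    img_id: str
    img_a: Any        # first image; concrete type depends on the loader
    img_b: Any        # second image
    img_diff: Any     # difference image
    sentences: List[str]


# Sample counts and byte sizes copied from the split listing above.
# Decimal keeps the fractional byte values exact when summed.
SPLITS = {
    "train": (9524, Decimal("1904363199.892")),
    "test": (1404, Decimal("268451640.804")),
    "val": (1634, Decimal("308229248.356")),
}


def total_samples() -> int:
    """Sum the sample counts over all splits."""
    return sum(count for count, _ in SPLITS.values())


def total_bytes() -> Decimal:
    """Sum the reported byte sizes over all splits."""
    return sum((size for _, size in SPLITS.values()), Decimal(0))


def split_fractions() -> Dict[str, float]:
    """Share of all samples that falls in each split."""
    total = total_samples()
    return {name: count / total for name, (count, _) in SPLITS.items()}


if __name__ == "__main__":
    print(total_samples())   # 12562
    print(total_bytes())     # 2481044089.052
    for name, fraction in split_fractions().items():
        print(f"{name}: {fraction:.1%}")
```

The summed byte sizes agree with the listed total size of 2,481,044,089.052 bytes, and the three splits together hold 12,562 samples.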

Configuration

  • Configuration Name: default
    • Data File Paths:
      • Training: data/train-*
      • Testing: data/test-*
      • Validation: data/val-*
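
The path patterns above are globs, so each split is made up of every file whose name matches its pattern. A minimal sketch of that matching, using only the Python standard library, might look like the following. The helper split_for and the shard file names are hypothetical and chosen only for illustration; the listing does not state the actual file names or format.

```python
from fnmatch import fnmatchcase
from typing import Optional

# Glob patterns copied from the "default" configuration.
DATA_FILES = {
    "train": "data/train-*",
    "test": "data/test-*",
    "val": "data/val-*",
}


def split_for(path: str) -> Optional[str]:
    """Return the split whose pattern matches `path`, or None if none does."""
    for split, pattern in DATA_FILES.items():
        if fnmatchcase(path, pattern):
            return split
    return None


if __name__ == "__main__":
    # Hypothetical shard names, for illustration only.
    for name in [
        "data/train-00000-of-00004.parquet",
        "data/val-00000-of-00001.parquet",
        "README.md",
    ]:
        print(name, "->", split_for(name))
```

A dictionary of this shape (split name mapped to a path pattern) is also the form that the data_files argument of datasets.load_dataset accepts, should you load the files through the Hugging Face datasets library.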

Original Dataset

  • Source: https://github.com/harsh19/spot-the-diff/

Reference

@inproceedings{jhamtani2018learning,
  title={Learning to Describe Differences Between Pairs of Similar Images},
  author={Jhamtani, Harsh and Berg-Kirkpatrick, Taylor},
  booktitle={Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP)},
  year={2018}
}
