Back to datasets
Dataset assetOpen Source CommunityBenchmark DatasetImage Harmonization

iHarmony4

iHarmony4 is the first large‑scale image harmonization public benchmark dataset, containing four sub‑datasets: HCOCO, HAdobe5k, HFlickr, and Hday2night. Each sub‑dataset includes synthetic composite images, foreground masks of the composite images, and the corresponding real images.

Source
github
Created
Aug 10, 2019
Updated
May 18, 2024
Signals
231 views
Availability
Linked source ready
Overview

Dataset description and usage context

Dataset Overview

Dataset Name

iHarmony4

Dataset Description

iHarmony4 is the first large‑scale image harmonization dataset, aiming to adjust the appearance of synthetic image foregrounds to be consistent with background regions. The dataset includes four sub‑datasets: HCOCO, HAdobe5k, HFlickr, and Hday2night, each containing synthetic composite images, foreground masks, and corresponding real images.

Sub‑Dataset Details

1. HCOCO

  • Source: Based on the Microsoft COCO dataset.
  • Content: Contains 42 k synthetic composite images, edited using various color transfer methods on the foreground region.
  • Storage: Available at Baidu Cloud (access code: ab5e) and OneDrive.

2. HAdobe5k

  • Source: Based on the MIT‑Adobe FiveK dataset.
  • Content: Foreground regions manually segmented and swapped between two versions.
  • Storage: Available at Baidu Cloud and OneDrive.

3. HFlickr

  • Source: Collected from Flickr (4,833 images).
  • Content: Manually segmented foregrounds, generated using the same method as HCOCO.
  • Storage: Available at Baidu Cloud and OneDrive.

4. Hday2night

  • Source: Based on the day2night dataset.
  • Content: Manually segmented foregrounds, cropped and overlaid onto another image captured at a different time.
  • Storage: Available at Baidu Cloud and OneDrive.

Dataset Statistics

HCOCOHAdobe5kHFlickrHday2nightiHarmony4
Train38,54519,4377,44931165,742
Test4,2832,1608281337,404

Dataset Storage

The iHarmony4 dataset is provided at Baidu Cloud (access code: kqz3) and OneDrive.

Need downstream help?

Pair the dataset with AI analysis and content workflows.

Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.

Explore AI studio