JUHE API Marketplace
DATASET
Open Source Community

Intentonomy

Intentonomy is a dataset of 14,455 images created jointly by Cornell University and Facebook AI to understand and analyze human intent behind social‑media images. The images span everyday scenarios and are manually annotated with 28 intent categories using a psychology‑based taxonomy. Labels were collected via a novel “purpose game” on Amazon Mechanical Turk. The dataset supports tasks such as fake‑news detection and improving vision systems’ understanding of human intent.

Updated 3/28/2021
arXiv

Description

Intentonomy Dataset Overview

Dataset Introduction

Dataset Download

  • Name: Intentonomy
  • Content: 14 K images manually labeled with 28 intent categories, organized in a hierarchical structure by psychology experts.
  • Download: See DATA.md.

Annotation Method

  • Method: “Purpose Game” approach, gathering intent annotations via Amazon Mechanical Turk.
  • Details: See Appendix C of the paper.

Research Content

Relationship Between Image Content and Human Intent

  • Goal: Explore the subtle link between visual content and intent.
  • Findings:
    1. Different intent categories rely on distinct objects and scenes for recognition.
    2. For categories with large intra‑class variation, visual content offers limited performance gains.
    3. Focusing on relevant objects and scene categories positively influences intent recognition.

Intent Recognition Baselines

  • Framework: Introduces weakly‑supervised localization and auxiliary label modeling to narrow the gap between human and machine image understanding.
  • Implementation: Provides the localization loss in loc_loss.py; download image masks and set MASK_ROOT accordingly.
  • Dependencies: Requires cv2 and pycocotools.

Intent Category Sub‑division

  • Basis:
    1. Content Dependency: Object‑dependent (O‑classes), Context‑dependent (C‑classes), and Others.
    2. Difficulty: Classified as “Easy”, “Medium”, and “Hard” based on the performance gap between visual models and random chance.
  • Details: See Appendix A of the paper.

Baseline Results

Validation Set

ModelMacro F1Micro F1Samples F1
VISUAL23.03 ± 0.7931.36 ± 1.1629.91 ± 1.73
VISUAL + $L_{loc}$24.42 ± 0.9532.87 ± 1.1332.46 ± 1.18
VISUAL + $L_{loc}$ + HT25.07 ± 0.5232.94 ± 1.1633.61 ± 0.92

Test Set

ModelMacro F1Micro F1Samples F1
VISUAL22.77 ± 0.5930.23 ± 0.7328.45 ± 1.71
VISUAL + $L_{loc}$24.37 ± 0.6532.07 ± 0.8430.91 ± 1.27
VISUAL + $L_{loc}$ + HT23.98 ± 0.8531.28 ± 0.3631.39 ± 0.78

Validation Sub‑division (by Content Dependency)

ModelObjectContextOther
VISUAL25.58 ± 2.5130.16 ± 2.9721.34 ± 0.74
VISUAL + $L_{loc}$28.15 ± 1.9428.62 ± 2.1322.60 ± 1.40
VISUAL + $L_{loc}$ + HT29.66 ± 2.1932.48 ± 1.3422.61 ± 0.48

Validation Sub‑division (by Difficulty)

ModelEasyMediumHard
VISUAL54.64 ± 2.5424.92 ± 1.1810.71 ± 1.33
VISUAL + $L_{loc}$57.10 ± 1.8425.68 ± 1.2412.72 ± 2.31
VISUAL + $L_{loc}$ + HT58.86 ± 2.5626.30 ± 1.4213.11 ± 2.15

Citation

@inproceedings{jia2021intentonomy,
  title={Intentonomy: a Dataset and Study towards Human Intent Understanding},
  author={Jia, Menglin and Wu, Zuxuan and Reiter, Austin and Cardie, Claire and Belongie, Serge and Lim, Ser‑Nam},
  booktitle={CVPR},
  year={2021}
}

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Social Media Analysis
Computer Vision

Source

Organization: arXiv

Created: 11/11/2020

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.