JUHE API Marketplace
DATASET
Open Source Community

LAMBDA

This dataset is primarily used for analyzing and evaluating the effectiveness of video advertisements. It includes video identifiers (video_id), recall scores (recall_score), YouTube video IDs (youtube_id), and ad details (ad_details). The ad_details field is a structured feature containing sub‑features such as Audio, Brand, Duration, etc. The dataset is split into a training set (1,964 samples) and a test set (219 samples). Total size is 5,707,189 bytes, with a download size of 2,281,142 bytes.

Updated 7/3/2024
huggingface

Description

Dataset Overview

Dataset Information

  • Name: Long Term Memorability of Advertisements (LAMBDA)
  • License: MIT
  • Task Categories:
    • Text Classification
    • Text Generation
    • Question Answering
  • Tags:
    • Memorability
    • Long‑Term Memorability
    • Advertising Memorability

Dataset Structure

  • Features:
    • video_id: Identifier of the sample, type int64
    • recall_score: Memorability score of the video, range 0–1, type float64
    • youtube_id: YouTube ID of the video, type string
    • ad_details: Scene features for each video, containing:
      • Audio: Audio, type string
      • Brand: Brand, type string
      • Duration: Duration, type string
      • Orientation: Orientation, type string
      • Pace: Pace, type string
      • Scenes: List of scene items, each with sub‑features:
        • Colors: Colors, type string
        • Description: Description, type string
        • Emotions: Emotions, type string
        • Number: Number, type string
        • Photography Style: Photography Style, type string
        • Tags: Tags, type string
        • Text Shown: Text Shown, type string
        • Tone: Tone, type string
        • Visual Complexity: Visual Complexity, type string
      • Title: Title, type string

Data Splits

  • Training Set:
    • Sample count: 1,964
    • Bytes: 5,490,622.457169034
  • Test Set:
    • Sample count: 219
    • Bytes: 612,243.5428309665

Dataset Sizes

  • Download Size: 2,551,503 bytes
  • Total Size: 6,102,866 bytes

Configuration

  • Default Configuration:
    • Training path: data/train-*
    • Test path: data/test-*

Citation

plaintext @misc{s2024longtermadmemorabilityunderstanding, title={Long‑Term Ad Memorability: Understanding and Generating Memorable Ads}, author={Harini S I au2 and Somesh Singh and Yaman K Singla and Aanisha Bhattacharyya and Veeky Baths and Changyou Chen and Rajiv Ratn Shah and Balaji Krishnamurthy}, year={2024}, eprint={2309.00378}, archivePrefix={arXiv}, primaryClass={cs.CL}, url={https://arxiv.org/abs/2309.00378} }

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Video Advertising
Data Analysis

Source

Organization: huggingface

Created: 7/3/2024

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.