COIN Dataset
COIN is currently the largest comprehensive instructional video analysis dataset, containing 11,827 videos covering 180 different tasks across 12 domains. All videos are collected from YouTube and annotated using an efficient toolbox.
Description
Dataset Overview
Name: COIN Dataset
Scale: Contains 11,827 videos covering 180 different tasks across 12 domains.
Content: Video content includes a variety of tasks such as vehicle maintenance, cooking, etc., specifically including car polishing, french fry making, and more.
Source: All videos are collected from YouTube.
Annotation Tools: Annotated using a specialized toolbox.
Dataset Structure
Hierarchy: The dataset is organized in a three‑level hierarchy comprising domain, task, and step levels.
File Formats:
- Video and annotation information: Stored in JSON files, containing YouTube ID, duration, task name, video URL, start and end times, subset type, task ID, and detailed annotation information.
- Annotation details: Include annotation ID, name, and time intervals.
Usage License
License Type: Research‑only use, including sharing and modification of the material. The licensor must not be implied to support or endorse the user’s actions.
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Please login to view download links and access full dataset details.
Topics
Source
Organization: github
Created: 3/4/2019
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.