MBZUAI/VideoInstruct-100K
VideoInstruct100K is a high‑quality video‑dialogue dataset created through human‑in‑the‑loop and semi‑automatic annotation techniques. The Q&A content covers video summarization, description‑based question answering (exploring spatial, temporal, relational, and reasoning concepts), and creative/generative question answering.
Description
VideoInstruct100K Dataset Overview
Dataset Description
VideoInstruct100K is a high‑quality video‑dialogue dataset generated using human‑in‑the‑loop and semi‑automatic annotation techniques. The Q&A content includes the following aspects:
- Video summarization
- Description‑based question answering (exploring spatial, temporal, relational, and reasoning concepts)
- Creative/generative question answering
Citation Information
If you find this dataset useful, please consider citing the following paper:
@article{Maaz2023VideoChatGPT,
title={Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models},
author={Muhammad Maaz, Hanoona Rasheed, Salman Khan and Fahad Khan},
journal={ArXiv 2306.05424},
year={2023}
}
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Please login to view download links and access full dataset details.
Topics
Source
Organization: hugging_face
Created: Unknown
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.