M4-Instruct-Data
M4‑Instruct is a multi‑image dataset collected in April 2024 from public datasets and the GPT‑4V API, intended for training large multimodal models. It is used for research on large multimodal models and chatbots, targeting audiences in computer vision, natural language processing, machine learning, and AI.
Description
M4‑Instruct Dataset Overview
Dataset Details
Dataset Type: M4‑Instruct is a collection of multi‑image data gathered from public datasets or generated via the GPT‑4V API. It aims to train large multimodal models with interleaved multi‑image capabilities, such as LLaVA‑NeXT‑Interleave.
Dataset Date: Collected in April 2024, released in June 2024.
Data Statistics: The release includes multi‑image, multi‑frame (video), and multi‑view (3D) data for M4‑Instruct.
Data Content:
- JSON files: m4_instruct_annotations.json and m4_instruct_video.json
- Images: *.zip
- For dreamsim_split.z01 and dreamsim_split.zip, run zip -s 0 dreamsim_split.zip --out dreamsim.zip
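Once the files listed above are downloaded, unpacking the image archives and reading an annotation file could look like the following. This is a minimal sketch, not an official loader: the helper names load_annotations and extract_archives are hypothetical, the internal schema of the JSON files is not described here so the code only loads the top-level value, and it assumes the split DreamSim archive has already been merged into dreamsim.zip with the zip command above (Python's standard zipfile module does not handle multi-part archives).

```python
import json
import zipfile
from pathlib import Path


def load_annotations(path):
    """Load one annotation file and return its top-level JSON value.

    The record schema is not assumed; inspect the result before relying on it.
    """
    with open(path, "r", encoding="utf-8") as f:
        return json.load(f)


def extract_archives(data_dir, out_dir):
    """Extract every single-part *.zip archive in data_dir into out_dir.

    Returns the names of the archives that were extracted.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    extracted = []
    for archive in sorted(Path(data_dir).glob("*.zip")):
        if archive.name == "dreamsim_split.zip":
            # Unmerged split archive: skip it and rely on the merged dreamsim.zip.
            continue
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(out_dir)
        extracted.append(archive.name)
    return extracted
```

For example, extract_archives("downloads", "images") followed by load_annotations("downloads/m4_instruct_annotations.json") would unpack the images and load the annotations, assuming everything was downloaded into a local folder named downloads (a hypothetical path).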
License: Creative Commons Attribution 4.0 International. Use of the data must also comply with OpenAI's terms of use: https://openai.com/policies/terms-of-use
Contact for Issues:
Intended Use
Primary Intended Use: Research on large multimodal models and chatbots.
Primary Intended Users: Researchers and enthusiasts in computer vision, natural language processing, machine learning, and artificial intelligence.
Source
Organization: huggingface
Created: 6/26/2024