MM-CamObj
The MM‑CamObj dataset, created by Shanghai Jiao Tong University, addresses challenges for vision‑language models in complex, especially camouflaged‑object, scenarios. It comprises two subsets: CamObj‑Align (11,363 high‑quality image‑text pairs) for vision‑language alignment, and CamObj‑Instruct (11,363 images with 68,849 diverse dialogues) for instruction fine‑tuning. Images were carefully selected from classic datasets and detailed descriptions and dialogues were generated using GPT‑4o. MM‑CamObj is primarily used to evaluate and improve vision‑language models on camouflaged‑object detection, localization, and counting tasks.
Description
MM‑CamObj
Dataset Overview
- Name: MM‑CamObj
- Full Title: MM‑CamObj: A Comprehensive Multimodal Dataset for Camouflaged Object Scenarios
- Source: ARXIV 24
- Description: This repository hosts the official code and data for “MM‑CamObj: A Comprehensive Multimodal Dataset for Camouflaged Object Scenarios”.
Dataset Status
- Release Status: Code and dataset are forthcoming.
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Please login to view download links and access full dataset details.
Topics
Source
Organization: arXiv
Created: 9/24/2024
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.