JUHE API Marketplace
DATASET
Open Source Community

MM-CamObj

The MM‑CamObj dataset, created by Shanghai Jiao Tong University, addresses challenges for vision‑language models in complex, especially camouflaged‑object, scenarios. It comprises two subsets: CamObj‑Align (11,363 high‑quality image‑text pairs) for vision‑language alignment, and CamObj‑Instruct (11,363 images with 68,849 diverse dialogues) for instruction fine‑tuning. Images were carefully selected from classic datasets and detailed descriptions and dialogues were generated using GPT‑4o. MM‑CamObj is primarily used to evaluate and improve vision‑language models on camouflaged‑object detection, localization, and counting tasks.

Updated 9/24/2024
arXiv

Description

MM‑CamObj

Dataset Overview

  • Name: MM‑CamObj
  • Full Title: MM‑CamObj: A Comprehensive Multimodal Dataset for Camouflaged Object Scenarios
  • Source: ARXIV 24
  • Description: This repository hosts the official code and data for “MM‑CamObj: A Comprehensive Multimodal Dataset for Camouflaged Object Scenarios”.

Dataset Status

  • Release Status: Code and dataset are forthcoming.

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Vision-Language Models
Camouflaged Object Detection

Source

Organization: arXiv

Created: 9/24/2024

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.