DATASET
Open Source Community
MC-LLaVA Multi-Concept Personalization Dataset
The MC-LLaVA Multi-Concept Personalization dataset is a high-quality collection designed to advance multi-concept personalization research. It gathers images featuring multiple characters from various movies and manually generates multi-concept question‑answer samples. With diverse movie genres and QA types, the dataset aims to enable vision‑language models to excel in multi-concept personalization tasks.
Updated 11/23/2024
github
Description
MC-LLaVA: Multi-Concept Personalized Vision-Language Model
Overview
- Name: MC-LLaVA
- Type: Multi-Concept Personalized Vision-Language Model
- Paper: MC-LLaVA: Multi-Concept Personalized Vision-Language Model
Model Features
- Multi-Concept Personalization: Through a joint training strategy, MC-LLaVA can integrate multiple concepts in a single training session, achieving multi-concept personalization.
- Utilization of Visual Tag Information: Leverages visual tag information for concept label initialization, enhancing concept representations and accelerating joint training.
Dataset
- Data Source: Images containing multiple characters collected from various movies, with manually generated multi-concept QA samples.
- Dataset Characteristics:
- Diverse movie genres
- Diverse question‑answer types
Experimental Results
- Multi-Concept Personalization Response: Through comprehensive qualitative and quantitative experiments, MC-LLaVA demonstrates outstanding multi-concept personalization response capabilities.
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Login to Access
Please login to view download links and access full dataset details.
Topics
Vision-Language Models
Multi-Concept Personalization
Source
Organization: github
Created: 11/18/2024
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.