MC-LLaVA Multi-Concept Personalization Dataset

The MC-LLaVA Multi-Concept Personalization dataset is designed to advance research on multi-concept personalization for vision-language models. It consists of images collected from various movies, each featuring multiple characters, paired with manually created multi-concept question-answer samples. The movies span diverse genres and the samples cover diverse QA types, so that models trained and evaluated on the dataset can handle tasks involving several personalized concepts at once.

Updated 11/23/2024

Description

MC-LLaVA: Multi-Concept Personalized Vision-Language Model

Overview

Model Features

  • Multi-Concept Personalization: A joint training strategy trains multiple concepts together in a single training step, so one model is personalized for several concepts at once.
  • Use of Visual Token Information: Concept tokens are initialized from visual token information, which improves concept representations and speeds up joint training.
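
The initialization idea in the second bullet, starting each concept's learnable embedding from visual features rather than from random values, can be illustrated with a minimal sketch. The source does not say how the visual information is aggregated, so mean pooling is used here purely as a stand-in. The function names, the `<concept_a>`-style keys, and the toy two-dimensional features are all hypothetical.

```python
from statistics import fmean


def init_concept_embedding(visual_features):
    """Mean-pool equal-length feature vectors into one initial embedding.

    Assumption: mean pooling is a placeholder aggregation; the source
    does not specify the actual initialization rule.
    """
    if not visual_features:
        raise ValueError("need at least one visual feature vector")
    dim = len(visual_features[0])
    if any(len(vec) != dim for vec in visual_features):
        raise ValueError("all feature vectors must have the same length")
    return [fmean(vec[i] for vec in visual_features) for i in range(dim)]


def init_concept_table(features_by_concept):
    """Build initial embeddings for several concepts in one pass."""
    return {
        name: init_concept_embedding(vectors)
        for name, vectors in features_by_concept.items()
    }


# Toy features standing in for visual features extracted from
# reference images of each concept (hypothetical values).
toy_features = {
    "<concept_a>": [[1.0, 0.0], [3.0, 2.0]],
    "<concept_b>": [[0.0, 4.0]],
}
concept_table = init_concept_table(toy_features)
print(concept_table)
```

Building the table for all concepts in one call mirrors the joint setup described above, where several concepts are prepared together before training begins.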

Dataset

  • Data Source: Images containing multiple characters collected from various movies, with manually generated multi-concept QA samples.
  • Dataset Characteristics:
    • Diverse movie genres
    • Diverse question‑answer types
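
To make the dataset description concrete, here is a sketch of how one multi-concept QA sample might be represented in code. The listing does not publish a schema, so every field name below (`image_path`, `concepts`, `question`, `answer`, `qa_type`, `genre`) and every example value is an assumption for illustration only.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class MultiConceptQA:
    """One hypothetical sample: a movie frame plus a QA pair about it."""
    image_path: str   # path to the movie frame (hypothetical layout)
    concepts: tuple   # identifiers of the characters shown in the frame
    question: str
    answer: str
    qa_type: str      # assumed label for the kind of question
    genre: str        # assumed label for the movie genre


def is_multi_concept(sample):
    """True when the sample refers to at least two distinct concepts."""
    return len(set(sample.concepts)) >= 2


def count_by(samples, field):
    """Count samples per value of a field, e.g. 'qa_type' or 'genre'."""
    return Counter(getattr(s, field) for s in samples)


# Made-up samples, not taken from the dataset.
samples = [
    MultiConceptQA("frames/0001.jpg", ("<A>", "<B>"),
                   "What are <A> and <B> doing?", "They are talking.",
                   "activity", "drama"),
    MultiConceptQA("frames/0002.jpg", ("<A>", "<C>"),
                   "Is <C> standing to the left of <A>?", "Yes.",
                   "position", "comedy"),
    MultiConceptQA("frames/0003.jpg", ("<B>",),
                   "What is <B> wearing?", "A red coat.",
                   "appearance", "drama"),
]
multi = [s for s in samples if is_multi_concept(s)]
print(len(multi), count_by(samples, "genre"))
```

Counting by `qa_type` and `genre` gives a quick check on the two kinds of diversity the dataset description emphasizes.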

Experimental Results

  • Multi-Concept Personalized Responses: In both qualitative and quantitative experiments, MC-LLaVA produces strong personalized responses to prompts that involve multiple concepts.

Topics

Vision-Language Models
Multi-Concept Personalization

Source

Organization: github

Created: 11/18/2024
