Explore high-quality datasets for your AI and machine learning projects.
The MMAD dataset is a comprehensive benchmark dataset for multimodal large language models in the field of industrial anomaly detection, containing questions, images, and descriptive text. All questions are presented in multiple‑choice format and have been manually verified. Images come from multiple sources and retain ground‑truth mask format to facilitate future evaluation of segmentation performance of multimodal large language models. The descriptive text is mostly of good quality but has not been manually verified, so use with caution. MMAD aims to evaluate the performance of current multimodal large language models in industrial quality inspection and identify key challenges in industrial anomaly detection.