MMPedestron Benchmark Dataset
MMPedestron Benchmark Dataset is a multimodal pedestrian detection dataset that includes sub‑datasets such as CrowdHuman, COCO‑Person, FLIR, PEDRo, etc.
Dataset description and usage context
MMPedestron Dataset Overview
Dataset Introduction
MMPedestron is a pedestrian detection dataset for multimodal learning, jointly developed by the following authors:
- Yi Zhang
- Wang ZENG
- Sheng Jin
- Chen Qian
- Ping Luo
- Wentao Liu
Dataset Configurations and Models
The dataset comprises multiple subsets and corresponding model configurations, as outlined below:
Region Proposal Performance
-
Pre‑training Stage
- Method & Config: MMPedestron
- Backbone: UNIXViT
- Download: Google Drive, Baidu Yun (Code: mmpd)
-
CrowdHuman
- Method & Config: MMPedestron
- Backbone: UNIXViT
- Download: Google Drive, Baidu Yun (Code: mmpd)
-
COCO‑Person
- Method & Config: MMPedestron finetune
- Backbone: UNIXViT
- Download: Google Drive, Baidu Yun (Code: mmpd)
-
FLIR
- Method & Config: MMPedestron
- Backbone: UNIXViT
- Download: Google Drive, Baidu Yun (Code: mmpd)
-
PEDRo
- Method & Config: MMPedestron
- Backbone: UNIXViT
- Download: Google Drive, Baidu Yun (Code: mmpd)
-
LLVIP Datasets
- Method & Config: MMPedestron
- Backbone: UNIXViT
- Download: Google Drive, Baidu Yun (Code: mmpd)
-
InOutDoor Datasets
- Method & Config: MMPedestron
- Backbone: UNIXViT
- Download: same as above
-
STCrowd Datasets
- Method & Config: MMPedestron
- Backbone: UNIXViT
- Download: same as above
-
EventPed Datasets
- Method & Config: MMPedestron
- Backbone: UNIXViT
- Download: same as above
Dataset Access
Please obtain the dataset from the following link: MMPD‑Dataset
License
Code and data may be used freely for non‑commercial purposes; commercial inquiries should contact Sheng Jin (jinsheng13[at]foxmail[dot]com).
Citation
If you use our paper and code, please cite:
@inproceedings{zhang2024when,
title={When Pedestrian Detection Meets Multi‑Modal Learning: Generalist Model and Benchmark Dataset},
author={Zhang, Yi and Zeng, Wang and Jin, Sheng and Qian, Chen and Luo, Ping and Liu, Wentao},
booktitle={European Conference on Computer Vision (ECCV)},
year={2024},
month={September}
}
Pair the dataset with AI analysis and content workflows.
Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.