Dataset assetOpen Source CommunityMultimodal LearningPedestrian Detection

MMPedestron Benchmark Dataset

MMPedestron Benchmark Dataset is a multimodal pedestrian detection dataset that includes sub‑datasets such as CrowdHuman, COCO‑Person, FLIR, PEDRo, etc.

Source

github

Created

Jul 7, 2024

Updated

Jul 15, 2024

Signals

179 views

Availability

Linked source ready

Overview

Dataset description and usage context

MMPedestron Dataset Overview

Dataset Introduction

MMPedestron is a pedestrian detection dataset for multimodal learning, jointly developed by the following authors:

Yi Zhang
Wang ZENG
Sheng Jin
Chen Qian
Ping Luo
Wentao Liu

Dataset Configurations and Models

The dataset comprises multiple subsets and corresponding model configurations, as outlined below:

Region Proposal Performance

Pre‑training Stage
- Method & Config: MMPedestron
- Backbone: UNIXViT
- Download: Google Drive, Baidu Yun (Code: mmpd)
CrowdHuman
- Method & Config: MMPedestron
- Backbone: UNIXViT
- Download: Google Drive, Baidu Yun (Code: mmpd)
COCO‑Person
- Method & Config: MMPedestron finetune
- Backbone: UNIXViT
- Download: Google Drive, Baidu Yun (Code: mmpd)
FLIR
- Method & Config: MMPedestron
- Backbone: UNIXViT
- Download: Google Drive, Baidu Yun (Code: mmpd)
PEDRo
- Method & Config: MMPedestron
- Backbone: UNIXViT
- Download: Google Drive, Baidu Yun (Code: mmpd)
LLVIP Datasets
- Method & Config: MMPedestron
- Backbone: UNIXViT
- Download: Google Drive, Baidu Yun (Code: mmpd)
InOutDoor Datasets
- Method & Config: MMPedestron
- Backbone: UNIXViT
- Download: same as above
STCrowd Datasets
- Method & Config: MMPedestron
- Backbone: UNIXViT
- Download: same as above
EventPed Datasets
- Method & Config: MMPedestron
- Backbone: UNIXViT
- Download: same as above

Dataset Access

Please obtain the dataset from the following link: MMPD‑Dataset

License

Code and data may be used freely for non‑commercial purposes; commercial inquiries should contact Sheng Jin (jinsheng13[at]foxmail[dot]com).

Citation

If you use our paper and code, please cite:

@inproceedings{zhang2024when,
  title={When Pedestrian Detection Meets Multi‑Modal Learning: Generalist Model and Benchmark Dataset},
  author={Zhang, Yi and Zeng, Wang and Jin, Sheng and Qian, Chen and Luo, Ping and Liu, Wentao},
  booktitle={European Conference on Computer Vision (ECCV)},
  year={2024},
  month={September}
}

Need downstream help?

Pair the dataset with AI analysis and content workflows.

Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.

Explore AI studio