Back to datasets
Dataset assetOpen Source CommunityVideo AnalysisMusic Recognition

MUSIC Dataset

This dataset contains YouTube video IDs used in the Sound of Pixels project, including solo video IDs for 11 and 21 instrument sets, as well as duet performance video IDs. After the paper was published, some noisy videos were removed, so the number of videos slightly differs from the paper.

Source
github
Created
Aug 13, 2018
Updated
May 20, 2024
Signals
173 views
Availability
Linked source ready
Overview

Dataset description and usage context

MUSIC Dataset from Sound of Pixels

Dataset contents

  • MUSIC_solo_videos.json: Contains YouTube video IDs for solo performances of 11 instruments.
  • MUSIC21_solo_videos.json: Contains YouTube video IDs for solo performances of 21 instruments.
  • MUSIC_duet_videos.json: Contains YouTube video IDs for duet performances.

Dataset notes

  • The number of videos in the dataset differs slightly from the paper because some noisy videos were later removed.

Citation information

  • When using this dataset or code, please cite the following papers: bibtex @InProceedings{zhao2018sound, author = {Zhao, Hang and Gan, Chuang and Rouditchenko, Andrew and Vondrick, Carl and McDermott, Josh and Torralba, Antonio}, title = {The Sound of Pixels}, booktitle = {The European Conference on Computer Vision (ECCV)}, month = {September}, year = {2018} }

    bibtex @inproceedings{zhao2019sound, title={The sound of motions}, author={Zhao, Hang and Gan, Chuang and Ma, Wei-Chiu and Torralba, Antonio}, booktitle={Proceedings of the IEEE International Conference on Computer Vision}, pages={1735--1744}, year={2019} }

Need downstream help?

Pair the dataset with AI analysis and content workflows.

Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.

Explore AI studio