MusicSet

MusicSet is built on the MTG-Jamendo dataset and pairs music audio with rich textual descriptions. Tracks with at least five tags are selected; the middle 80% of each audio file is extracted and split into 10-second clips, and non-melodic sections are removed. Each clip is saved as an individual WAV file, and its descriptive information is stored in a JSON file. Textual descriptions are generated with the DeepSeek API, which follows the MusicCaps description style and consolidates multiple tags into full sentences. The final dataset contains about 150,000 10-second music-text pairs and also integrates data from MusicBench and MusicCaps.

Updated 11/5/2024
huggingface

Description

MusicSet Dataset

Overview

MusicSet is constructed from the MTG-Jamendo dataset by selecting tagged tracks, cutting them into short clips, and pairing each clip with a generated text description. The dataset contains roughly 150,000 10-second music-text pairs.

Data Processing

  1. Audio Selection: Keep only tracks that have at least five tags.
  2. Audio Segmentation: Load each audio file, extract the middle 80%, split it into 10-second segments, and discard non-melodic segments.
  3. Tag Expansion: Use the DeepSeek API to expand each clip's tags into a full descriptive text.
  4. Data Integration: Combine the generated music-text pairs with MusicBench and MusicCaps to form the final MusicSet dataset.
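
The selection and segmentation steps above can be sketched as plain index arithmetic. This is only an illustrative sketch, not the authors' code: the function names `has_enough_tags` and `clip_bounds`, the choice to drop a trailing partial clip, and the use of sample indices are all assumptions, and the non-melodic filtering step is not covered because the card does not describe how it is done.

```python
def has_enough_tags(tags, min_tags=5):
    """Selection rule from step 1: keep a track only if it has at least min_tags tags."""
    return len(set(tags)) >= min_tags


def clip_bounds(num_samples, sample_rate, keep_percent=80, clip_seconds=10):
    """Return (start, end) sample indices of 10-second clips inside the middle of a track.

    Assumption: the discarded 20% is split evenly between the start and the end,
    and a trailing remainder shorter than one clip is dropped.
    """
    # Samples trimmed from each end (integer arithmetic avoids float rounding).
    trim = num_samples * (100 - keep_percent) // 200
    start, stop = trim, num_samples - trim
    clip_len = clip_seconds * sample_rate

    bounds = []
    pos = start
    while pos + clip_len <= stop:
        bounds.append((pos, pos + clip_len))
        pos += clip_len
    return bounds


if __name__ == "__main__":
    # A 100-second track at 44.1 kHz: the middle 80 seconds are cut into clips.
    for begin, end in clip_bounds(100 * 44100, 44100):
        print(begin, end)
```

With these assumptions, a 100-second track keeps its middle 80 seconds and is cut into eight non-overlapping clips.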

Data Format

  • Audio Files: Saved as individual WAV files.
  • Description Files: Saved as JSON files.
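
As a rough illustration of this layout, the sketch below writes one clip as a WAV file with a JSON description next to it, using only the Python standard library. The card does not specify the JSON schema, file naming, sample rate, or bit depth, so the `audio` and `description` keys, the shared `clip_id` file stem, and 16-bit PCM are hypothetical choices.

```python
import json
import wave
from pathlib import Path


def save_pair(out_dir, clip_id, pcm_bytes, sample_rate, description, channels=1):
    """Write <clip_id>.wav and <clip_id>.json side by side and return both paths.

    pcm_bytes is assumed to be raw 16-bit little-endian PCM audio.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    wav_path = out_dir / f"{clip_id}.wav"
    with wave.open(str(wav_path), "wb") as wf:
        wf.setnchannels(channels)
        wf.setsampwidth(2)  # 2 bytes per sample = 16-bit PCM (assumed)
        wf.setframerate(sample_rate)
        wf.writeframes(pcm_bytes)

    # Hypothetical schema: the real MusicSet JSON keys may differ.
    record = {"audio": wav_path.name, "description": description}
    json_path = out_dir / f"{clip_id}.json"
    json_path.write_text(
        json.dumps(record, ensure_ascii=False, indent=2), encoding="utf-8"
    )
    return wav_path, json_path
```

Keeping the audio and its description under the same file stem makes it straightforward to pair them again when loading the data.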

Citation

@article{wei2024melodyneedmusicgeneration,
      title={Melody Is All You Need For Music Generation}, 
      author={Shaopeng Wei and Manzhen Wei and Haoyu Wang and Yu Zhao and Gang Kou},
      year={2024},
      eprint={2409.20196},
      archivePrefix={arXiv},
      primaryClass={cs.SD},
      url={https://arxiv.org/abs/2409.20196}, 
}

Topics

Music Audio
Textual Description

Source

Organization: huggingface

Created: 10/31/2024
