JUHE API Marketplace
API CatalogDatasetsDocsBlog
API CatalogDatasetsDocsBlog

Dataset Catalog

Browse trusted datasets for evaluation, enrichment, and production use.

Category index
Showing 1 of 1 datasets
Category: Textual Description

MusicSet

Music AudioTextual Description

MusicSet is built on the MTG‑Jamendo dataset and focuses on music audio with rich textual descriptions. The dataset selects music tracks that have at least five tags, extracts the middle 80 % of each audio file, and splits it into 10‑second clips while removing non‑melodic sections. The clips are saved as individual WAV files and their descriptive information is stored in JSON files. Textual descriptions are generated via the DeepSeek API, which was trained on the MusicCaps description style and consolidates multiple tags into full sentences. MusicSet ultimately contains about 150,000 10‑second music‑text pairs, integrating elements from MusicBench and MusicCaps.

Source huggingfaceUpdated Nov 5, 2024141 viewsLinked
Inspect dataset
JUHE API Marketplace

Accelerate development and ship production-grade integrations with APIs, MCP services, and AI-first infrastructure workflows.

For Developers

ConsoleDocumentation

Product

Browse APIsTemp Mail APIGlobal SMS

Company

What's NewContact SupportTerms Of ServicePrivacy Policy
Copyright © 2026 JUHEDATA HK LIMITED - All rights reserved