Back to datasets
Dataset assetOpen Source CommunityMusic DataData Processing

roszcz/maestro-base-v2

The dataset named maestro‑base‑v2 is intended for music analysis. It includes three main features: `notes`, `control_changes`, and `source`. `notes` contain numeric fields for note end time, pitch, start time, and velocity. `control_changes` contain numeric fields for control number, time, and value. `source` is a string possibly indicating the music source. The dataset is split into validation (137 samples), test (177 samples), and train (962 samples). Total download size is 141,530,448 bytes; total size is 493,963,458 bytes.

Source
hugging_face
Created
Nov 28, 2025
Updated
Nov 10, 2023
Signals
88 views
Availability
Linked source ready
Overview

Dataset description and usage context

Dataset Information

Features

  • notes
    • end: sequence, float64
    • pitch: sequence, int64
    • start: sequence, float64
    • velocity: sequence, int64
  • control_changes
    • number: sequence, int64
    • time: sequence, float64
    • value: sequence, int64
  • source: string

Data Split

  • validation
    • Bytes: 53,035,261.55642633
    • Samples: 137
  • test
    • Bytes: 68,520,009.45611285
    • Samples: 177
  • train
    • Bytes: 372,408,186.9874608
    • Samples: 962

Dataset Size

  • Download size: 141,530,448 bytes
  • Total size: 493,963,458.0 bytes
Need downstream help?

Pair the dataset with AI analysis and content workflows.

Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.

Explore AI studio