Dataset asset, Open Source Community, Speech Recognition, Music Technology
Human Voice Dataset
A collection of human voice recordings featuring various singing styles (pitch, vowels, consonants, etc.). The dataset aims to simplify research on voice‑based music controllers and can be used to benchmark vocal feature detection algorithms (pitch detection, onset detection) as well as serve as training data for machine‑learning models.
Source: GitHub
Created: May 29, 2018
Updated: Jun 19, 2021
Dataset Overview
- Purpose: This dataset is designed to streamline research on voice‑controlled musical interfaces. It facilitates benchmarking of vocal feature detection algorithms (e.g., pitch detection, onset detection) and provides training material for machine‑learning models.
- Current Version: Recordings from one singer are provided; additional singers will be added in the coming weeks.
Vocal Features
- Notes: Explored in semitone intervals from the lowest to the highest range, e.g., c3.wav, c#3.wav, d3.wav, ...
- Vowels: Form a limited‑value dimension (a, e, i, ...), e.g., _-a-[note].wav, _-e-[note].wav, ...
- Consonants: Must be combined with a vowel to be intelligible, e.g., t-a-[note].wav, t-u-[note].wav, ...
- Dynamics: Volume changes, pitch bends, vibrato (currently unavailable)
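Because each note file is named after its pitch, the file name can serve as ground truth when benchmarking a pitch detector. The following is a minimal sketch, not part of the dataset itself. It assumes the note names follow scientific pitch notation with C4 = MIDI 60 and A4 = 440 Hz, which the dataset description does not state explicitly; the function names are hypothetical.

```python
import math
import re

# Semitone offsets within an octave, relative to C.
_SEMITONES = {"c": 0, "d": 2, "e": 4, "f": 5, "g": 7, "a": 9, "b": 11}
_NOTE_RE = re.compile(r"^([a-g])(#?)(-?\d+)$")


def note_to_midi(name: str) -> int:
    """Convert a note name such as 'c#3' to a MIDI number.

    Assumption: octave numbering follows the C4 = 60 convention.
    """
    match = _NOTE_RE.match(name.strip().lower())
    if match is None:
        raise ValueError(f"unrecognized note name: {name!r}")
    letter, sharp, octave = match.groups()
    return 12 * (int(octave) + 1) + _SEMITONES[letter] + (1 if sharp else 0)


def midi_to_hz(midi: int, a4_hz: float = 440.0) -> float:
    """Equal-tempered frequency of a MIDI note (A4 is MIDI 69)."""
    return a4_hz * 2.0 ** ((midi - 69) / 12)


def cents_error(estimated_hz: float, reference_hz: float) -> float:
    """Signed distance in cents between an estimate and the reference pitch."""
    return 1200.0 * math.log2(estimated_hz / reference_hz)
```

A detector's output for c3.wav could then be scored with cents_error(detected_hz, midi_to_hz(note_to_midi("c3"))). A sung note will rarely match the nominal pitch exactly, so a tolerance of some tens of cents is probably more realistic than an exact comparison.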
Dataset Structure
- File Naming Pattern: [consonant]-[vowel]-[note]-[dynamic].wav
- Directory Layout:
data/voices/martin/notes/sources/exports/
vowels/sources/exports/
consonants/sources/exports/
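The naming pattern lends itself to a small parser that turns a file name into labels for training or evaluation. The sketch below is inferred from the example file names only: it assumes that a bare name such as c3.wav carries just a note, that "_" stands for "no consonant", and that the dynamic field is optional (dynamics are currently unavailable). VoiceSample and parse_sample_name are hypothetical names, not part of the repository.

```python
from dataclasses import dataclass
from pathlib import Path
from typing import Optional


@dataclass(frozen=True)
class VoiceSample:
    consonant: Optional[str]  # None when absent or written as "_"
    vowel: Optional[str]      # None for note-only files
    note: str                 # e.g. "c#3"
    dynamic: Optional[str]    # None when the field is missing


def parse_sample_name(filename: str) -> VoiceSample:
    """Split '[consonant]-[vowel]-[note]-[dynamic].wav' into its fields."""
    parts = Path(filename).stem.split("-")
    if len(parts) == 1:
        # Note-only file such as 'c3.wav'.
        return VoiceSample(None, None, parts[0], None)
    if len(parts) in (3, 4):
        consonant, vowel, note = parts[:3]
        dynamic = parts[3] if len(parts) == 4 else None
        return VoiceSample(
            None if consonant == "_" else consonant, vowel, note, dynamic
        )
    raise ValueError(f"unexpected file name: {filename!r}")
```

For example, parse_sample_name("t-a-c3.wav") yields a sample with consonant "t", vowel "a" and note "c3". Names that do not fit the inferred shapes raise ValueError rather than being silently mislabeled.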
Singer Information
- Property Files:
  - singer.properties: Contains age, gender, nationality, etc.
  - recorder.properties: Contains recording equipment, recording conditions, etc.
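The property files can be read into a dictionary so that singer and recording metadata travel with each sample. This is a simplified sketch that assumes the files use plain key=value lines in the style of Java .properties files; the exact keys and syntax are not documented here, and features such as ":" separators, escapes, or line continuations are not handled. The function names are hypothetical.

```python
from pathlib import Path
from typing import Dict


def parse_properties(text: str) -> Dict[str, str]:
    """Parse simple 'key=value' lines, skipping blanks and comments."""
    props: Dict[str, str] = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        # '#' and '!' start comment lines in .properties files.
        if not line or line.startswith(("#", "!")):
            continue
        key, sep, value = line.partition("=")
        if not sep:
            continue  # ignore lines without '=' in this simplified parser
        props[key.strip()] = value.strip()
    return props


def load_properties(path: str) -> Dict[str, str]:
    """Read a properties file from disk (assumes UTF-8 encoding)."""
    return parse_properties(Path(path).read_text(encoding="utf-8"))
```

Calling load_properties on a singer's singer.properties file would return whatever keys that file defines. All values are kept as strings, so numeric fields such as age would need explicit conversion.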
Dataset Expansion
- Adding Samples:
  - Clone the repository: git clone https://github.com/vocobox/human-voice-dataset.git
  - Copy the singer folder and commit changes: git add ., git commit -m "[new singer] barbara", git push origin master
Other Useful Sound Datasets
- Piano Note Dataset: MAPS Database
- Singing Voice Dataset: Singing Voice Dataset
- Speech Corpora: CMU Speech, VoxForge, TED‑LIUM Corpus
- Sound and Instruments: IRMAS Dataset