renumics/esc50
ESC‑50 is an environmental sound classification dataset comprising 50 distinct sound categories such as animal noises (dog, cat, chicken), natural sounds (rain, sea waves, wind), human sounds (laughter, cough, footstep), and mechanical sounds (helicopter, chainsaw, siren). Features include audio files, labels, and fold information. The training set contains 2,000 samples (≈882 MB). The dataset is released under a Creative Commons Attribution‑NonCommercial license.
Description
Dataset Overview
Dataset Information
Features
- src_file: string
- fold: int64
- label: categorical with 50 classes (e.g., 0: dog, 1: rooster, …, 49: hand_saw)
- esc10: boolean
- take: string
- audio: audio data
Data Split
- train: 2,000 samples, 882,179,256 bytes
Size
- Download size: 773,038,488 bytes
- Dataset size: 882,179,256 bytes
Configuration
- default: data files located at
data/train-*
License
- Creative Commons Attribution‑NonCommercial (cc‑by‑nc‑2.0)
Task Type
- Audio Classification
Scale
- 1K < N < 10K
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Please login to view download links and access full dataset details.
Topics
Source
Organization: hugging_face
Created: Unknown
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.