Explore high-quality datasets for your AI and machine learning projects.
The dataset comprises 440 audio samples of meowing produced by 21 cats from two breeds (Maine Coon and European Shorthair) under three different contexts: brushing, isolation in an unfamiliar environment, and waiting for food. Each audio file follows a naming convention that encodes cat ID, breed, sex, owner ID, recording session, and vocalization count. An additional zip file contains excluded recordings (non‑meow sounds) and unedited continuous vocalization sequences.
The DCASE 2022 Task 3 dataset comprises the STARSS22 dataset and synthetic SELD mixtures, collected jointly by the University of Tampere and Sony. It includes multichannel recordings and spatio‑temporal annotations for sound event detection and localization tasks. The dataset features real‑world recordings, multiple recording formats (first‑order Ambisonics and tetrahedral microphone arrays), detailed annotation procedures, and specifications. It is suitable for training and evaluating machine‑listening models for sound event detection, source localization, and joint sound event detection‑localization.