huckiyang/DiPCo
Speech Signal Processing, Multi-Speaker Separation
The DiPCo (Dinner Party Corpus) dataset, publicly released by Amazon, is intended to help speech scientists separate the signals of multiple speakers in reverberant rooms. It was created by having volunteers simulate dinner-party scenarios in a lab, with four participants in each session. It includes near-field and far-field recordings, together with detailed transcriptions, for development and evaluation. The dataset is released under the CDLA-Permissive-1.0 license.
Source: Hugging Face | Updated: Feb 6, 2024 | 167 views | Linked