DATASET
Open Source Community
LibriSeVoc
We provide the LibriSeVoc dataset, which contains self‑vocoded samples generated by six state‑of‑the‑art neural vocoders. The goal is to highlight and exploit vocoder‑induced artifacts. The underlying real data are sourced from LibriTTS, following its naming convention.
Updated 5/23/2024
github
Description
Dataset Overview
Dataset Name
- LibriSeVoc Dataset
Description
- Designed to identify and exploit artifacts introduced by neural vocoders in synthetic speech.
- Contains self‑vocoded samples generated by six cutting‑edge vocoders, emphasizing vocoder‑produced signal artifacts.
Composition
- Detailed composition is shown in the accompanying table image.
Source Data
- Real data originates from LibriTTS and follows its naming logic.
Usage
- Intended for detecting synthetic human speech by revealing neural‑vocoder artifacts, improving the RawNet2 baseline and lowering error rates.
Access
- More information and download link: Dataset Download
Related Paper
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Login to Access
Please login to view download links and access full dataset details.
Topics
Speech Synthesis
Dataset
Source
Organization: github
Created: 4/4/2023
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.