JUHE API Marketplace
DATASET
Open Source Community

LibriSeVoc

We provide the LibriSeVoc dataset, which contains self‑vocoded samples generated by six state‑of‑the‑art neural vocoders. The goal is to highlight and exploit vocoder‑induced artifacts. The underlying real data are sourced from LibriTTS, following its naming convention.

Updated 5/23/2024
github

Description

Dataset Overview

Dataset Name

  • LibriSeVoc Dataset

Description

  • Designed to identify and exploit artifacts introduced by neural vocoders in synthetic speech.
  • Contains self‑vocoded samples generated by six cutting‑edge vocoders, emphasizing vocoder‑produced signal artifacts.

Composition

  • Detailed composition is shown in the accompanying table image.

Source Data

  • Real data originates from LibriTTS and follows its naming logic.

Usage

  • Intended for detecting synthetic human speech by revealing neural‑vocoder artifacts, improving the RawNet2 baseline and lowering error rates.

Access

Related Paper

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Speech Synthesis
Dataset

Source

Organization: github

Created: 4/4/2023

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.