JUHE API Marketplace
DATASET
Open Source Community

encoded_LA_2021

This dataset is the original ASVspoof 2021 LA subset, containing two main features: 'label' and 'input_values'. The 'label' feature is a categorical label with two classes: 'fake' and 'real'. The 'input_values' feature is a sequence of floating‑point numbers. The dataset is split into training, validation, and test sets, each with specified sample counts and file sizes. The configuration name is 'default', and data files are assigned per split. The license is 'odc‑by', and the dataset name is 'ASVspoof 2021 LA'.

Updated 8/20/2024
huggingface

Description

Dataset Overview

Dataset Information

  • Feature Information:

    • label: categorical label with two classes:
      • 0: fake
      • 1: real
    • input_values: sequence of float32 values
  • Data Splits:

    • train: 16,464 samples, 2,461,510,888 bytes
    • validation: 16,926 samples, 1,849,172,416 bytes
    • test: 148,176 samples, 22,112,409,484 bytes
  • Dataset Size:

    • Download size: 23,303,764,736 bytes
    • Total size: 26,423,092,788 bytes
  • Configuration:

    • config_name: default
    • data_files:
      • train: path data/train-*
      • validation: path data/validation-*
      • test: path data/test-*
  • License: odc‑by (Open Data Commons Attribution License)

  • Dataset Name: ASVspoof 2021 LA

Source

  • Source: ASVspoof 2021 LA subset
  • Copyright: Derived from the ASVspoof 2021 challenge; licensed under ODC‑BY and available via Zenodo.

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Speech Recognition
Anti‑spoofing

Source

Organization: huggingface

Created: 8/20/2024

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.