encoded_LA_2021

This dataset is the original ASVspoof 2021 LA (Logical Access) subset. It has two features: 'label', a categorical label with two classes ('fake' and 'real'), and 'input_values', a sequence of float32 values. The data is split into training (16,464 samples), validation (16,926 samples), and test (148,176 samples) sets. The configuration name is 'default', with one data file pattern per split. The license is 'odc-by', and the dataset name is 'ASVspoof 2021 LA'.

Updated 8/20/2024

huggingface

Description

Dataset Overview

Dataset Information

Feature Information:
- label: categorical label with two classes:
  - 0: fake
  - 1: real
- input_values: sequence of float32 values
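
The feature schema above can be illustrated with a minimal sketch. It assumes each record is a mapping with an integer 'label' and a list-like 'input_values'; the helper name decode_example and the sample record are hypothetical and are not taken from the dataset.

```python
from array import array

# Class ids as listed in the feature information: 0 is fake, 1 is real.
ID2LABEL = {0: "fake", 1: "real"}
LABEL2ID = {name: idx for idx, name in ID2LABEL.items()}


def decode_example(example):
    """Return (label_name, float32 samples) for one record.

    Hypothetical helper: assumes `example` has an integer "label"
    and a sequence of numbers under "input_values".
    """
    label_name = ID2LABEL[example["label"]]
    # Typecode "f" stores single-precision (float32) values.
    values = array("f", example["input_values"])
    return label_name, values


# Made-up record for illustration only; real sequences are far longer.
example = {"label": 0, "input_values": [0.0, 0.25, -0.5]}
name, values = decode_example(example)
print(name, len(values))
```

Keeping the mapping in one place helps avoid inverting the classes, since 0 denotes the spoofed ('fake') class here rather than the genuine one.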
Data Splits:
- train: 16,464 samples, 2,461,510,888 bytes
- validation: 16,926 samples, 1,849,172,416 bytes
- test: 148,176 samples, 22,112,409,484 bytes
Dataset Size:
- Download size: 23,303,764,736 bytes
- Total size: 26,423,092,788 bytes
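
As a quick consistency check, the per-split byte counts listed above add up to the stated total size. The sketch below redoes that arithmetic and derives each split's share of the data and the average bytes per sample; the variable names are illustrative only.

```python
# Figures copied from the Data Splits and Dataset Size entries.
SPLITS = {
    "train": {"samples": 16_464, "bytes": 2_461_510_888},
    "validation": {"samples": 16_926, "bytes": 1_849_172_416},
    "test": {"samples": 148_176, "bytes": 22_112_409_484},
}
DOWNLOAD_SIZE = 23_303_764_736
TOTAL_SIZE = 26_423_092_788

split_bytes = sum(info["bytes"] for info in SPLITS.values())
total_samples = sum(info["samples"] for info in SPLITS.values())

for name, info in SPLITS.items():
    share = info["bytes"] / TOTAL_SIZE          # fraction of all bytes
    avg = info["bytes"] / info["samples"]       # mean bytes per sample
    print(f"{name:<10} {share:6.1%} of bytes, about {avg / 1024:.0f} KiB per sample")

print("splits sum to total:", split_bytes == TOTAL_SIZE)
print("total samples:", total_samples)
```

The download size is smaller than the total size, which is consistent with the files being stored in a compressed form, although the listing does not say so explicitly.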
Configuration:
- config_name: default
- data_files:
  - train: data/train-*
  - validation: data/validation-*
  - test: data/test-*
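
The data_files entries are glob patterns, so every file under data/ whose name starts with a split's prefix belongs to that split. The following sketch shows one way such patterns could be resolved. The shard file names are hypothetical, and assign_splits is an illustrative helper, not part of any library.

```python
from fnmatch import fnmatchcase

# Patterns from the 'default' configuration.
DATA_FILES = {
    "train": "data/train-*",
    "validation": "data/validation-*",
    "test": "data/test-*",
}


def assign_splits(paths, data_files=DATA_FILES):
    """Group file paths by the split whose glob pattern they match."""
    splits = {split: [] for split in data_files}
    for path in paths:
        for split, pattern in data_files.items():
            if fnmatchcase(path, pattern):
                splits[split].append(path)
    return splits


# Hypothetical shard names; the real file names are not given in the listing.
paths = [
    "data/train-00000-of-00002.parquet",
    "data/train-00001-of-00002.parquet",
    "data/validation-00000-of-00001.parquet",
    "data/test-00000-of-00001.parquet",
    "README.md",
]
splits = assign_splits(paths)
for split, files in splits.items():
    print(split, files)
```

In practice, the Hugging Face datasets library resolves these patterns itself when load_dataset is called with the repository id and a split name. The repository id is not stated in this listing, so it is left out here.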
License: odc-by (Open Data Commons Attribution License)
Dataset Name: ASVspoof 2021 LA

Source

Source: ASVspoof 2021 LA subset
Copyright: Derived from the ASVspoof 2021 challenge; licensed under ODC-BY and available via Zenodo.

Topics

Speech Recognition

Anti-spoofing

Source

Organization: huggingface

Created: 8/20/2024