Syoy/birdclef_2023_train

The birdclef_2023_train dataset contains bird audio recordings with associated labels and metadata. Each record includes the audio, a primary label, secondary labels, type, latitude, longitude, scientific name, common name, author, license, rating, URL, and an embedding vector. The dataset has a single split, train, with 16,941 samples, a dataset size of 5,388,534,029.882 bytes (about 5.39 GB), and a download size of 5,367,714,895 bytes (about 5.37 GB).
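As a rough illustration of how a dataset like this is usually read, the sketch below streams the train split with the Hugging Face `datasets` library and returns the first record. It assumes the repository name shown on this page (`Syoy/birdclef_2023_train`) is publicly accessible and that `datasets` is installed; the helper name `first_training_example` is hypothetical, and decoding the audio column may need an additional audio backend.

```python
REPO_ID = "Syoy/birdclef_2023_train"  # repository name as shown on this page


def first_training_example(repo_id: str = REPO_ID) -> dict:
    """Stream the train split and return its first record.

    Streaming avoids downloading the full archive up front.
    Assumes the repository is public and reachable.
    """
    # Imported lazily so this module loads even without the dependency.
    from datasets import load_dataset  # third-party: pip install datasets

    stream = load_dataset(repo_id, split="train", streaming=True)
    return next(iter(stream))
```

Calling `first_training_example()` requires network access. The returned dictionary should be keyed by the feature names listed under Dataset Features.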

Updated 3/21/2023
hugging_face

Description

Dataset Overview

Dataset Name

  • Name: birdclef_2023_train

Dataset Features

  • audio: Audio data
  • primary_label: Primary label, one of 202 category names
  • secondary_labels: Secondary labels, string type
  • type: String type
  • latitude: Latitude, float type
  • longitude: Longitude, float type
  • scientific_name: Scientific name, string type
  • common_name: Common name, string type
  • author: Author, string type
  • license: License, string type
  • rating: Rating, float type
  • url: URL link, string type
  • embeddings: Embedding vectors, sequence of floats
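The feature list above can be turned into a small sanity check for a single record. This is a minimal sketch, not part of the dataset itself: the field names come from the list, while the Python types chosen for `audio` and `primary_label`, and the decision to accept `None` for any field, are assumptions. The names `EXPECTED_FIELDS` and `schema_problems` are hypothetical.

```python
from __future__ import annotations

from typing import Any

# Field names as listed on this page, mapped to accepted Python types.
EXPECTED_FIELDS: dict[str, tuple[type, ...]] = {
    "audio": (dict,),             # assumption: decoded audio arrives as a dict
    "primary_label": (int, str),  # assumption: class index or class name
    "secondary_labels": (str,),
    "type": (str,),
    "latitude": (float,),
    "longitude": (float,),
    "scientific_name": (str,),
    "common_name": (str,),
    "author": (str,),
    "license": (str,),
    "rating": (float,),
    "url": (str,),
    "embeddings": (list,),        # sequence of floats
}


def schema_problems(record: dict[str, Any]) -> list[str]:
    """Return human-readable problems found in one record (empty if none)."""
    problems = []
    for name, types in EXPECTED_FIELDS.items():
        if name not in record:
            problems.append(f"missing field: {name}")
        # Assumption: None marks a missing value and is accepted.
        elif record[name] is not None and not isinstance(record[name], types):
            problems.append(f"{name}: unexpected type {type(record[name]).__name__}")
    # Report fields that the page does not list.
    for name in sorted(record.keys() - EXPECTED_FIELDS.keys()):
        problems.append(f"unexpected field: {name}")
    return problems
```

Running `schema_problems` on the first few records is a cheap way to confirm that the columns match this list before training on them.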

Dataset Split

  • train: Training set
    • num_bytes: 5388534029.882 bytes
    • num_examples: 16941 samples
    • download_size: 5367714895 bytes
    • dataset_size: 5388534029.882 bytes
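The split figures above are easier to reason about when converted to larger units and a per-sample average. The sketch below only does arithmetic on the numbers listed; the constant and function names are illustrative.

```python
# Values copied from the split metadata above.
NUM_EXAMPLES = 16_941
DATASET_SIZE_BYTES = 5_388_534_029.882
DOWNLOAD_SIZE_BYTES = 5_367_714_895


def to_gib(num_bytes: float) -> float:
    """Convert bytes to gibibytes (1 GiB = 2**30 bytes)."""
    return num_bytes / 2**30


# Average storage per sample and how much smaller the download is.
avg_bytes_per_example = DATASET_SIZE_BYTES / NUM_EXAMPLES
download_ratio = DOWNLOAD_SIZE_BYTES / DATASET_SIZE_BYTES

print(f"dataset size:  {to_gib(DATASET_SIZE_BYTES):.2f} GiB")
print(f"download size: {to_gib(DOWNLOAD_SIZE_BYTES):.2f} GiB")
print(f"average per example: {avg_bytes_per_example / 1e3:.1f} kB")
print(f"download / dataset ratio: {download_ratio:.4f}")
```

The ratio is close to 1, which suggests the download is barely compressed relative to the prepared dataset, so about as much disk space is needed for the download as for the dataset itself.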

Topics

Bird Species Recognition
Audio Classification

Source

Organization: hugging_face

Created: Unknown
