High Quality Data

Dataset Hub

Explore high-quality datasets for your AI and machine learning projects.

Sort:

Browse by Category

RML2016

The dataset contains a feature named `signal` of type float32 and a feature named `label_id` of type int32. It is split into training, validation, and test sets with 9,900, 539, and 1,100 samples respectively. Total download size is 35,524,532 bytes; dataset size is 12,000,560 bytes.

huggingface

View Details

Clotho

Audio Captioning

Signal Processing

Clotho is an audio‑captioning dataset used as input/output for audio captioning methods. The dataset was accepted and published at the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

github

View Details

RadioMap_Dataset_Reflection/Scattering_Counts

Wireless Communications

Signal Processing

The dataset was generated using the open‑source TensorFlow‑based library Sionna for ray‑tracing simulations, covering reflection and scattering counts across different urban areas in China. The simulated area is a 620‑meter square, with both transmitters and receivers positioned at a height of 1.5 m. The dataset includes raw simulation output files, RSSI grayscale images, transmitter location images, and scene images.

github

View Details