Explore high-quality datasets for your AI and machine learning projects.
The dataset contains a feature named `signal` of type float32 and a feature named `label_id` of type int32. It is split into training, validation, and test sets with 9,900, 539, and 1,100 samples respectively. Total download size is 35,524,532 bytes; dataset size is 12,000,560 bytes.
Clotho is an audio‑captioning dataset used as input/output for audio captioning methods. The dataset was accepted and published at the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
The dataset was generated using the open‑source TensorFlow‑based library Sionna for ray‑tracing simulations, covering reflection and scattering counts across different urban areas in China. The simulated area is a 620‑meter square, with both transmitters and receivers positioned at a height of 1.5 m. The dataset includes raw simulation output files, RSSI grayscale images, transmitter location images, and scene images.