Explore high-quality datasets for your AI and machine learning projects.
MedOdyssey is a medical‑domain long‑context evaluation benchmark co‑created by East China University of Science and Technology and Shanghai Artificial Intelligence Laboratory, comprising 10 complex datasets covering medical corpora such as books, guidelines, case reports, and knowledge graphs. The datasets are built from open‑source and royalty‑free medical data to assess large language models’ performance on long‑context tasks, particularly in medical applications like electronic health‑record analysis and biomedical terminology standardisation.
The ptb-sss dataset contains electrocardiogram (ECG) data with features such as patient ID, age, sex, ECG array, and index. It consists of a training split with 10 examples, totaling 2,600,290 bytes.