Explore high-quality datasets for your AI and machine learning projects.
The nfcorpus dataset is a medical information retrieval collection comprising 5,371 documents, each with a document ID, URL, title, and abstract. It was introduced by Vera Boteva et al. at ECIR 2016 and serves as the basis for several related splits (e.g., `nfcorpus_dev`, `nfcorpus_test`).