Explore high-quality datasets for your AI and machine learning projects.
This dataset is a Yahoo Answers topic‑classification dataset for text‑classification tasks. It contains 1.4 million training examples and 60 000 test examples. Each example includes a question title, question content, the best answer, and the corresponding topic label. The topic labels cover ten categories such as Society & Culture, Science & Mathematics, Health, etc. The dataset language is English and it is monolingual.