Dataset asset, Open Source Community, Machine Learning, Dataset Classification
ML-learning-datasets
A collection of machine‑learning training datasets provided by Data Science Dojo. Currently includes 43 datasets, categorized into classification‑clustering and regression tasks, with difficulty levels easy, medium, and hard. Each dataset folder contains a README.md that details basic information, feature description, data source, etc.
Source
github
Created
Nov 7, 2024
Updated
Nov 7, 2024
Overview
Dataset description and usage context
- Number of Datasets: 43
- Source: Data Science Dojo
- Classification:
  - By Task Type: classification‑clustering, regression
  - By Difficulty: easy, medium, hard
- Some datasets may belong to both regression and classification‑clustering categories.
Structure
- Each dataset folder includes a README.md providing basic information, feature description, data source, and more.
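Because each dataset folder carries its own README.md, a local clone can be inventoried by looking for those files. The following is a minimal sketch, not part of the repository: the function name `find_dataset_folders` and the clone path are hypothetical, and it assumes only that a dataset folder is any subfolder that directly contains a README.md. The actual folder layout may differ.

```python
from pathlib import Path


def find_dataset_folders(root):
    """Return sorted folder paths (relative to root) that directly contain a README.md.

    Assumption: any subfolder holding a README.md is a dataset folder.
    The top-level README.md of the repository itself is skipped.
    """
    root = Path(root)
    folders = []
    for readme in root.rglob("README.md"):
        folder = readme.parent
        if folder == root:
            continue  # the repository's own README, not a dataset
        folders.append(folder.relative_to(root).as_posix())
    return sorted(folders)


# Example call; "ML-learning-datasets" is a hypothetical local clone path:
# for folder in find_dataset_folders("ML-learning-datasets"):
#     print(folder)
```

Comparing the length of the returned list with the stated count of 43 gives a quick sanity check that the clone is complete, provided the assumption about README.md placement holds.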
Contribution
- New datasets are welcome.