DATASET
Open Source Community
Diabetes Dataset
The dataset comprises information on 442 diabetes patients and is used for predicting disease progression. It originates from the scikit‑learn library and includes features relevant to disease progression forecasting.
Updated 12/28/2023
github
Description
Dataset Overview
Source
- Derived from the
scikit‑learnlibrary, containing data for 442 diabetes patients.
Content
- Features intended for predicting disease progression.
Methodology
Data Analysis
- Data preparation, visualization, and statistical analysis were performed to understand the dataset.
Model Development
- Implemented Multi‑Layer Perceptron (MLP), Linear Regression, and Deep Neural Network (DNN) models using PyTorch.
Training & Validation
- The dataset was split into training (80 %) and validation (20 %) sets for model evaluation.
Results & Validation
Model Evaluation
- Models were trained with the Adam optimizer and mean‑squared‑error loss. Validation losses were recorded to assess performance.
- Training and validation loss curves were plotted to visualize learning dynamics and detect overfitting.
Best Model
- Based on average validation loss, the Linear Regression model performed best.
Performance Metrics
Loss Metrics
- Detailed loss figures and summaries were provided to understand model behavior. Loss histories show how training and validation loss evolve over time.
Model Selection
- The model with the lowest average validation loss (Linear Regression) was selected as the optimal choice.
Conclusion
- The project successfully implemented and compared various machine‑learning models for predicting diabetes progression. The Linear Regression model achieved the best validation loss, demonstrating the effectiveness of the chosen architecture and hyperparameters.
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Login to Access
Please login to view download links and access full dataset details.
Topics
Diabetes
Disease Prediction
Source
Organization: github
Created: 11/5/2023
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.