Back to datasets
Dataset assetOpen Source CommunityDiabetesDisease Prediction

Diabetes Dataset

The dataset comprises information on 442 diabetes patients and is used for predicting disease progression. It originates from the scikit‑learn library and includes features relevant to disease progression forecasting.

Source
github
Created
Nov 5, 2023
Updated
Dec 28, 2023
Signals
503 views
Availability
Linked source ready
Overview

Dataset description and usage context

Dataset Overview

Source

  • Derived from the scikit‑learn library, containing data for 442 diabetes patients.

Content

  • Features intended for predicting disease progression.

Methodology

Data Analysis

  • Data preparation, visualization, and statistical analysis were performed to understand the dataset.

Model Development

  • Implemented Multi‑Layer Perceptron (MLP), Linear Regression, and Deep Neural Network (DNN) models using PyTorch.

Training & Validation

  • The dataset was split into training (80 %) and validation (20 %) sets for model evaluation.

Results & Validation

Model Evaluation

  • Models were trained with the Adam optimizer and mean‑squared‑error loss. Validation losses were recorded to assess performance.
  • Training and validation loss curves were plotted to visualize learning dynamics and detect overfitting.

Best Model

  • Based on average validation loss, the Linear Regression model performed best.

Performance Metrics

Loss Metrics

  • Detailed loss figures and summaries were provided to understand model behavior. Loss histories show how training and validation loss evolve over time.

Model Selection

  • The model with the lowest average validation loss (Linear Regression) was selected as the optimal choice.

Conclusion

  • The project successfully implemented and compared various machine‑learning models for predicting diabetes progression. The Linear Regression model achieved the best validation loss, demonstrating the effectiveness of the chosen architecture and hyperparameters.
Need downstream help?

Pair the dataset with AI analysis and content workflows.

Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.

Explore AI studio