Skip to content

kaoutarmi/Diabetes-Prediction-System

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

4 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🩺 Diabetes Prediction Using Machine Learning

Welcome to the Diabetes Prediction project! πŸŽ‰ The goal of this project is to predict whether a person has diabetes using various machine learning algorithms. We focus on applying data cleaning, visualization, and modeling techniques to build accurate prediction models.

🎯 Objective

  • 🧹 Clean the data to ensure high-quality inputs.
  • πŸ“Š Visualize the data to better understand patterns and correlations.
  • πŸ€– Train machine learning models to predict diabetes.
  • πŸ’‘ Evaluate model performance using several evaluation metrics.

πŸ› οΈ Techniques Used

  1. Data Cleaning 🧼: Removing missing values, handling outliers, and preparing the data for modeling.
  2. Data Visualization πŸ“Š: Analyzing and visualizing the data to understand patterns and trends.
  3. Machine Learning Modeling πŸ€–: Training multiple machine learning models to predict diabetes.

🧠 Algorithms Used

  1. Logistic Regression πŸ§‘β€πŸ’Ό
  2. Support Vector Machine (SVM) πŸ”²
  3. K-Nearest Neighbors (KNN) πŸ”
  4. Random Forest Classifier 🌳
  5. Naive Bayes πŸ§‘β€πŸ”¬
  6. Gradient Boosting πŸ”₯

πŸ“ˆ Model Evaluation Methods Used

  1. Accuracy Score βœ…: Measures how often the model makes correct predictions.
  2. ROC AUC Curve πŸ“‰: Evaluates the trade-off between true positive rate and false positive rate.
  3. Cross-Validation πŸ”„: Splitting the data into different subsets to ensure the model performs well on unseen data.
  4. Confusion Matrix πŸ“Š: Provides a breakdown of prediction errors, including false positives, false negatives, true positives, and true negatives.

πŸ“¦ Dependencies

To run this project, you will need the following libraries:

  • pandas πŸ“‘
  • numpy πŸ”’
  • matplotlib πŸ“Š
  • seaborn 🎨
  • scikit-learn πŸ§‘β€πŸ’»

You can install the dependencies by running:

pip install pandas numpy matplotlib seaborn scikit-learn

Releases

No releases published

Packages

 
 
 

Contributors