- Predict or test model @ https://forestclassclassification.streamlit.app/
- Understanding of features with Definitions.
- Creating Pickle files for One hot Encoded and Label Encoded Features.
- Preparing and Exporting the Dataset for Exploratory Data Analysis.
- Understanding the data.
- Finding out if the data is imbalanced.
- Outliers detection.
- Skewness and Kurtosis Detection.
- Univariate and Bivariate Analysis
- Making the data features normally distributed.
- Training With different Classification Model
- Logistic Regression
- KNN Classifier
- Decision Tree
- Random Forest Classifier
- Balanced Random Forest Classifier
- Xtreme Gradient Boost Classifier
- Hypertuning Each algo to get the best fit.
- Saving the best model into a pickle file and using for future predictions
- This process is done because github file size restriction is 25MB.
- But the Model here was more than 25MB.
- Reading Pickle Files from AWS S3
- The credentials will not be initialized in streamlit.py file but in streamlit environment for data security.
- Creating Manual Input and also Slider drag drop input for entering feature values
- Inputing data and getting the predictions in the application.