Airport survey analysis using Pandas and Sklearn machine learning tools: Random Forest, Support Vector Machines, Decision Tree, KNN, Logistic Regression, Linear SVC, Sthocastic Gradient Descent, Gaussian Naive Bayes and Perceptron.
My goal is to find the best machine learning tool that can predict the variable "Airport Rating" based on other variables.
Basic steps I followed:
- Import survey (.csv file)
- Check data (1400 rows, 47 columns/variables):
- Data processed using Pandas
- Check correlation between variables:
- Train sklearn maching learning models
- Test models (= predict airport_rating)
- Compare prediction's accuracy:
Random Forest is the best model predicting airport_rating with 63% accuracy

