We are submitting three notebooks. Please run them in Google Colaboratory in the following order:
- Wrangling_And_EDA.ipynb
  - This notebook downloads the main dataset CSV from the public link of the Murder Accountability Project (CSV File link).
  - For geo-mapping, we need the USA map shapefile, which is stored in this Public Drive link, accessible only via SEAS email accounts. You will need to mount your GDrive and add a shortcut to this folder (`CIS545_Final_Project_Files`) in your `My Drive` location. All further data is stored in and read from this folder.
  - The above Drive link also holds the `income_state.csv` file, which was downloaded from the US Census website.
  - Finally, the EDA output is saved to your local Drive folder copy as `decoded_data_with_avg.csv`. This file is essential for the second notebook.
- Modeling_And_Bias.ipynb
  - Again, your GDrive must be mounted, and this notebook expects the file `decoded_data_with_avg.csv` to be present. That file is created after successful execution of the above notebook.
  - Two pickle files, `all_encoded_data_offsex.pickle` and `feature_data_offsex.pickle`, will be stored in your local Drive folder copy.
- Interpretability.ipynb
  - GDrive is mounted again, and both `all_encoded_data_offsex.pickle` and `feature_data_offsex.pickle` are expected to be present. Both files are created after successful execution of the above notebook.
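
Because each notebook depends on files written by the previous one, it can help to check for the expected inputs before running a notebook. The following is a minimal sketch, not part of the submitted notebooks. It assumes the shortcut sits at `/content/drive/MyDrive/CIS545_Final_Project_Files` after mounting, and the names `PROJECT_DIR`, `REQUIRED_INPUTS`, and `missing_inputs` are hypothetical. The shapefile is left out because its file name is not given above.

```python
from pathlib import Path

# Assumption: in Colab, Drive has already been mounted with
#   from google.colab import drive
#   drive.mount('/content/drive')
# and the shared folder shortcut was added directly under My Drive.
PROJECT_DIR = Path("/content/drive/MyDrive/CIS545_Final_Project_Files")

# Input files each notebook expects to find, as described in the list above.
REQUIRED_INPUTS = {
    "Wrangling_And_EDA.ipynb": ["income_state.csv"],
    "Modeling_And_Bias.ipynb": ["decoded_data_with_avg.csv"],
    "Interpretability.ipynb": [
        "all_encoded_data_offsex.pickle",
        "feature_data_offsex.pickle",
    ],
}


def missing_inputs(notebook, project_dir=PROJECT_DIR):
    """Return the required input files for `notebook` that are not present yet."""
    project_dir = Path(project_dir)
    return [
        name
        for name in REQUIRED_INPUTS[notebook]
        if not (project_dir / name).is_file()
    ]
```

Calling `missing_inputs("Modeling_And_Bias.ipynb")` in a Colab cell returns an empty list once `Wrangling_And_EDA.ipynb` has finished successfully. A non-empty list names the files that still need to be produced by an earlier notebook.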