Hi! This is my Streamlit app for doing Exploratory Data Analysis (EDA)!
What's inside :
- Overview of the data frame
- Statistics (Number of Rows, Number of Columns, Number of Categorical Variables, Number of Numerical Variables, Number of Boolean Variables)
- For Numerical Variables :
  - 5 Number Summary (Min, Q1, Median, Q3, Max)
  - Histogram (showing the frequency of any numerical variable)
  - Scatterplot (of any pair of numerical variables)
- For Categorical Variables :
  - Table of Category Proportions
  - Barplot (showing the proportions of any categorical variable, or a categorical variable against a numerical variable)
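
To give an idea of how the Statistics and the Table of Category Proportions could be computed, here is a minimal pandas sketch. It is not the app's actual code: the function names, the sample data, and the rule "anything that is not boolean or numerical counts as categorical" are assumptions made for illustration.

```python
import pandas as pd
from pandas.api import types as ptypes


def variable_type_counts(df: pd.DataFrame) -> dict:
    """Count rows, columns, and columns per broad variable type."""
    counts = {
        "rows": len(df),
        "columns": df.shape[1],
        "boolean": 0,
        "numerical": 0,
        "categorical": 0,
    }
    for name in df.columns:
        col = df[name]
        # Check boolean first: pandas also reports bool columns as numeric.
        if ptypes.is_bool_dtype(col):
            counts["boolean"] += 1
        elif ptypes.is_numeric_dtype(col):
            counts["numerical"] += 1
        else:
            # Assumption: every remaining column is treated as categorical.
            counts["categorical"] += 1
    return counts


def category_proportions(df: pd.DataFrame, column: str) -> pd.Series:
    """Share of each category in `column` (missing values are excluded)."""
    return df[column].value_counts(normalize=True)


# Hypothetical sample data, only for demonstration.
sample = pd.DataFrame(
    {
        "age": [21, 35, 42, 35],
        "city": ["A", "A", "B", "A"],
        "member": [True, False, True, True],
    }
)
counts = variable_type_counts(sample)
proportions = category_proportions(sample, "city")
```

With this sample, `counts` reports one column of each type, and `proportions` gives the share of each city.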
Instructions :
- Choose a file from your computer that you want to analyze
- If you want to have an overview of the data you uploaded, click "Show Data Frame"
- Select the Data Type from the left sidebar
- For Numerical Variables :
  - Choose bin color
  - Select Plot Type
  - If you choose Histogram :
    - Customize the number of bins by dragging the slider
    - Choose the X-axis variable from the left sidebar
  - If you choose Scatterplot :
    - Choose the X-axis and Y-axis variables from the left sidebar
- For Categorical Variables, customize your barplot :
  - Choose bin color
  - Set Title
  - Choose the X-axis variable by either typing it in the input widget or selecting it from the left sidebar
  - Choose the Y-axis variable by typing it in the input widget
Note :
- NAs are dropped before the 5 Number Summary is calculated for Numerical Variables
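
As a rough illustration of that note, the sketch below drops NAs and then computes the 5 Number Summary with pandas. The function name and the sample values are hypothetical, and the quartiles use pandas' default linear interpolation, which may differ from the convention the app uses.

```python
import pandas as pd


def five_number_summary(values: pd.Series) -> dict:
    """Return Min, Q1, Median, Q3, Max after dropping missing values."""
    clean = values.dropna()  # NAs are removed before any statistic is computed
    return {
        "min": clean.min(),
        "q1": clean.quantile(0.25),   # default linear interpolation (assumption)
        "median": clean.median(),
        "q3": clean.quantile(0.75),
        "max": clean.max(),
    }


# Hypothetical column with one missing value.
ages = pd.Series([1.0, 2.0, None, 3.0, 4.0, 5.0])
summary = five_number_summary(ages)
```

Here the missing entry is ignored, so the summary is computed from the five remaining values only.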