This project focuses on performing Exploratory Data Analysis (EDA) on telecom customer data to understand customer churn behavior. The dataset is sourced from Kaggle and contains customer demographics, account information, and service usage details.
🔗 Dataset Source: Kaggle - Telco Customer Churn
- Rows: 7,043
- Columns: 21
- Target Variable:
Churn(Yes/No) - Contains both categorical and numerical features.
- Loaded dataset using
pandas.
- Dropped irrelevant
customerIDcolumn. - Converted
TotalChargesto numeric type. - Handled missing values by dropping rows with nulls.
- Mapped
Churnvalues:Yes→ChurnedNo→Retained
- Checked data types and basic statistics.
- Created separate DataFrames for
ChurnedandRetainedcustomers. - Analyzed churn distribution, tenure, and billing patterns.
- Customers with shorter tenure are more likely to churn.
- Month-to-month contracts show a higher churn rate.
- Higher monthly charges may be correlated with churn.
- Long-term contract types (1- or 2-year) have lower churn rates.
- Python
- Pandas
- Jupyter Notebook