This project focuses on analyzing hotel booking data, with a special interest in rejected/canceled bookings across different cities, market segments, and time periods
- Data Cleaning β preparing and standardizing the raw dataset.
- Exploratory Data Analysis (EDA) β identifying booking rejection patterns.
- Data Modeling (Star Schema) β building a Fact Table and Dimension Tables to enable deeper business insights.
Data Cleaning
- Imported raw dataset.
- Handled missing values.
- Standardized column formats (dates, cities, markets).
- Removed duplicates.
- Ensured data quality and readiness for analysis.
Data Modeling (Star Schema)
- To structure the data for analysis, I designed a Star Schema with one Fact Table and several Dimension tables
Data Analysis
- Calculated the percentage of rejected bookings relative to total bookings.
- Compared rejection rates across cities.
- Analyzed rejection rates by market segment.
- Tracked seasonal trends in rejections across months.
- Python (Pandas)
- Jupyter Notebook
- Power BI
- Canva (for infographics & visual design)
- Git & GitHub (for documentation and version control)