A modern data warehouse with SQL server which includes ETL processes, data modelling and analytics
This is a repository for the Data Warehouse and Analytics Project ! This project's objective is to demonstrate data warehousing and analytical solutions; includes building a data warehouse to generating actionable insights. This is a portfolio project for data engineering endeavours and analysis of the data stored in the warehouse.
The requirements for the project are as listed below:
To develop a data warehouse using SQL server to combine sales data, consequently enabling analytical ad-hoc reporting and informed decision making.
- Data Sources : Import the data from the source system that are available as CSV files
- Data Quality : Cleanse and transform the data suitable for business analysis
- Integration : Combine the 2 source files into a single, user-friendly data model designed for analytical queries.
- Scope : Focus on the latest dataset only and not include historisation in the warehouse.
- Documentation : Provide clear documentation of the data model to support business stakeholders and technical teams.
Develop SQL based analytics to create ad-hoc reports for the following:
- Customer behaviour
- Product Performance
- Sales Trends
data-warehouse-project/
├── datasets/ # Raw CRM and ERP datasets
├── docs/ # Project documentation
│ ├── IntegrationModel.svg # Draw.io diagram
│ └── data_catalog.md # Dataset catalog
├── scripts/ # SQL ETL scripts
│ ├── bronze/ # Raw data extraction
│ ├── silver/ # Data cleaning
│ └── gold/ # Analytical models
├── tests/ # Test scripts
├── README.md # Project overview
├── LICENSE # MIT License
└── .gitignore # Git ignore rules
This project is licensed under the MIT License. You are free to use, modify, and share this project with proper attribution.

