GitHub - siddhantj12/Beedie-Hackathon: BC Hydro Data analytics code

Project Overview

This repository contains all code, data, and documentation for Beedie Analytics Hackathon 2025—an interdisciplinary analytics challenge hosted by SFU’s Beedie School of Business. Our objective was to develop a data-driven solution addressing the business problem defined by the organizers.

Background & Motivation

Problem Statement
Describe the core challenge posed during this hackathon (e.g., “Optimize retail inventory allocation using predictive analytics”).
Why It Matters
Explain why solving this problem has tangible benefits—reducing stockouts, improving customer satisfaction, or minimizing holding costs.
Approach Summary
Provide a high-level overview of your solution—whether it’s a machine learning model, interactive dashboard, or optimization algorithm.

Features

Data Ingestion & Cleaning
Scripts to load raw data, handle missing values, and normalize features for downstream modeling.
Exploratory Data Analysis (EDA)
Jupyter notebooks showcasing key visualizations, statistical summaries, and correlation analyses.
Model Training & Evaluation
Implementation of one or more predictive models (e.g., Random Forest, XGBoost) with hyperparameter tuning and performance metrics.
Interactive Dashboard (Optional)
A web-based dashboard (built with Streamlit or Dash) allowing users to explore results and scenario-test parameters.
Deployment Scripts
Instructions/API endpoints to deploy the final model as a RESTful service or containerized application.

Installation

These steps assume you have Python 3.7+ installed.

Clone the repository

git clone https://github.com/siddhantj12/Beedie-Hackathon.git
cd Beedie-Hackathon


2. **Create a virtual environment**

   ```bash
   python3 -m venv venv
   source venv/bin/activate
   ```

3. **Install dependencies**

   ```bash
   pip install --upgrade pip
   pip install -r requirements.txt
   ```

4. **(Optional) Docker Setup**

   ```bash
   docker build -t beedie-hackathon .
   docker run -p 8501:8501 beedie-hackathon
   ```

---

## Usage

1. **Data Preprocessing**

   ```bash
   python scripts/preprocess_data.py \
       --input data/raw/your_data.csv \
       --output data/processed/clean_data.csv
   ```
2. **Run Exploratory Data Analysis**

   ```bash
   jupyter notebook notebooks/EDA.ipynb
   ```
3. **Train Model**

   ```bash
   python scripts/train_model.py \
       --train_data data/processed/clean_data.csv \
       --model_output models/final_model.pkl
   ```
4. **Evaluate Model**

   ```bash
   python scripts/evaluate_model.py \
       --model models/final_model.pkl \
       --test_data data/processed/test.csv
   ```
5. **Launch Dashboard (if available)**

   ```bash
   streamlit run app/dashboard.py
   ```

---

## Data

* **Raw Data Source**

  * *Filename:* `data/raw/your_data.csv`
  * *Description:* Briefly summarize the dataset (e.g., “Sales transactions for a major retailer from Jan 2020 to Dec 2021”).
* **Processed Data**

  * All intermediate features, cleaned tables, and train/test splits are stored under `data/processed/`.
* **Data Dictionary**

  * File: `data/data_dictionary.md`
  * Explains each column, data type, and possible values.

---

## Modeling & Implementation

1. **Algorithms Explored**

   * *Random Forest:* Baseline tree-based model for feature importance and interpretability.
   * *XGBoost:* Gradient boosting for improved predictive performance.
   * *Linear Models:* Lasso/Ridge for baseline regression/classification.
2. **Evaluation Metrics**

   * *Regression:* RMSE, MAE, $R^2$.
   * *Classification:* Accuracy, Precision, Recall, AUC-ROC.
3. **Hyperparameter Tuning**

   * Used scikit-learn’s `GridSearchCV` with 5-fold cross-validation.
4. **Final Model Performance**

   * Summarize best-in-class results (e.g., “Our XGBoost model achieved an RMSE of 12.5 on the holdout set”).

---

## Project Structure

```
Beedie-Hackathon/
├── data/
│   ├── raw/
│   │   └── your_data.csv
│   ├── processed/
│   │   ├── clean_data.csv
│   │   └── test.csv
│   └── data_dictionary.md
├── models/
│   └── final_model.pkl
├── notebooks/
│   └── EDA.ipynb
├── scripts/
│   ├── preprocess_data.py
│   ├── train_model.py
│   └── evaluate_model.py
├── app/
│   └── dashboard.py
├── requirements.txt
├── Dockerfile
└── README.md
```

---

## Team Members

* **Siddhant Jain**
* **Enya Zeng**
* **Ryan Lee**

---

## How to Contribute

1. **Fork the Repository** and create a feature branch:

   ```bash
   git checkout -b feature/YourFeatureName
   ```
2. **Make Your Changes** (code, notebooks, or documentation).
3. **Ensure Tests Pass** (if any tests exist).
4. **Submit a Pull Request** with a clear description of your contribution.
5. **Maintain Code Style:** Follow PEP 8 for Python, add docstrings, and use consistent notebook formatting.

---

## Acknowledgments

* **SFU Beedie School of Business** for hosting the hackathon.
* **Open-Source Libraries:** scikit-learn, pandas, NumPy, Matplotlib, Seaborn, Streamlit.
* **README Inspiration:** Derived structure from common hackathon README best practices.


<p align="center">“Made with ♥ at SFU Beedie Hackathon 2025”</p>

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.DS_Store		.DS_Store
1.png		1.png
2.png		2.png
3.png		3.png
Aux_Elec_Equip_AvgBufferTime.png		Aux_Elec_Equip_AvgBufferTime.png
Aux_Elec_Equip_AvgLeadTime.png		Aux_Elec_Equip_AvgLeadTime.png
Aux_Elec_Equip_AvgVendorScore.png		Aux_Elec_Equip_AvgVendorScore.png
Aux_Elec_Equip_BufferForecast.png		Aux_Elec_Equip_BufferForecast.png
Aux_Elec_Equip_Canada_BufferForecast.png		Aux_Elec_Equip_Canada_BufferForecast.png
Aux_Elec_Equip_Canada_TCO.png		Aux_Elec_Equip_Canada_TCO.png
Aux_Elec_Equip_Canada_TRI.png		Aux_Elec_Equip_Canada_TRI.png
Aux_Elec_Equip_CurrentTariff.png		Aux_Elec_Equip_CurrentTariff.png
Aux_Elec_Equip_EffBufferIndex.png		Aux_Elec_Equip_EffBufferIndex.png
Aux_Elec_Equip_EffCostIndex.png		Aux_Elec_Equip_EffCostIndex.png
Aux_Elec_Equip_USA_BufferForecast.png		Aux_Elec_Equip_USA_BufferForecast.png
Aux_Elec_Equip_USA_TCO.png		Aux_Elec_Equip_USA_TCO.png
Aux_Elec_Equip_USA_TRI.png		Aux_Elec_Equip_USA_TRI.png
BC Hydro - All Vendor Data.xlsx		BC Hydro - All Vendor Data.xlsx
BC Hydro - Category Description.xlsx		BC Hydro - Category Description.xlsx
BC Hydro - SC risk tolerance by category.xlsx		BC Hydro - SC risk tolerance by category.xlsx
BC Hydro - Vendor and Inventory Data.xlsx		BC Hydro - Vendor and Inventory Data.xlsx
Beedie Hackathon - Detailed Case Study - BC Hydro.pdf		Beedie Hackathon - Detailed Case Study - BC Hydro.pdf
Distribution_Transf_AvgBufferTime.png		Distribution_Transf_AvgBufferTime.png
Distribution_Transf_AvgLeadTime.png		Distribution_Transf_AvgLeadTime.png
Distribution_Transf_AvgVendorScore.png		Distribution_Transf_AvgVendorScore.png
Distribution_Transf_BufferForecast.png		Distribution_Transf_BufferForecast.png
Distribution_Transf_Canada_BufferForecast.png		Distribution_Transf_Canada_BufferForecast.png
Distribution_Transf_Canada_TCO.png		Distribution_Transf_Canada_TCO.png
Distribution_Transf_Canada_TRI.png		Distribution_Transf_Canada_TRI.png
Distribution_Transf_CurrentTariff.png		Distribution_Transf_CurrentTariff.png
Distribution_Transf_EffBufferIndex.png		Distribution_Transf_EffBufferIndex.png
Distribution_Transf_EffCostIndex.png		Distribution_Transf_EffCostIndex.png
Doing Business - World Bank - Export Import Data.xlsx		Doing Business - World Bank - Export Import Data.xlsx
EVERYTHING.csv		EVERYTHING.csv
EVERYTHING.xlsx		EVERYTHING.xlsx
Figure_1.png		Figure_1.png
Figure_2.png		Figure_2.png
Imports from trading partners - HS Code specific.xlsx		Imports from trading partners - HS Code specific.xlsx
India_lowrisk_metrics.csv		India_lowrisk_metrics.csv
Japan_lowrisk_metrics.csv		Japan_lowrisk_metrics.csv
Logistics Performance Index (LPI) - 2023.xlsx		Logistics Performance Index (LPI) - 2023.xlsx
Low_risk.py		Low_risk.py
README.md		README.md
Switchgear_AvgBufferTime.png		Switchgear_AvgBufferTime.png
Switchgear_AvgLeadTime.png		Switchgear_AvgLeadTime.png
Switchgear_AvgVendorScore.png		Switchgear_AvgVendorScore.png
Switchgear_BufferForecast.png		Switchgear_BufferForecast.png
Switchgear_Canada_BufferForecast.png		Switchgear_Canada_BufferForecast.png
Switchgear_Canada_BufferTime_Forecast.png		Switchgear_Canada_BufferTime_Forecast.png
Switchgear_Canada_TCO.png		Switchgear_Canada_TCO.png
Switchgear_Canada_TRI.png		Switchgear_Canada_TRI.png
Switchgear_CurrentTariff.png		Switchgear_CurrentTariff.png
Switchgear_EffBufferIndex.png		Switchgear_EffBufferIndex.png
Switchgear_EffCostIndex.png		Switchgear_EffCostIndex.png
WTO tariff rate data - HS Code specific.xlsx		WTO tariff rate data - HS Code specific.xlsx
Wire_And_Cable_AvgBufferTime.png		Wire_And_Cable_AvgBufferTime.png
Wire_And_Cable_AvgLeadTime.png		Wire_And_Cable_AvgLeadTime.png
Wire_And_Cable_AvgVendorScore.png		Wire_And_Cable_AvgVendorScore.png
Wire_And_Cable_BufferForecast.png		Wire_And_Cable_BufferForecast.png
Wire_And_Cable_CurrentTariff.png		Wire_And_Cable_CurrentTariff.png
Wire_And_Cable_EffBufferIndex.png		Wire_And_Cable_EffBufferIndex.png
Wire_And_Cable_EffCostIndex.png		Wire_And_Cable_EffCostIndex.png
analyze_supply_chain.py		analyze_supply_chain.py
category_country_tco.csv		category_country_tco.csv
category_country_tri.csv		category_country_tri.csv
category_tariff_spend_impact.csv		category_tariff_spend_impact.csv
country_vendor_performance_summary.csv		country_vendor_performance_summary.csv
forecast_5y_lead_time.csv		forecast_5y_lead_time.csv
forecast_5y_with_tariff.csv		forecast_5y_with_tariff.csv
forecast_6m_lead_time.csv		forecast_6m_lead_time.csv
forecast_6m_with_tariff.csv		forecast_6m_with_tariff.csv
lead_time_components_6m.png		lead_time_components_6m.png
lead_time_components_6m_with_tariff.png		lead_time_components_6m_with_tariff.png
lead_time_forecast_5y.png		lead_time_forecast_5y.png
lead_time_forecast_5y_with_tariff.png		lead_time_forecast_5y_with_tariff.png
lead_time_forecast_6m.png		lead_time_forecast_6m.png
lead_time_forecast_6m_with_tariff.png		lead_time_forecast_6m_with_tariff.png
leadtime_forecast_5years.csv		leadtime_forecast_5years.csv
leadtime_forecast_5years.png		leadtime_forecast_5years.png
leadtime_forecast_6months.csv		leadtime_forecast_6months.csv
leadtime_forecast_6months.png		leadtime_forecast_6months.png
leadtime_forecast_components.png		leadtime_forecast_components.png
lo.py		lo.py
lol.py		lol.py
long-term.py		long-term.py
mvp_6m_holt-winters.png		mvp_6m_holt-winters.png
mvp_monthly_lead_time.png		mvp_monthly_lead_time.png
neiw.py		neiw.py
prophetpred.py		prophetpred.py
rec.py		rec.py
requirements.txt		requirements.txt
time-series.py		time-series.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Project Overview

Background & Motivation

Features

Installation

About

Uh oh!

Releases

Packages

Languages

siddhantj12/Beedie-Hackathon

Folders and files

Latest commit

History

Repository files navigation

Project Overview

Background & Motivation

Features

Installation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages