saadkhalmadani/README.md

Hi there 👋

My name is Saad!

DevOps Engineer & Cloud Enthusiast (Azure) | Data Engineer (Python, SQL, Spark, Airflow)

Based in Morocco 🇲🇦, I'm passionate about transforming raw data into actionable insights through automated, scalable, and reliable data infrastructure.

I architect end-to-end data solutions that bridge the gap between data engineering and DevOps, ensuring data flows seamlessly from source to insight with enterprise-grade reliability.

Core Philosophy:
  "Data without automation is just expensive storage.
   Automation without monitoring is just expensive chaos."
   
Current Mission:
  Building next-generation data platforms that scale effortlessly
  and deliver real-time insights that drive business decisions.

🛠️ Tech Stack

Programming & Scripting

Python Bash

Data Processing & Orchestration

Spark Kafka NiFi Kylin Pandas Polars PySpark NumPy Airflow Databricks

Databases & Storage

PostgreSQL MySQL MongoDB Redis Hive

Modeling & Architecture

Snowflake Schema Dimensional Modeling ETL/ELT

Cloud & Big Data
Cloud Providers

GCP Azure

Azure Data Services

Azure Data Lake Azure SQL Database Azure Cosmos DB Azure Databricks Azure Synapse Analytics Azure Data Factory Azure Blob Storage Azure Functions Azure Key Vault Azure Event Hubs Azure Logic Apps

Big Data

Hadoop

Visualization & Analytics

Power BI Superset Matplotlib Plotly Streamlit

DevOps & Infrastructure

Docker Kubernetes Git GitHub Actions GitLab CI/CD Terraform


🚀 Featured Projects

Enterprise-Grade Data Flow Automation Platform

GitHub

What it does:

  • Automates Apache NiFi data flow deployments across multiple environments
  • Implements GitOps workflow with branch-based promotion strategy
  • Integrates version control with NiFi Registry for complete audit trails
  • Eliminates manual deployment errors through intelligent automation

Business Impact:

Deployment Time: 2 hours → 15 minutes (87.5% reduction)
Manual Errors: 15% failure rate → 0% (100% elimination)
Developer Onboarding: 2 days → 4 hours (75% reduction, counting 8-hour working days)
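
The percentages above can be re-derived with a few lines of Python. This is only a sanity-check sketch: the helper name `percent_reduction` is hypothetical, and the onboarding figure assumes 8-hour working days, which is the reading under which 2 days to 4 hours comes out at 75%.

```python
def percent_reduction(before: float, after: float) -> float:
    """Return the relative reduction from `before` to `after`, in percent."""
    if before <= 0:
        raise ValueError("before must be positive")
    return (before - after) / before * 100


# Deployment time: 2 hours (120 minutes) down to 15 minutes.
deployment = percent_reduction(120, 15)

# Manual errors: 15% failure rate down to 0%.
errors = percent_reduction(15, 0)

# Onboarding: 2 days down to 4 hours.
# ASSUMPTION: a "day" is an 8-hour working day, so 2 days = 16 hours.
onboarding = percent_reduction(2 * 8, 4)

print(deployment, errors, onboarding)
```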

Technical Architecture:

Infrastructure: Terraform → Azure (Dev/Staging/Prod environments)
CI/CD Pipeline: GitHub Actions → Change Detection → Automated Deployment
Version Control: NiFi Registry ↔ Git Hooks → Automatic Synchronization
GitOps Flow: develop → staging → main (PR-based promotion)
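
As a rough illustration of the branch-based promotion rule, the sketch below checks that a pull request only moves one step forward along develop → staging → main. The branch-to-environment mapping and both function names are assumptions made for the example, not code taken from the project.

```python
# Promotion order as listed in the GitOps flow above.
PROMOTION_ORDER = ["develop", "staging", "main"]

# ASSUMPTION: each branch deploys to the environment of the same rank.
BRANCH_TO_ENV = {"develop": "dev", "staging": "staging", "main": "prod"}


def target_environment(branch: str) -> str:
    """Return the environment a branch deploys to (hypothetical mapping)."""
    if branch not in BRANCH_TO_ENV:
        raise ValueError(f"branch {branch!r} is not part of the promotion flow")
    return BRANCH_TO_ENV[branch]


def is_valid_promotion(source: str, target: str) -> bool:
    """Allow a PR only when it promotes exactly one step forward."""
    if source not in PROMOTION_ORDER or target not in PROMOTION_ORDER:
        return False
    return PROMOTION_ORDER.index(target) - PROMOTION_ORDER.index(source) == 1


print(is_valid_promotion("develop", "staging"))  # one step forward
print(is_valid_promotion("develop", "main"))     # skips staging
print(target_environment("main"))
```

A CI job could run a check like this on the PR's source and target branches before triggering a deployment, so that a change cannot skip staging.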

Intelligent Data Scraping & Visualization Platform

Live Demo GitHub

Demo Credentials: bob / bobpass

What it does:

  • Extracts and structures complex gaming data from web sources
  • Implements robust data validation and quality checks
  • Provides interactive analytics through modern dashboards
  • Supports multiple export formats with optimized performance

Technical Highlights:

Architecture: Web Scraping → Data Processing → Storage → Visualization
Pipeline: Selenium + BeautifulSoup → Pandas → PostgreSQL → Streamlit
Performance: Real-time filtering • Advanced search • Export optimization
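
To give a feel for the validation step between scraping and storage, here is a minimal sketch in plain Python. It deliberately uses the standard library instead of Pandas so it runs anywhere, and the record fields (`name`, `level`) and function names are hypothetical, not the project's actual schema.

```python
from typing import Any

# ASSUMPTION: hypothetical schema for a scraped record.
REQUIRED_FIELDS = ("name", "level")


def validate_record(record: dict[str, Any]) -> list[str]:
    """Return a list of problems; an empty list means the record is clean."""
    problems = []
    for field in REQUIRED_FIELDS:
        value = record.get(field)
        if value is None or str(value).strip() == "":
            problems.append(f"missing {field}")

    level = record.get("level")
    if level is not None and str(level).strip() != "":
        try:
            if int(level) < 1:
                problems.append("level must be >= 1")
        except (TypeError, ValueError):
            problems.append("level is not an integer")
    return problems


def split_valid(records: list[dict[str, Any]]):
    """Separate clean records from rejected ones, keeping the reasons."""
    valid, rejected = [], []
    for record in records:
        problems = validate_record(record)
        if problems:
            rejected.append((record, problems))
        else:
            valid.append(record)
    return valid, rejected


rows = [
    {"name": "Monster A", "level": "12"},
    {"name": "", "level": "3"},
    {"name": "Monster B", "level": "abc"},
]
valid, rejected = split_valid(rows)
print(len(valid), "valid,", len(rejected), "rejected")
```

Keeping the rejection reasons next to each record makes it easier to see whether a failure comes from the scraper or from the source page.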

Enterprise IoT Data Streaming Platform

GitHub

What it does:

  • Ingests high-volume IoT sensor data in real-time
  • Implements Change Data Capture for database synchronization
  • Processes streaming data with fault-tolerant architecture
  • Delivers actionable insights through interactive dashboards

Technical Architecture:

Data Flow: IoT Sensors → Kafka → Spark → PostgreSQL → Superset
CDC Pipeline: Database Changes → Debezium → Kafka → Stream Processing
Monitoring: Real-time metrics • Data quality validation • Alert systems
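
The CDC leg of the pipeline can be pictured as replaying Debezium change events onto a replica. The sketch below is a simplified illustration, not the project's code: it assumes the event payload has already been read from Kafka and unwrapped into a dict, that rows have an `id` primary key, and the sample sensor rows are made up. It relies only on the `op`, `before`, and `after` fields of the Debezium event envelope.

```python
def apply_change(table: dict, event: dict, key: str = "id") -> None:
    """Apply one Debezium-style change event to an in-memory replica.

    `op` is "c" (create), "r" (snapshot read), "u" (update) or "d" (delete).
    """
    op = event["op"]
    if op in ("c", "r", "u"):
        row = event["after"]
        table[row[key]] = row
    elif op == "d":
        table.pop(event["before"][key], None)
    else:
        raise ValueError(f"unsupported op {op!r}")


# ASSUMPTION: made-up sensor rows, already unwrapped from the Kafka message.
events = [
    {"op": "c", "before": None, "after": {"id": 1, "temperature": 21.5}},
    {"op": "c", "before": None, "after": {"id": 2, "temperature": 19.0}},
    {"op": "u", "before": {"id": 1, "temperature": 21.5},
     "after": {"id": 1, "temperature": 22.0}},
    {"op": "d", "before": {"id": 2, "temperature": 19.0}, "after": None},
]

replica: dict = {}
for event in events:
    apply_change(replica, event)

print(replica)
```

In the real pipeline the same logic would sit inside the stream-processing job; the in-memory dict only stands in for the target table.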

📊 GitHub Analytics

[Stats widgets: contribution overview, streak and activity, detailed stats, productive time, top languages by repo and by commit]


💬 Let's Connect & Build Something Amazing!

LinkedIn GitHub

Pinned

  1. DofusDataForge-project (Public)

    🐉 Data scraping + visualization project for Dofus Touch monsters | 📊 Streamlit dashboards | 💾 PostgreSQL storage

    Python 1

  2. realtime-cdc-streaming-project (Public)

    ⚡ IoT Data Streaming | 🔄 Real-time ingestion with Kafka | 🔧 CDC with Debezium | 📊 Analytics with Superset

    Python