π‘ Data Scientist with experience across Sports Analytics, Biomedical AI, and Autonomous Vehicle domain
π Patent Holder β Patented algorithm in Data Science (based on my Masterβs thesis)
π M.Sc. Informatics (Machine Learning & Analytics) β Technical University of Munich
βοΈ Passionate about building production-grade ML systems that are scalable, modular, and impactful
I build scalable Python applications, real-time data pipelines, and end-to-end projects in machine learning, data science, and Generative AI. Every solution follows industry-standard best practices, combining technical depth with real-world impact to deliver results that are both robust and commercially viable.
1οΈβ£ β½ Football Stream Processor
Tech Stack: Python, Poetry, Streamlit, MLflow, Optuna, Docker, CI/CD
- Real-time match analytics and player pass network visualizations
- End-to-end ML pipeline for pass outcome prediction, trained using match event data
- Integrated with MLflow for model tracking & reproducibility, and Optuna for hyperparameter optimization
- Modular architecture: separate data ingestion, processing, modeling, and visualization layers
- Built with production-readiness in mind β includes structured logging, environment isolation, and CI workflows
2οΈβ£ ResearchAI (Work in Progress)
Tech Stack: FastAPI, Apache Airflow, OpenSearch (BM25 + vectors), PostgreSQL, MinIO, Gradio, Docker Compose, Poetry (Ollama planned)
- Production-grade RAG with hybrid retrieval (BM25 + embeddings); infra-first Docker setup with service health checks
- Automated ingestion (arXiv β MinIO/Postgres) via Airflow; modular OOP design for swappable chunkers, embedders, and vector stores
- API + UI: FastAPI
/askwith cited answers (WIP), Gradio frontend, CI scaffolding; evals via Recall@k/MRR; VPS-ready deployment
π Read the blog
π» Access the webapp
3οΈβ£ ClimaCast β ETL Weather Pipeline (Work in Progress)
Tech Stack: Apache Airflow, AWS S3, FastAPI, Pandas, Requests
- Production-grade ETL for weather data
- Orchestrated with Airflow DAGs, stored in S3 for scalability
- Designed to showcase data engineering best practices
π‘ This profile is a growing showcase of my work in ML, Data Engineering, and GenAI. Each project follows industry-standard engineering practices, with a focus on scalability, modularity, and clarity.



