zarguirayen/README.md

👋 Hi, I'm Rayen Zargui, Data Science Student

🎓 About Me

I am a third-year engineering student at ENSTA Paris and a Master's student in Data Science at École Polytechnique (IP Paris), pursuing a dual degree through the academic partnership with:

  • ENIT – National Engineering School of Tunis
  • ENSTA Paris – École Nationale Supérieure des Techniques Avancées

My background combines applied mathematics, probability, statistics, and optimization with advanced data science and machine learning, with a strong focus on real-world, high-impact applications.


🔬 Academic & Technical Interests

  • 📊 Statistical Learning & Data Science
  • 🤖 Machine Learning & Deep Learning (MLP, CNN, RNN, Transformers)
  • 🧠 NLP & Large Language Models (RAG, fine-tuning, prompt engineering)
  • 📈 Time Series & Spatio-Temporal Modeling
  • 🔗 Extreme Value Theory & Dependence Modeling
    (Copulas, Pickands dependence function, spatial extremes)
  • 🌍 Climate, Risk & Energy Data

I am particularly interested in projects that connect theoretical modeling with large-scale, real-world data, emphasizing interpretability, robustness, and high-quality visualizations.


🛠️ Tools & Technologies

  • Languages: Python, R, SQL, C/C++, Bash
  • ML / DL: Scikit-learn, PyTorch, TensorFlow, Optuna
  • NLP & LLMs: Transformers, LangChain, RAG, LoRA, Quantization, SFT / DPO
  • Data & Visualization: Pandas, NumPy, Matplotlib, Seaborn, Power BI
  • Scientific Computing: SciPy, Statsmodels, RStudio
  • Tools: Git, GitHub, Docker (basics), Linux, LaTeX

📌 Selected Projects

🔗 Neural Estimation of Extreme-Value Copulas

Neural estimation of the Pickands dependence function for Gumbel, Galambos, and Tawn copulas, including convex projection, simulation studies, and applications to spatial climate data.
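
For readers unfamiliar with the Pickands dependence function A(t), the sketch below evaluates its closed form for two of the families named above and checks the bounds max(t, 1 - t) <= A(t) <= 1 that any valid A must satisfy. It is a minimal illustration only, not the neural estimator from the project; the function names and the evaluation grid are assumptions made for this example.

```python
import math


def pickands_gumbel(t: float, theta: float) -> float:
    """Pickands function of the Gumbel (logistic) copula, theta >= 1."""
    if theta < 1.0:
        raise ValueError("Gumbel requires theta >= 1")
    return (t ** theta + (1.0 - t) ** theta) ** (1.0 / theta)


def pickands_galambos(t: float, theta: float) -> float:
    """Pickands function of the Galambos copula, theta > 0."""
    if theta <= 0.0:
        raise ValueError("Galambos requires theta > 0")
    if t <= 0.0 or t >= 1.0:
        return 1.0  # boundary values A(0) = A(1) = 1
    return 1.0 - (t ** -theta + (1.0 - t) ** -theta) ** (-1.0 / theta)


def satisfies_bounds(A, grid, tol: float = 1e-12) -> bool:
    """Check max(t, 1 - t) <= A(t) <= 1 on every grid point."""
    return all(max(t, 1.0 - t) - tol <= A(t) <= 1.0 + tol for t in grid)


# Hypothetical evaluation grid on [0, 1], chosen for illustration.
grid = [i / 100 for i in range(101)]

print(satisfies_bounds(lambda t: pickands_gumbel(t, 3.0), grid))
print(satisfies_bounds(lambda t: pickands_galambos(t, 2.0), grid))
```

With theta = 1 the Gumbel function reduces to A(t) = 1 (independence), while larger theta pulls A toward the lower bound max(t, 1 - t) (complete dependence).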

📈 Time Series & Climate Risk Modeling

Spatio-temporal modeling of meteorological and climate data (Météo-France), focusing on extreme events, dependence structures, and risk measures (VaR, TVaR).
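
As a rough illustration of the two risk measures mentioned, the sketch below computes an empirical Value-at-Risk and Tail Value-at-Risk from a sample of losses. It assumes one common convention (VaR as the ceil(alpha * n)-th order statistic, TVaR as the mean of losses at or above VaR); other conventions exist, and the sample data is made up for the example rather than taken from the project.

```python
import math


def value_at_risk(losses, alpha):
    """Empirical VaR: the ceil(alpha * n)-th smallest loss."""
    if not losses or not 0.0 < alpha < 1.0:
        raise ValueError("need a non-empty sample and 0 < alpha < 1")
    ordered = sorted(losses)
    # round() guards against floating-point noise in alpha * n
    k = max(1, math.ceil(round(alpha * len(ordered), 9)))
    return ordered[k - 1]


def tail_value_at_risk(losses, alpha):
    """Empirical TVaR: mean of the losses at or above VaR (one convention)."""
    var = value_at_risk(losses, alpha)
    tail = [x for x in losses if x >= var]
    return sum(tail) / len(tail)


# Synthetic losses 1, 2, ..., 100 (illustrative data only).
losses = [float(x) for x in range(1, 101)]
var_95 = value_at_risk(losses, 0.95)
tvar_95 = tail_value_at_risk(losses, 0.95)
print(var_95, tvar_95)
```

TVaR is never smaller than VaR at the same level, since it averages the tail beyond the quantile rather than reporting the quantile itself.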

🤖 RAG Analyst System

Design of a Retrieval-Augmented Generation (RAG) system with dynamic knowledge bases, traceable answers, and interactive dashboards using LangChain, ChromaDB, and Streamlit.

💳 Credit Card Fraud Detection

End-to-end ML pipeline handling highly imbalanced data, feature selection, model comparison (XGBoost, Random Forest), and robust evaluation (ROC-AUC, F1-score).
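
To show why accuracy alone is misleading on highly imbalanced data, the dependency-free sketch below compares accuracy and F1 for two hypothetical classifiers on a synthetic sample with 1% fraud. The labels, the two prediction vectors, and the helper names are all assumptions for this example; the project itself relies on Scikit-learn and XGBoost.

```python
def confusion_counts(y_true, y_pred):
    """Return (tp, fp, fn, tn) for binary labels where 1 means fraud."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    return tp, fp, fn, tn


def accuracy(y_true, y_pred):
    tp, fp, fn, tn = confusion_counts(y_true, y_pred)
    return (tp + tn) / (tp + fp + fn + tn)


def f1_from_labels(y_true, y_pred):
    """F1 = harmonic mean of precision and recall; 0.0 if no true positives."""
    tp, fp, fn, _ = confusion_counts(y_true, y_pred)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Synthetic sample: 10 fraudulent and 990 legitimate transactions.
y_true = [1] * 10 + [0] * 990
always_legit = [0] * 1000                          # never flags fraud
detector = [1] * 8 + [0] * 2 + [1] * 4 + [0] * 986  # 8 hits, 4 false alarms

for name, y_pred in [("always_legit", always_legit), ("detector", detector)]:
    print(name, accuracy(y_true, y_pred), f1_from_labels(y_true, y_pred))
```

The trivial classifier reaches 99% accuracy while detecting no fraud at all, which is why metrics such as F1 and ROC-AUC are the ones reported above.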

📊 Statistical Analysis & Modeling: mtcars (R)

Exploratory analysis and predictive modeling of fuel consumption (mpg) using the mtcars dataset: correlation analysis and visualization, multiple linear regression, stepwise AIC variable selection, and PCA + Principal Component Regression (PCR), with model comparison using ANOVA.


🎯 Objectives

  • Develop robust, interpretable, and scalable ML models
  • Work on research-driven or high-impact applied data science projects
  • Contribute to domains such as climate, risk, energy, and AI systems
  • Maintain high standards in code quality, reproducibility, and visualization

📫 Contact

Popular repositories

  1. Neural-network-for-extreme-value-copula Public

    This repository contains the complete Python code and analysis pipeline for the paper. The project introduces and validates a novel semi-parametric methodology for estimating the Pickands dependenc…

    Jupyter Notebook

  2. Cedit-Card-Fraud-Detection Public

    A project on detecting fraudulent credit card transactions using Python, Scikit-learn, and XGBoost.

    Jupyter Notebook

  3. Image-Segmentation Public

    This project, developed in C++, sheds light on the strengths and limitations of each segmentation algorithm and explores their suitability for different types of images.

    Jupyter Notebook

  4. Real-Time-RAG-Analyst Public

    This project is a Real-Time RAG (Retrieval-Augmented Generation) Analyst Chatbot.

    Python

  5. datacamp-master Public

    Forked from x-datascience-datacamp/datacamp-master

    Datacamp class for master student - 1 week

    Jupyter Notebook

  6. 2025-assignment-numpy Public

    Forked from x-datascience-datacamp/2025-assignment-numpy

    Assignment 1 for the DataCamp course X-DataScience Master -- numpy

    Python