Data Engineer — Spark, Delta Lake, Python, SQL
Streaming & batch Lakehouse pipelines. Distributed systems background.
Open to contract roles.
Pinned
-
pinterest_databricks_pipeline_simulation
Public. End-to-end streaming Lakehouse pipeline in Databricks using Auto Loader, Delta Lake, and a medallion architecture.
Jupyter Notebook
-
bls-census-housing-analysis
Public. End-to-end data engineering pipeline ingesting U.S. public data (BLS QCEW + Census permits), normalizing raw files, building analytical views, and producing reproducible metrics and visualizations.…
Python