Skip to content
View crocodile27's full-sized avatar
🏠
Working from home
🏠
Working from home

Highlights

  • Pro

Block or report crocodile27

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
crocodile27/README.md

Hi, I’m Anthea 👋

Software Engineer passionate about building scalable systems — from backend data infrastructure to interactive web applications.

I enjoy working across the stack:

  • Designing high-throughput data pipelines
  • Building resilient backend services
  • Deploying cloud-native applications
  • Creating intuitive, user-focused frontend experiences

🚀 What I Work On

🔧 Backend & Data Engineering

  • Config-driven ETL pipelines integrating 10+ heterogeneous data sources
  • REST API ingestion with retry logic & caching layers
  • Knowledge graph construction (Neo4j, KGX)
  • Concurrency & performance optimization (40× speedups, large-scale batch scheduling)
  • CI/CD with pytest, tox, Jenkins
  • Dockerized scheduled ingestion workflows
  • AWS deployments (EC2, Lambda)

🌐 Full-Stack & Frontend

  • React-based applications with dynamic UI and state management
  • Interactive tools deployed to AWS
  • Browser extensions and productivity tools
  • API integration and backend communication layers

⚙️ Systems & Performance

  • Parallel computing (x86, C)
  • GPU workload optimization
  • Memory and runtime profiling
  • Rate-limit testing & concurrency tuning

📌 Featured Projects

🧠 Knowledge Graph Microbe

Config-driven ETL system for large-scale graph construction.

  • Reduced full dataset processing from ~2 years to ~30 hours
  • Reduced API calls by ~33% via HTTP caching
  • Integrated CI/CD + Dockerized execution
  • Neo4j + SPARQL workflows

Repo Link Here


☁️ AWS Course Pathway Platform

Interactive curriculum discovery tool deployed on AWS (EC2 + Lambda).

  • Interactive applications with dynamic state management
  • UX-focused tools combining frontend + backend logic

Repo Link Here


❤️ Assistify

A multimodal visual-audio companion that empowers seniors to navigate smartphones safely, confidently, and independently.

  • Retrieval Augmented Generation database for personalized assistance
  • Speech-to-text text-to-speech hand free guidance
  • iOS and Android parallel development in React using Flutter

Repo Link Here


🛠 Tech Stack

Languages:
Java · Python · C++ · C · JavaScript · Bash

Frontend:
React · HTML · CSS

Backend & Data:
REST APIs · ETL · Neo4j · MongoDB · SQL · SPARQL

Cloud & DevOps:
AWS (EC2, Lambda) · Docker · Jenkins · CI/CD · Git


🎯 What Drives Me

I’m motivated by building systems that are:

  • Scalable
  • Reliable
  • Thoughtfully engineered
  • User-centered

Whether that’s optimizing a data pipeline or designing a clean frontend experience, I enjoy turning complex problems into robust, maintainable software.


📫 Feel free to explore my pinned repositories below!

Pinned Loading

  1. cellColor cellColor Public

    Coloring cells by gene to assist human annotation for segmentation data

    Python 1

  2. bioehs bioehs Public

    Bioengineering Honor Society Website

    TypeScript 1

  3. Knowledge-Graph-Hub/kg-microbe Knowledge-Graph-Hub/kg-microbe Public

    Jupyter Notebook 21 4

  4. dylancc5/assistify dylancc5/assistify Public

    ernie hackathon

    Dart 1