yash194

Yash Aggarwal

Building multi-agent systems, RL swarms, and practical AI products.

🔥 What I’m focused on

Multi-Agent RL + Swarm Intelligence (coverage, search, coordination, energy-aware policies)
Agentic systems (debate + memory + distillation → “reasoning that improves over time”)
Applied AI products (automation, NLP classification, developer tooling)

🚀 Flagship Projects (Proof > Claims)

🛰️ GASAMC — UAV Swarm RL Framework

Graph-attention + Soft Actor-Critic + multi-critic training for cooperative drone missions

Coverage/search phases, custom metrics (coverage, redundancy, energy-per-area, efficiency)
Designed for research-grade ablations + paper-ready plots

🤖 Automation Products (n8n + Next.js + Supabase)

Classroom/assignment automation pipelines
YouTube script generator (web + scraping + structured outputs)

💬 Risk Signal Extractor (NLP)

Classifies support messages into categories (fraud/chargeback/legal threat/refund…) using classical ML / lightweight DL (no transformers)

🛠️ Tech Stack (what I actually use)

Core: Python • Pytorch • TensorFlow • NumPy • Pandas
ML/RL: Optax • Gym/Gymnasium • RL tooling
Backend: FastAPI • Node.js • REST APIs
Frontend: Next.js • React • Tailwind
Automation: n8n • Webhooks • Supabase
Dev: Git • Docker • Linux • CI (GitHub Actions)

📌 What I value (how I work)

Research-grade clarity: clean experiments, ablations, reproducibility
Engineering discipline: tests, readable diffs, minimal magic
Systems thinking: metrics, failure modes, memory/latency constraints
Ship mindset: demos that real users can try

📊 GitHub Stats

🤝 Let’s collaborate

If you’re working on multi-agent RL, agent memory/distillation, JAX tooling, or AI automation products — I’m open to collaborations.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly