Welcome to my personal GitHub Page! I'm a Senior Software Engineer with 4+ years of experience in AI systems, compilers, and edge inference. I specialize in developing high-performance AI inference kernels, model quantization, and hardware-software co-design for next-generation AI accelerators.
- π Graduated from NIT Srinagar in Electronics and Communication Engineering.
- π§ Passionate about AI/ML Systems, TinyML, Compilers, and Computer Architecture.
- π§ͺ Previously worked at Jio, contributed to Gecko engine of Mozilla Firefox, and built Progressive Web Apps for JioOS.
- π‘ Currently working at Kinara AI (acquired by NXP Semiconductors), focusing on 8-bit/16-bit quantization and optimized kernel development for custom AI accelerators (Ara-2).
- C++, Python, JavaScript, C
- PyTorch ,Quantization, Computer Architecture CUDA,NNP-TIE ISA, LLM inference (dynamic quantization, KV caching)
- Git, GDB, Multithread Programming, Operating System , Object-Oriented Programming , Data structures and Algorithms , SQL
- Linux, Bash, Makefiles
π Bengaluru, India | ποΈ Oct 2022 β Present
- Developed high-performance inference kernels for custom AI accelerator Ara-2.
- Designed and implemented 8-bit dynamic quantization for transformer models.
- Worked on performance profiling and SNR accuracy analysis between float and quantized models.
π Navi Mumbai, India | ποΈ Jun 2021 β Oct 2022
- Contributed to Gecko (Firefox) engine for KaiOS β API compatibility and system app updates.
- Built system PWAs like Settings app using React, JavaScript, and KaiOS APIs.
- Improved app UI/UX and integrated system-level features.
- π GitHub
- π§ Email: lokeshyadavmandapalli@gmail.com
"Always learning, always building β at the intersection of AI and Systems."