Z Lab
Efficient AI. PI: Zhijian Liu
Popular repositories Loading
-
sparselora
sparselora Public[ICML 2025] SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity
-
flash-colreduce
flash-colreduce PublicFast, memory-efficient attention column reduction (e.g., sum, mean, max)
Python 37
Repositories
Showing 4 of 4 repositories
- paroquant Public
[ICLR 2026] ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference
z-lab/paroquant’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…