OpenBitSys
Repositories
- vlut.cpp (Public)
  [MobiSys 2026] On-device parallel ultra-low-bit (ternary) LLM inference with LUT-based mpGeMM kernel.
- BitDecoding (Public)
  [HPCA 2026] A GPU-optimized system for efficient long-context LLMs decoding with low-bit KV cache.
- BitDistiller (Public)
  [ACL 2024] A novel QAT with Self-Distillation framework to enhance ultra low-bit LLMs.