I am a Ph.D. student in TANKLAB at Tianjin University, advised by Wenyu Qu and Yitao Hu. My research interests include machine learning systems, LLM inference serving, and distributed systems. I received my B.S. degree in Computer Science from Northwest A&F University.
GitHub: zhixin612
Email: zhao612@tju.edu.cn
- PAT: Accelerating LLM Decoding via Prefix-Aware Attention with Resource Efficient Multi-Tile Kernel
Jinjun Yi†, Zhixin Zhao†, Yitao Hu*, Ke Yan, Weiwei Sun, Hao Wang, Laiping Zhao, Yuhao Zhang, Wenxin Li, Keqiu Li.
ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2026.
- PARD: Enhancing Goodput for Inference Pipeline via Proactive Request Dropping
Zhixin Zhao, Yitao Hu, Simin Chen, Mingfang Ji, Wei Yang, Yuhao Zhang, Laiping Zhao, Wenxin Li, Xiulong Liu, Wenyu Qu, Hao Wang.
The 21st ACM European Conference on Computer Systems (EuroSys), 2026.
- SLOpt: Serving Real-Time Inference Pipeline with Strict Latency Constraint
Zhixin Zhao, Yitao Hu*, Guotao Yang, Ziqi Gong, Chen Shen, Laiping Zhao, Wenxin Li, Xiulong Liu, and Wenyu Qu.
IEEE Transactions on Computers (TC), 2025.
- Harpagon: Minimizing DNN Serving Cost via Efficient Dispatching, Scheduling and Splitting
Zhixin Zhao, Yitao Hu*, Ziqi Gong, Guotao Yang, Wenxin Li, Xiulong Liu, Keqiu Li, and Hao Wang.
IEEE International Conference on Computer Communications (INFOCOM), 2025.
- TightLLM: Maximizing Throughput for LLM Inference via Adaptive Offloading Policy
Yitao Hu, Xiulong Liu*, Guotao Yang, Linxuan Li, Kai Zeng, Zhixin Zhao, Sheng Chen, Laiping Zhao, Wenxin Li, and Keqiu Li.
IEEE Transactions on Computers (TC), 2025.
- SuperSpec: Enhanced Verification and Sampling for End-to-End LLM Speculative Decoding
Chen Shen, Rui Guo, Yang Cheng, Yang Lin, Zhixin Zhao, Yitao Hu*, Sheng Chen, Xiulong Liu, and Keqiu Li.
IEEE International Conference on High Performance Computing and Communications (HPCC), 2025.
- SmartCache: Two-Dimensional KV-Cache Similarity for Efficient Long-Context LLM Decoding
Chen Shen, Hao Chen, Kaining Hui, Zhixin Zhao, Yang Cheng, Yitao Hu*, Sheng Chen, Xiulong Liu, and Keqiu Li.
IEEE International Conference on High Performance Computing and Communications (HPCC), 2025.
- High-throughput Sampling, Communicating and Training for Reinforcement Learning Systems
Laiping Zhao, Xinan Dai, Zhixin Zhao, Yusong Xin, Yitao Hu*, Jun Qian, Jun Yao, and Keqiu Li.
IEEE/ACM International Symposium on Quality of Service (IWQoS), 2023.
- 2025: ASPLOS - Artifact Evaluation Reviewer
- 2023: ICA3PP - Reviewer
- 2021: The 2021 ICPC Shaanxi National Invitational: Silver Medal
- 2020: The 2020 ICPC Asia-East Continent Final: Bronze Medal
- 2020: The 45th ICPC Asia Regional Contest Shanghai Site: Silver Medal
- 2024: Academic Scholarship, Tianjin University
- 2023: Distinguished Academic Scholarship, Tianjin University
- 2022: Outstanding Graduate, Northwest A&F University
- 2021: Presidential Scholarship, Northwest A&F University
- 2020: National Encouragement Scholarship, Northwest A&F University
- 2019: National Encouragement Scholarship, Northwest A&F University
photography📸 ping-pong🏓 badminton🏸 ...

