Pull requests: vllm-project/vllm

Pull requests list

[compile] Disable aot when eager backend is used. (labels: ready)
#30810 opened Dec 16, 2025 by zhxchen17
5 tasks
[compile] Ignore VLLM_FORCE_AOT_LOAD from cache factors (labels: ready)
#30809 opened Dec 16, 2025 by zhxchen17
5 tasks
[docker] Allow kv_connectors install to fail on arm64 (labels: ci/build)
#30806 opened Dec 16, 2025 by amrmahdi
5 tasks
[CI] Skip ci failure test (labels: ready)
#30804 opened Dec 16, 2025 by yewentao256 (milestone: v0.13.0)
[PERF] Add interleaved memory allocation to NUMA module
#30800 opened Dec 16, 2025 by skaraban3807
3 of 5 tasks
bump up compressed tensors version to 0.13.0 (labels: ci/build, quantization, ready)
#30799 opened Dec 16, 2025 by shanjiaz
5 tasks
[Bugfix] Fix DeepSeekV32 tool parser incorrect type conversion for array/object parameters (labels: deepseek)
#30797 opened Dec 16, 2025 by fangtaosong
3 of 5 tasks
Fix nemotron_nas intermediate_size computation
#30795 opened Dec 16, 2025 by grzegorz-k-karch
5 tasks
[ROCm] [Bugfix] Fix torch sdpa hallucination (labels: ready, rocm)
#30789 opened Dec 16, 2025 by tjtanaa (milestone: v0.13.0)
5 tasks
[refactor] Add prefix support to embed_tokens in DeepSeek MTP (labels: deepseek)
#30788 opened Dec 16, 2025 by zzhx1
5 tasks
[CI/Build] Fix compatibility between #30244 and #30396 (labels: ready)
#30787 opened Dec 16, 2025 by DarkLight1337
5 tasks
[Fix]Load kv-cache dtype from hf_quant_config.json automatically (fix for reverted PR) (labels: ready)
#30785 opened Dec 16, 2025 by danielafrimi
AWQ: Evaluate fused vs unfused GEMM on actual shape
#30783 opened Dec 16, 2025 by mgehre-amd
[CI] add polling for precompiled wheel in python_only_compile.sh
#30781 opened Dec 16, 2025 by Harry-Chen
3 of 5 tasks
Optimize workspace memory in DeepGEMM.
#30780 opened Dec 16, 2025 by halyavin
Algo
#30767 opened Dec 16, 2025 by Mercykid-bash Draft
5 tasks