Pull requests: vllm-project/vllm
Add back support of tokenizer_mode == custom
fb-exported
meta-exported
#30812
opened Dec 16, 2025 by
henryoier
[ROCm][CI] Reduce Flakiness For test_async_scheduling Using ROCM_ATTN With FP32
rocm
Related to AMD ROCm
v1
#30811
opened Dec 16, 2025 by
micah-wil
[compile] Disable aot when eager backend is used.
ready
ONLY add when PR is ready to merge/full CI is needed
#30810
opened Dec 16, 2025 by
zhxchen17
5 tasks
[compile] Ignore VLLM_FORCE_AOT_LOAD from cache factors
ready
ONLY add when PR is ready to merge/full CI is needed
#30809
opened Dec 16, 2025 by
zhxchen17
5 tasks
[docker] Allow kv_connectors install to fail on arm64
ci/build
#30806
opened Dec 16, 2025 by
amrmahdi
5 tasks
RayLLM Bugfix - Preserve obj store URL for multi engine_config creation
#30803
opened Dec 16, 2025 by
omer-dayan
[PERF] Add interleaved memory allocation to NUMA module
#30800
opened Dec 16, 2025 by
skaraban3807
3 of 5 tasks
bump up compressed tensors version to 0.13.0
ci/build
quantization
ready
ONLY add when PR is ready to merge/full CI is needed
#30799
opened Dec 16, 2025 by
shanjiaz
5 tasks
[Bugfix] Fix DeepSeekV32 tool parser incorrect type conversion for array/object parameters
deepseek
Related to DeepSeek models
#30797
opened Dec 16, 2025 by
fangtaosong
3 of 5 tasks
[BugFix][Async] clear spec tokens for preempted or resumed reqs in async
v1
#30796
opened Dec 16, 2025 by
izhuhaoran
Fix nemotron_nas intermediate_size computation
#30795
opened Dec 16, 2025 by
grzegorz-k-karch
5 tasks
[P/D] p2p_nccl: implement async KV loading for decode stage
kv-connector
v1
#30794
opened Dec 16, 2025 by
dongbo910220
5 tasks
[refactor] Add prefix support to embed_tokens in DeepSeek MTP
deepseek
Related to DeepSeek models
#30788
opened Dec 16, 2025 by
zzhx1
5 tasks
[CI/Build] Fix compatibility between #30244 and #30396
ready
ONLY add when PR is ready to merge/full CI is needed
#30787
opened Dec 16, 2025 by
DarkLight1337
5 tasks
[Fix]Load kv-cache dtype from hf_quant_config.json automatically (fix for reverted PR)
ready
ONLY add when PR is ready to merge/full CI is needed
#30785
opened Dec 16, 2025 by
danielafrimi
[Improvement] Persist CUDA compat libraries paths to prevent reset on apt-get
nvidia
ci/build
multi-modality
Related to multi-modality (#4194)
#30784
opened Dec 16, 2025 by
emricksini-h
AWQ: Evaluate fused vs unfused GEMM on actual shape
#30783
opened Dec 16, 2025 by
mgehre-amd
[CI] add polling for precompiled wheel in python_only_compile.sh
#30781
opened Dec 16, 2025 by
Harry-Chen
3 of 5 tasks
[Feature]: Implement naive prepare/finalize class to replace naive dispatching in fused_moe/layer.py
#30775
opened Dec 16, 2025 by
teddygood
3 of 5 tasks