Pull requests: HabanaAI/vllm-fork (forked from vllm-project/vllm)
PD scripts update to upgrade mooncake and enable fp8_inc by default (#2212, opened Jan 7, 2026 by Yanli2190, 3 tasks)
Fix the error of using the wrong model name in the document (#2208, opened Dec 29, 2025 by wenbinc-Bin)
Fix AttributeError in fixed_sub_image_list for embedding models (#2206, opened Dec 24, 2025 by majunpo)
Add FA3 opt in INC and upgrade mooncake to 0.3.7 (#2199, opened Dec 18, 2025 by Yanli2190, 3 tasks)
Delay prefix cache calculation to find longest common prefix (#2170, opened Dec 8, 2025 by ikurtchen, 3 tasks done)
Bump actions/stale from 9.1.0 to 10.1.1 (#2169, opened Dec 8, 2025 by dependabot[bot]; labels: dependencies, github_actions)
Enable delayed sampling for warmup also to remove graph compilation i… (#2164, opened Dec 4, 2025 by yeonsily, 3 tasks)
add VLLM_ENGINE_PROFILER_SKIP_STEPS to the engine profiler (#2143, opened Nov 19, 2025 by yangulei)
[DeepSeek R1] chunked prefill warmup with chunk size (#2135, opened Nov 14, 2025 by jerrychenhf)
Workaround for Assertion error when embedding with bge-m3 in lazy mode (#2093, opened Oct 28, 2025 by slokesha)