
Conversation

@JC-ut0 JC-ut0 commented Aug 19, 2025

What this PR does / why we need it?

Support MTP (multi-token prediction) speculative decoding in the disaggregated-prefill scenario.

Does this PR introduce any user-facing change?

No

How was this patch tested?

  • v0.9.1-dev branch
  • A3 hardware: TP16, and DP4 with TP4
  • A3 hardware: 4P1D disaggregated deployment (4 prefill instances, 1 decode instance)

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request introduces two bug fixes to support MTP speculative decoding in a disaggregated prefill scenario. In vllm_ascend/attention/mla_v1.py, the calculation of actual_seq_lengths_q is corrected for torchair graph mode. In vllm_ascend/worker/model_runner_v1.py, the attention state is correctly set to SpecDecoding for deepseek_mtp in cases where it would have been misidentified as DecodeOnly. The changes are correct and well-targeted. I have no further suggestions.
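
To make the two fixes concrete, here is a minimal, self-contained Python sketch of the behavior the review describes. Everything below is illustrative: the helper names (`cumulative_query_lens`, `classify_attn_state`) and the exact stride arithmetic are assumptions inferred from the review summary, not code taken from the patch; only `actual_seq_lengths_q`, the `SpecDecoding`/`DecodeOnly` states, and the `deepseek_mtp` method name come from the discussion above.

```python
from enum import Enum, auto
from typing import List, Optional


class AscendAttentionState(Enum):
    """Stand-in for the attention-state enum in vllm_ascend; only the
    two states mentioned in the review are modeled here."""
    DecodeOnly = auto()
    SpecDecoding = auto()


def cumulative_query_lens(num_reqs: int, num_spec_tokens: int) -> List[int]:
    """Fix 1 (mla_v1.py, sketch): in torchair graph mode, each decode
    request contributes (1 + num_spec_tokens) query positions when MTP
    draft tokens are present, so the cumulative query lengths
    (actual_seq_lengths_q) must step by that stride instead of by 1."""
    stride = 1 + num_spec_tokens
    return [stride * (i + 1) for i in range(num_reqs)]


def classify_attn_state(spec_method: Optional[str]) -> AscendAttentionState:
    """Fix 2 (model_runner_v1.py, sketch): a batch with no prefills but
    with deepseek_mtp draft tokens was previously misclassified as
    DecodeOnly; it must be SpecDecoding so the backend builds
    speculative-decoding attention metadata."""
    if spec_method == "deepseek_mtp":
        return AscendAttentionState.SpecDecoding
    return AscendAttentionState.DecodeOnly


if __name__ == "__main__":
    # 4 decode requests with 1 MTP draft token each -> [2, 4, 6, 8];
    # a per-request stride of 1 would have produced [1, 2, 3, 4].
    assert cumulative_query_lens(4, 1) == [2, 4, 6, 8]
    assert classify_attn_state("deepseek_mtp") is AscendAttentionState.SpecDecoding
    assert classify_attn_state(None) is AscendAttentionState.DecodeOnly
```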

@JC-ut0 force-pushed the v0.9.1-dev branch 3 times, most recently from d02aff1 to ebe1b95 on August 20, 2025 at 07:52
@wangxiyuan
Collaborator

Please update the commit message.

@wangxiyuan wangxiyuan merged commit f64208b into vllm-project:v0.9.1-dev Aug 20, 2025
17 checks passed