-
Notifications
You must be signed in to change notification settings - Fork 350
Pull requests: vllm-project/vllm-ascend
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix some ci issue and refactor modelrunner
module:core
module:tests
#2445
opened Aug 19, 2025 by
MengqingCao
Loading…
[0.9.1] [BUGFIX] support mtp in disaggregated-prefill scenario
#2444
opened Aug 19, 2025 by
JC-ut0
Loading…
Adapted the independent TP partitioning of the O matrix in the Qwen3-235B model for pure DP scenarios.
module:core
module:ops
#2443
opened Aug 19, 2025 by
coder-fny
Loading…
[main][bugfix] Fix bugs and refactor cached mask generation logic
#2442
opened Aug 19, 2025 by
rjg-lyh
Loading…
[0.9.1-DEV][BUGFIX] BugFix: Resolve the issue of waiting queue accumulation when requests are canceled.
documentation
Improvements or additions to documentation
#2441
opened Aug 19, 2025 by
wangxiaoteng666
Loading…
[main] Fix AddRMSNormW8A8Quant init bug and optimize the performance of the gemmarmsnorm operator of the gemma3 model on NPU
module:ops
module:tests
#2440
opened Aug 19, 2025 by
socrahow
Loading…
[1/N][Draft][Refactor]torchair pangu_moe modeling refactor
#2437
opened Aug 19, 2025 by
Angazenn
Loading…
Add gpt oss
guide
guide note
merge-conflicts
module:ops
#2436
opened Aug 19, 2025 by
ChenTaoyu-SJTU
Loading…
refact model runner v1
merge-conflicts
module:tests
#2435
opened Aug 19, 2025 by
weiguihua2
Loading…
[Scheduler] validate max_num_batched_tokens and max_model_len in AscendSchedulerConfig
module:tests
#2434
opened Aug 19, 2025 by
linfeng-yuan
Loading…
Add feature branch policy
documentation
Improvements or additions to documentation
#2432
opened Aug 19, 2025 by
Yikun
Loading…
[0.9.1][Doc] Add release note for Improvements or additions to documentation
v0.9.1rc3
documentation
#2431
opened Aug 19, 2025 by
shen-shanshan
Loading…
[DOC] update doc: LoRA with ACLGraph
documentation
Improvements or additions to documentation
module:core
#2430
opened Aug 19, 2025 by
paulyu12
Loading…
[bugfix] ascend schedule encountered an incorrect req block length in…
#2429
opened Aug 19, 2025 by
liziyu179
Loading…
[AclGraph] Adapt aclgraph into new graph dispatcher arch
module:core
module:tests
#2427
opened Aug 18, 2025 by
MengqingCao
Loading…
[MAIN][BUGFIX] BugFix: Resolve the issue of waiting queue accumulation when requests are canceled.
#2426
opened Aug 18, 2025 by
wangxiaoteng666
Loading…
refact model runner v1
merge-conflicts
module:tests
#2417
opened Aug 18, 2025 by
weiguihua2
Loading…
qwen3_moe/qwen25 support torchair graph
module:core
module:ops
module:tests
#2403
opened Aug 17, 2025 by
NicholasTao
Loading…
[main][bugfix] Unify MoE routing init with standard torch_npu operator
module:quantization
#2401
opened Aug 16, 2025 by
SlightwindSec
Loading…
[0.9.1][bugfix] Unify MoE routing init with standard torch_npu operator
documentation
Improvements or additions to documentation
module:quantization
#2400
opened Aug 15, 2025 by
SlightwindSec
Loading…
Previous Next
ProTip!
Adding no:label will show everything without a label.