[0.9.1][Doc] Add release note for v0.9.1rc3
#2431
Signed-off-by: shen-shanshan <[email protected]>
Code Review
This pull request adds the release notes for `v0.9.1rc3` and updates version numbers across various documentation and configuration files. The changes are consistent and correctly reflect the new release candidate. The modifications in `README.md`, `README.zh.md`, `docs/source/community/versioning_policy.md`, and `docs/source/conf.py` all align with the goal of documenting and configuring the `v0.9.1rc3` release. The new release notes in `docs/source/user_guide/release_notes.md` are comprehensive. The pull request appears to be in good shape for merging.
- Fix header include issue in rope [#2398](https://github.com/vllm-project/vllm-ascend/pull/2398)
- Fix mtp config bug [#2412](https://github.com/vllm-project/vllm-ascend/pull/2412)
- Fix error info and adapt `attn_metedata` refactor [#2402](https://github.com/vllm-project/vllm-ascend/pull/2402)
- Removes explicit ATB extension registration [#1921](https://github.com/vllm-project/vllm-ascend/pull/1921)
remove this
- Fix bugs and refactor cached mask generation logic [#2326](https://github.com/vllm-project/vllm-ascend/pull/2326)
- Fix configuration check logic about ascend scheduler [#2327](https://github.com/vllm-project/vllm-ascend/pull/2327)
- Cancel the verification between deepseek-mtp and non-ascend scheduler in disaggregated-prefill deployment [#2368](https://github.com/vllm-project/vllm-ascend/pull/2368)
- Fix protobuf version in Dockerfile [#2306](https://github.com/vllm-project/vllm-ascend/pull/2306)
Suggested change:
- Fix protobuf version in Dockerfile [#2306](https://github.com/vllm-project/vllm-ascend/pull/2306)
- Fix issue that failed with ray distributed backend [#2306](https://github.com/vllm-project/vllm-ascend/pull/2306)
What this PR does / why we need it?
Add release note for `v0.9.1rc3`; find more details at #2396.
Does this PR introduce any user-facing change?
How was this patch tested?