Skip to content

Commit bd4e31c

Browse files
QiJunelancelly
authored andcommitted
doc: update known issues (NVIDIA#6247)
Signed-off-by: junq <[email protected]> Signed-off-by: Lanyu Liao <[email protected]>
1 parent 033f62d commit bd4e31c

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

docs/source/release-notes.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -73,6 +73,7 @@ All published functionality in the Release Notes has been fully tested and verif
7373
### Known Issues
7474
- accuracy/test_cli_flow::TestGpt2::test_beam_search_large is broken.
7575
- Enabling disaggregated serving, MTP, and the overlap scheduler at the same time can lead to accuracy problems.
76+
- Full chunked attention support has been added for LLaMA4 to handle >8K sequences, with a known performance regression. The root cause is identified and will be fixed in a future release.
7677

7778
## TensorRT-LLM Release 0.20.0
7879

0 commit comments

Comments
 (0)