-
Notifications
You must be signed in to change notification settings - Fork 41
Pull requests: vllm-project/tpu-inference
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Model] Add vision encoder padding and warmup for Qwen2.5 VL model
#1151
opened Nov 21, 2025 by
kwang3939
Loading…
Centralizes environment variable access by routing variables reads through the envs.py module
#1147
opened Nov 21, 2025 by
xingliu14
Loading…
[MISC] Removed problematic local path for CONFTEST_DIR
#1141
opened Nov 20, 2025 by
JiriesKaileh
Loading…
Enable Pipeline Parallelism on Jax models
#1077
opened Nov 12, 2025 by
Chenyaaang
Loading…
1 of 8 tasks
Enable Pipeline Parallelism on Jax runner
#1053
opened Nov 8, 2025 by
Chenyaaang
Loading…
1 of 8 tasks
[Docs] fix dead links in multiple documentation pages
#1027
opened Nov 6, 2025 by
mattheliu
Loading…
3 tasks done
Implement runai model streamer for MODEL_IMPL_TYPE=flax_nnx
#955
opened Oct 27, 2025 by
amacaskill
Loading…
dtype in ModelConfig will be implicitly casted to torch.dtype so in tpu_jax, we need to check for torch dtype as well
#945
opened Oct 27, 2025 by
lc5211
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-10-24.