Commit fd96a08
committed
rebase
Signed-off-by: junq <[email protected]>
File tree
282 files changed, +2448 -1373 lines changed
- benchmarks/cpp
- cpp
- cmake/modules
- include/tensorrt_llm/deep_gemm
- tensorrt_llm
- batch_manager
- kernels
- cutlass_kernels
- fp4_gemm
- fp8_rowwise_gemm
- fpA_intB_gemm
- launchers
- fused_gated_gemm
- int8_gemm
- low_latency_gemm
- trtllmGenKernels
- batchedGemm/trtllmGen_bmm_export
- gemmGatedAct/trtllmGen_gatedAct_export
- gemm/trtllmGen_gemm_export
- nanobind
- plugins
- bertAttentionPlugin
- cudaStreamPlugin
- eaglePlugin
- fusedLayernormPlugin
- gemmAllReducePlugin
- gemmPlugin
- gptAttentionCommon
- identityPlugin
- layernormQuantizationPlugin
- lookupPlugin
- loraPlugin
- lowLatencyGemmPlugin
- lowLatencyGemmSwigluPlugin
- mixtureOfExperts
- ncclPlugin
- qserveGemmPlugin
- quantizePerTokenPlugin
- quantizeTensorPlugin
- quantizeToFP4Plugin
- rmsnormQuantizationPlugin
- smoothQuantGemmPlugin
- weightOnlyGroupwiseQuantMatmulPlugin
- weightOnlyQuantMatmulPlugin
- pybind
- runtime
- thop
- tests/unit_tests
- executor
- kernels/fused_gated_gemm
- multi_gpu
- docker
- common
- docs/source
- blogs
- tech_blog
- commands/trtllm-serve
- developer-guide
- features
- auto_deploy/advanced
- legacy
- dev-on-cloud
- performance
- reference
- models
- torch
- examples
- apps
- auto_deploy
- bindings/executor
- cpp_library
- cpp/executor
- disaggregated
- draft_target_model
- eagle
- language_adapter
- llm-api
- lookahead
- medusa
- models
- contrib
- arctic
- baichuan
- bloom
- chatglm-6b
- chatglm2-6b
- chatglm3-6b-32k
- cogvlm
- dbrx
- deepseek_v1
- deepseek_v2
- dit
- falcon
- gptj
- gptneox
- grok
- hyperclovax
- internlm
- jais
- mmdit
- mpt
- opt
- skywork
- smaug
- stdit
- core
- bert
- commandr
- deepseek_v3
- enc_dec
- exaone
- gemma
- glm-4-9b
- gpt
- granite
- internlm2
- llama4
- llama
- mamba
- mixtral
- mllama
- multimodal
- nemotron_nas
- nemotron
- phi
- qwen2audio
- qwenvl
- qwen
- recurrentgemma
- vit
- whisper
- ngram
- openai_triton
- manual_plugin
- plugin_autogen
- python_plugin
- quantization
- redrafter
- sample_weight_stripping
- wide_ep/slurm_scripts
- jenkins
- scripts
- tensorrt_llm
- _torch
- auto_deploy/custom_ops
- custom_ops
- distributed
- models
- modules/fused_moe
- ops
- pyexecutor
- auto_parallel
- bench
- benchmark
- utils
- build
- dataclasses
- commands
- llmapi
- models
- eagle
- mmdit_sd3
- qwen
- stdit
- unet
- plugin
- scaffolding
- serve/scripts
- tools/plugin_gen/templates
- tests
- integration
- defs
- accuracy
- examples
- perf
- stress_test
- utils
- test_lists
- test-db
- unittest
- _torch
- executor
- modules
- multi_gpu_modeling
- multi_gpu
- thop/parallel
- api_stability
- llmapi
- others
- tools
- trt
- functional
- quantization
- triton_backend
- all_models/disaggregated_serving
- ci
- tools/inflight_batcher_llm
Lines changed: 12 additions & 12 deletions