You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/source/torch/auto_deploy/auto-deploy.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -10,7 +10,7 @@ This project is in active development and is currently in a prototype stage. The
10
10
AutoDeploy is a prototype designed to simplify and accelerate the deployment of PyTorch models, including off-the-shelf models like those from HuggingFace transformers library, to TensorRT-LLM.
11
11
12
12
<divalign="center">
13
-
<imgsrc="./ad_overview.png"alt="AutoDeploy integration with LLM API"width="70%">
13
+
<imgsrc="../../media/ad_overview.png"alt="AutoDeploy integration with LLM API"width="70%">
14
14
<p><em>AutoDeploy overview and relation with TensorRT-LLM's LLM api</em></p>
Copy file name to clipboardExpand all lines: docs/source/torch/auto_deploy/support_matrix.md
+71-14Lines changed: 71 additions & 14 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -9,24 +9,81 @@ The exported graph then undergoes a series of automated transformations, includi
9
9
**Bring Your Own Model**: AutoDeploy leverages `torch.export` and dynamic graph pattern matching, enabling seamless integration for a wide variety of models without relying on hard-coded architectures.
10
10
11
11
We support Hugging Face models that are compatible with `AutoModelForCausalLM` and `AutoModelForImageTextToText`.
12
-
Additionally, we have officially verified support for the following models:
12
+
In addition, we have officially validated the following models using the default configuration: runtime=trtllm, compile_backend=torch-compile, and attn_backend=flashinfer
13
13
14
14
<details>
15
15
<summary>Click to expand supported models list</summary>
16
16
17
-
| Model Series | HF Model Card | Model Factory | Precision | World Size | Runtime | Compile Backend ||| Attention Backend |||
0 commit comments