We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 2a604fb commit 9f41d7aCopy full SHA for 9f41d7a
docs/source/engines/toolkit.mdx
@@ -1,7 +1,7 @@
1
# Inference Toolkit
2
3
In some cases, the model you're looking to deploy isn't supported by any of the high-performance inference engines. In this case,
4
-we provide a fallback option that works with most models. The Inference Toolkit supports models that are implemented in the
+we provide a fallback option. The Inference Toolkit supports models that are implemented in the
5
Transformers, Sentence-Transformers and Diffusers libraries, and wraps them in a light web server.
6
7
The Inference Toolkit is perfect for testing models and building demos, but isn't as production-ready as TGI, vLLM, SGLang, or llama.cpp.
0 commit comments