-
Notifications
You must be signed in to change notification settings - Fork 185
Adding Kubvernor to the list of implementors #1313
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from 6 commits
96f2ceb
db14992
426f4c0
8dcb895
8e66626
061a556
5719437
f3895cf
4e2ac44
2847818
8b85635
6eec771
4683dca
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,10 @@ | ||
| apiVersion: gateway.networking.k8s.io/v1 | ||
| kind: Gateway | ||
| metadata: | ||
| name: inference-gateway | ||
| spec: | ||
| gatewayClassName: kubvernor-inference-gateway | ||
| listeners: | ||
| - name: http | ||
| port: 80 | ||
| protocol: HTTP |
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,20 @@ | ||
| apiVersion: gateway.networking.k8s.io/v1 | ||
| kind: HTTPRoute | ||
| metadata: | ||
| name: llm-route | ||
| spec: | ||
| parentRefs: | ||
| - group: gateway.networking.k8s.io | ||
| kind: Gateway | ||
| name: inference-gateway | ||
| rules: | ||
| - backendRefs: | ||
| - group: inference.networking.x-k8s.io | ||
| kind: InferencePool | ||
| name: vllm-llama3-8b-instruct | ||
| matches: | ||
| - path: | ||
| type: PathPrefix | ||
| value: / | ||
| timeouts: | ||
| request: 300s |
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -262,6 +262,42 @@ A cluster with: | |
| kubectl get httproute llm-route -o yaml | ||
| ``` | ||
|
|
||
| === "Kubvernor Rust API Gateway" | ||
|
|
||
| [Kubvernor Rust API Gateway](https://github.com/kubvernor/kubvernor) is a higly experimental project so not ready for production but it supports version v0.5.1 of Inference Extension Spec. | ||
|
|
||
| 1. Requirements | ||
| - Rust and Cargo installed | ||
|
||
|
|
||
| 2. Run Kubvernor Rust API Gateway as documented in [README](https://github.com/kubvernor/kubvernor/blob/main/README.md) | ||
dawid-nowak marked this conversation as resolved.
Outdated
Show resolved
Hide resolved
|
||
|
|
||
|
|
||
| 3. Deploy the Gateway | ||
|
|
||
| ```bash | ||
| kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extension/raw/main/config/manifests/gateway/kubvernor/gateway.yaml | ||
| ``` | ||
|
|
||
| Confirm that the Gateway was assigned an IP address and reports a `Programmed=True` status: | ||
| ```bash | ||
| $ kubectl get gateway inference-gateway | ||
| NAME CLASS ADDRESS PROGRAMMED AGE | ||
| inference-gateway kubvernor-inference-gateway <MY_ADDRESS> True 22s | ||
| ``` | ||
|
|
||
| 5. Deploy the HTTPRoute | ||
|
|
||
| ```bash | ||
| kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extension/raw/main/config/manifests/gateway/kubvernor/httproute.yaml | ||
| ``` | ||
|
|
||
| 6. Confirm that the HTTPRoute status conditions include `Accepted=True` and `ResolvedRefs=True`: | ||
|
|
||
| ```bash | ||
| kubectl get httproute llm-route -o yaml | ||
| ``` | ||
|
|
||
|
|
||
|
|
||
| ### Deploy the InferencePool and Endpoint Picker Extension | ||
|
|
||
|
|
@@ -404,3 +440,6 @@ A cluster with: | |
| ```bash | ||
| kubectl delete ns kgateway-system | ||
| ``` | ||
| === "Kubvernor" | ||
|
|
||
| No further clean up is needed. | ||
Uh oh!
There was an error while loading. Please reload this page.