Skip to content

Commit 7e27762

Browse files
committed
add model svc yamls
Signed-off-by: Ning Wang <[email protected]>
1 parent 2065f3c commit 7e27762

File tree

4 files changed

+66
-1
lines changed

4 files changed

+66
-1
lines changed
Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,14 @@
1+
apiVersion: v1
2+
kind: Service
3+
metadata:
4+
name: qwen3-32b-service
5+
namespace: default
6+
spec:
7+
selector:
8+
model.aibrix.ai/name: qwen3-32b
9+
ports:
10+
- protocol: TCP
11+
port: 8000
12+
targetPort: 8000
13+
nodePort: 30009
14+
type: NodePort
Lines changed: 37 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,37 @@
1+
apiVersion: v1
2+
kind: Service
3+
metadata:
4+
name: qwen3-8b-service
5+
namespace: default
6+
spec:
7+
selector:
8+
model.aibrix.ai/name: qwen3-8b
9+
ports:
10+
- protocol: TCP
11+
port: 8000
12+
targetPort: 8000
13+
nodePort: 30008
14+
type: NodePort
15+
16+
17+
apiVersion: v1
18+
kind: Service
19+
metadata:
20+
name: llama2-7b
21+
namespace: default
22+
labels:
23+
prometheus-discovery: "true"
24+
annotations:
25+
prometheus.io/scrape: "true"
26+
prometheus.io/path: "/metrics"
27+
prometheus.io/port: "8000"
28+
spec:
29+
selector:
30+
model.aibrix.ai/name: "llama2-7b"
31+
ports:
32+
- protocol: TCP
33+
name: metrics
34+
port: 8000
35+
targetPort: 8000
36+
nodePort: 30081
37+
type: NodePort
Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,14 @@
1+
apiVersion: v1
2+
kind: Service
3+
metadata:
4+
name: qwen3-8b-service
5+
namespace: default
6+
spec:
7+
selector:
8+
model.aibrix.ai/name: qwen3-8b
9+
ports:
10+
- protocol: TCP
11+
port: 8000
12+
targetPort: 8000
13+
nodePort: 30008
14+
type: NodePort

test/regression/v0.4.0/vllm/vllm-base.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ metadata:
77
name: qwen3-8b
88
namespace: default
99
spec:
10-
replicas: 1
10+
replicas: 2
1111
selector:
1212
matchLabels:
1313
model.aibrix.ai/name: qwen3-8b

0 commit comments

Comments
 (0)