Skip to content

Commit da9f22d

Browse files
Update quadlet and fleet image tags to 425da63
1 parent 425da63 commit da9f22d

3 files changed

Lines changed: 3 additions & 3 deletions

File tree

scenarios/quadlet/vllm-bench.container

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@ Description=vLLM Benchmark Load Generator
33
After=vllm-server.service
44

55
[Container]
6-
Image=quay.io/redhat-et/vllm-server:v1.0.202603251001
6+
Image=quay.io/redhat-et/vllm-server:425da63
77
Volume=model-storage.volume:/models:ro,z
88
Network=mlops.network
99
Entrypoint=["bash", "-c", "until curl -sf http://vllm-server:8000/health; do sleep 10; done && vllm bench serve --base-url http://vllm-server:8000 --model Llama-3.2-1B-Instruct --tokenizer /models --request-rate 5.0 --num-prompts 500"]

scenarios/quadlet/vllm-server.container

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@ Description=vLLM GPU Inference Server
33
After=network-online.target model-car.service
44

55
[Container]
6-
Image=quay.io/redhat-et/vllm-server:v1.0.202603251001
6+
Image=quay.io/redhat-et/vllm-server:425da63
77
Volume=model-storage.volume:/models:ro,z
88
PublishPort=8000:8000
99
AddDevice=nvidia.com/gpu=all

scenarios/scenario-01-device-edge/flightctl/fleet.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -14,7 +14,7 @@ spec:
1414
applications:
1515
- name: mlops-gpu-stack
1616
appType: quadlet
17-
image: quay.io/redhat-et/mlops-quadlet:v1.0.202603251512
17+
image: quay.io/redhat-et/mlops-quadlet:425da63
1818
config:
1919
- name: inference-server-metrics
2020
gitRef:

0 commit comments

Comments
 (0)