Skip to content
This repository was archived by the owner on Jul 24, 2025. It is now read-only.
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
35 commits
Select commit Hold shift + click to select a range
65a5498
Init helm charts
jgchn Jun 12, 2025
3cd169b
Routing resources
jgchn Jun 12, 2025
9c4102f
epp things
kalantar Jun 13, 2025
c031edd
Merge pull request #1 from jgchn/helm-mk-1
jgchn Jun 13, 2025
92da319
helpers
kalantar Jun 13, 2025
201c9e9
Merge pull request #2 from jgchn/helm-mk-2
jgchn Jun 13, 2025
ec953bb
sidecar helper
kalantar Jun 13, 2025
7acc7bb
Merge pull request #3 from jgchn/helm-mk-2
jgchn Jun 13, 2025
23e67f6
PD deployments
jgchn Jun 13, 2025
10099c1
lws
kalantar Jun 16, 2025
a57e681
Merge pull request #4 from jgchn/mk-helm-3
jgchn Jun 16, 2025
29f6c02
Fixes to deployments
jgchn Jun 16, 2025
3126aea
Generate samples for facebook model
jgchn Jun 16, 2025
81c048c
Address inconsistencies
jgchn Jun 16, 2025
cdac5ed
resources helper
kalantar Jun 16, 2025
21648bc
Merge pull request #5 from jgchn/mk-helm-4
jgchn Jun 16, 2025
2d4710f
lws helpers
kalantar Jun 16, 2025
4b4d455
Merge branch 'helm' into mk-helm-4
kalantar Jun 16, 2025
23f13d6
Merge pull request #6 from jgchn/mk-helm-4
jgchn Jun 16, 2025
a982e20
Working example in kind
jgchn Jun 16, 2025
368e846
Add docs
jgchn Jun 16, 2025
6c9b8b9
multinode, uri
kalantar Jun 17, 2025
3df130a
fix
kalantar Jun 17, 2025
f1ba55b
Feedbacl
jgchn Jun 17, 2025
1038678
redo parallelism, resources
kalantar Jun 17, 2025
5d4d0e4
Merge branch 'helm' into mk-helm-5
kalantar Jun 17, 2025
f281bfe
remove debugLevel
kalantar Jun 17, 2025
e1e09ca
Merge pull request #7 from jgchn/mk-helm-5
jgchn Jun 17, 2025
7e94af4
Working example in kind
jgchn Jun 17, 2025
beaef49
rolebinding
kalantar Jun 23, 2025
330bdad
use service port in routing
kalantar Jun 26, 2025
2187613
cleanup, rolebinding
kalantar Jun 26, 2025
cfb6205
role renamed to permissions
kalantar Jun 26, 2025
1fbe58e
generate some args
kalantar Jun 26, 2025
cc7c16f
Merge pull request #8 from jgchn/mk-helm-6
jgchn Jun 26, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
23 changes: 23 additions & 0 deletions helm/.helmignore
Original file line number Diff line number Diff line change
@@ -0,0 +1,23 @@
# Patterns to ignore when building packages.
# This supports shell glob matching, relative path matching, and
# negation (prefixed with !). Only one pattern per line.
.DS_Store
# Common VCS dirs
.git/
.gitignore
.bzr/
.bzrignore
.hg/
.hgignore
.svn/
# Common backup files
*.swp
*.bak
*.tmp
*.orig
*~
# Various IDEs
.project
.idea/
*.tmproj
.vscode/
24 changes: 24 additions & 0 deletions helm/Chart.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,24 @@
apiVersion: v2
name: llm-d-modelservice
description: A Helm chart for ModelService

# A chart can be either an 'application' or a 'library' chart.
#
# Application charts are a collection of templates that can be packaged into versioned archives
# to be deployed.
#
# Library charts provide useful utilities or functions for the chart developer. They're included as
# a dependency of application charts to inject those utilities and functions into the rendering
# pipeline. Library charts do not define any templates and therefore cannot be deployed.
type: application

# This is the chart version. This version number should be incremented each time you make changes
# to the chart and its templates, including the app version.
# Versions are expected to follow Semantic Versioning (https://semver.org/)
version: 0.0.1

# This is the version number of the application being deployed. This version number should be
# incremented each time you make changes to the application. Versions are not expected to
# follow Semantic Versioning. They should reflect the version the application is using.
# It is recommended to use it with quotes.
appVersion: "0.0.1"
44 changes: 44 additions & 0 deletions helm/examples/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,44 @@
# Examples

Contains example values file and their rendered templates.

```
cd helm
helm template [RELEASE-NAME] . -f [VALUES-FILEPATH]
```

1. `vllm-sim` in Kind

Make sure there is a gateway (Kgteway or Istio) deployed in the cluster named `llm-d-inference-gateway` or change values file accordingly.

```
helm template llmd-sim . -f examples/values-vllm-sim.yaml > examples/output-vllm-sim.yaml
```

Remove `protocol: tcp` in `initContainers` and `readinessProbe` and `livenessProbe` from epp deployment


2. `facebook/opt-125m`: downloads from Hugging Face

```
helm template facebook . -f examples/values-facebook.yaml > examples/output-facebook.yaml
```


Port forward the inference gateway

```
k port-forward svc/llm-d-inference-gateway-istio 8000:80
```

Send a request

```
curl http://localhost:8000/v1/completions -vvv \
-H "Content-Type: application/json" \
-H "x-model-name: facebook/opt-125m" \
-d '{
"model": "facebook/opt-125m",
"prompt": "Hello, "
}'
```
Loading