Skip to content

Commit fd54352

Browse files
committed
generating new manifests
1 parent c8317ff commit fd54352

File tree

2 files changed

+5
-4
lines changed

2 files changed

+5
-4
lines changed

config/crd/bases/inference.networking.x-k8s.io_inferencemodels.yaml

Lines changed: 4 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -70,7 +70,7 @@ spec:
7070
modelName:
7171
description: |-
7272
ModelName is the name of the model as it will be set in the "model" parameter for an incoming request.
73-
ModelNames are expected to be unique for a specific InferencePool
73+
ModelNames must be unique for a referencing InferencePool
7474
(names can be reused for a different pool in the same cluster).
7575
The modelName with the oldest creation timestamp is retained, and the incoming
7676
InferenceModel is sets the Ready status to false with a corresponding reason.
@@ -139,8 +139,9 @@ spec:
139139
140140
If only one model is specified and it has a weight greater than 0, 100%
141141
of the traffic is forwarded to that model. If weight is set to 0, no
142-
traffic should be forwarded for this model. If unspecified, weight
143-
defaults to 1.
142+
traffic should be forwarded for this model. If all model weights are unspecified,
143+
weights will be treated as equal. If a weight is specified for some TargetModels, and not
144+
others. The unspecified weights will be treated as zero.
144145
format: int32
145146
maximum: 1000000
146147
minimum: 0

config/crd/bases/inference.networking.x-k8s.io_inferencepools.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -58,7 +58,7 @@ spec:
5858
pattern: ^(([A-Za-z0-9][-A-Za-z0-9_.]*)?[A-Za-z0-9])?$
5959
type: string
6060
description: |-
61-
Selector defines a map of label to watch model server pods
61+
Selector defines a map of labels to watch model server pods
6262
that should be included in the InferencePool.
6363
In some cases, implementations may translate this field to a Service selector, so this matches the simple
6464
map used for Service selectors instead of the full Kubernetes LabelSelector type.

0 commit comments

Comments
 (0)