Skip to content

Commit

Permalink
Add example for inference extension
Browse files Browse the repository at this point in the history
Hold until
GoogleCloudPlatform/monitoring-dashboard-samples#929
is submitted and public page is generated.
  • Loading branch information
JeffLuoo committed Feb 24, 2025
1 parent f0e896b commit 0df6183
Show file tree
Hide file tree
Showing 2 changed files with 33 additions and 0 deletions.
3 changes: 3 additions & 0 deletions examples/inference-extension/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
# Inference Extension sample manifests

Please refer to the [Google Cloud documentation](https://cloud.google.com/stackdriver/docs/managed-prometheus/exporters/inference-extension) for how to use these manifests.
30 changes: 30 additions & 0 deletions examples/inference-extension/pod-monitoring.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,30 @@
# Copyright 2025 Google LLC
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# https://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

apiVersion: monitoring.googleapis.com/v1
kind: PodMonitoring
metadata:
name: inference-extension
labels:
app.kubernetes.io/name: inference-gateway
app.kubernetes.io/part-of: google-cloud-managed-prometheus
spec:
endpoints:
- port: metrics
scheme: http
interval: 5s
path: /metrics
selector:
matchLabels:
app: inference-gateway-ext-proc

0 comments on commit 0df6183

Please sign in to comment.