Skip to content

Comments

[WIP] MetricsV2 + OTel+Prometheus sample and Grafana dashboard#3154

Draft
csviri wants to merge 32 commits intooperator-framework:nextfrom
csviri:otel-metrics-grafana
Draft

[WIP] MetricsV2 + OTel+Prometheus sample and Grafana dashboard#3154
csviri wants to merge 32 commits intooperator-framework:nextfrom
csviri:otel-metrics-grafana

Conversation

@csviri
Copy link
Collaborator

@csviri csviri commented Feb 4, 2026

Goal of this PR is to provide a OTel + Prometheus + Grafana setup. So we:

  1. verify integration with OTel
  2. Provide a default Grafana dashboard for metrics.
  3. Changes webpage sample to use Otel and deploy the app and dasboard to a cluster with OTel + Prometheus + Grafana. This should be easy to done by a users, so dey can check and validate the dashboard.
  4. new metrics implementations

Notes on new metrics implementation:

  • only static gauges (lives until the operator lives) - that makes much easir to manage them
  • no manual counter removal (should not be done)
  • sanitized names and labels according to best practices
  • added some metrics counter for last attempt retries.

@openshift-ci openshift-ci bot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Feb 4, 2026
@csviri csviri changed the title OTel+Prometheus sample and Graphana dashboard [WIP] OTel+Prometheus sample and Graphana dashboard Feb 4, 2026
@csviri csviri changed the title [WIP] OTel+Prometheus sample and Graphana dashboard [WIP] OTel+Prometheus sample and Grfana dashboard Feb 8, 2026
@csviri csviri changed the title [WIP] OTel+Prometheus sample and Grfana dashboard [WIP] OTel+Prometheus sample and Grafana dashboard Feb 8, 2026
@csviri csviri force-pushed the otel-metrics-grafana branch from c7e6ca2 to ece63e8 Compare February 8, 2026 15:25
@csviri
Copy link
Collaborator Author

csviri commented Feb 9, 2026

JVM metrics:

image image

@csviri
Copy link
Collaborator Author

csviri commented Feb 9, 2026

JODSK metrics:

image

Added TODOs to improve those, like we should not have controller name as suffix, rather as a tag:

operator_sdk_reconciliations_executions_webpagestandalonedependentsreconciler

@csviri csviri linked an issue Feb 10, 2026 that may be closed by this pull request
@csviri csviri requested review from metacosm and xstefank February 10, 2026 11:50
@csviri csviri changed the title [WIP] OTel+Prometheus sample and Grafana dashboard [WIP] MetricsV2 + OTel+Prometheus sample and Grafana dashboard Feb 11, 2026
@csviri csviri force-pushed the otel-metrics-grafana branch from 5f446ac to 0e0c122 Compare February 17, 2026 10:50
csviri and others added 13 commits February 21, 2026 13:06
…ork#3106)

- Adds utility that provides methods to update resources using comparable resource versions
- Integrates this utility to the core of the framework (thus simplifying `ReconciliationDispatcher`)
  -  note that this introduces a change in the behavior for the `UpdateControl.patchStatus` (and others), since it won't trigger the reconiliation for the event in that update.
-  Renames former `ReconcilerUtils` to `ReconcilerUtilsInternal`, this is breaking but that utils was never advertised for non-internal usage
- Includes also fixes for ControllerEventSource event filtering
- Improves TemporaryResourceCache event filtering algorithm

Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Steve Hawkins <shawkins@redhat.com>
Co-authored-by: Steve Hawkins <shawkins@redhat.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
@csviri csviri force-pushed the otel-metrics-grafana branch from 88c118c to cf9eb57 Compare February 21, 2026 12:06
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
@openshift-merge-robot
Copy link

PR needs rebase.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@openshift-merge-robot openshift-merge-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Feb 23, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD.

Projects

None yet

2 participants