Skip to content

Conversation

@timkpaine
Copy link
Member

@timkpaine timkpaine commented Apr 15, 2024

Adding an option to periodically scrape the ray metrics actor, and publish to appropriate perspective tables.

I tested this mostly low scale (2-5 second scraping logic, running for a few hours, adding unit test, that kind of thing). It's possible we want this to be more of an "experimental" feature, and/or restrict the tables and metrics we actually care to scrape/publish, but I think this is cool enough for now that we should publish this as an option.

leaving for a few hours, my local cluster cpu/memory was largely the same throughout, and looked like this:
image
which also seems to reflect closely the default option for our cluster, which looks like this:

image

(Note this is all running on my local, head node)

With minimal effort, we suddenly get access to all these (and more!) perspective metrics:

image

and the metrics are pretty expansive,

image

@timkpaine timkpaine force-pushed the tbg/prometheus branch 2 times, most recently from 8dc5377 to 67493ec Compare April 15, 2024 15:47
@github-actions
Copy link
Contributor

github-actions bot commented Apr 15, 2024

Test Results

3 tests  +1   3 ✅ +1   1m 23s ⏱️ +38s
1 suites ±0   0 💤 ±0 
1 files   ±0   0 ❌ ±0 

Results for commit ad17147. ± Comparison against base commit f689ff8.

♻️ This comment has been updated with latest results.

@gauglertodd gauglertodd linked an issue May 6, 2024 that may be closed by this pull request
@timkpaine timkpaine added the type: feature Feature requests label Jun 11, 2024
@gauglertodd gauglertodd force-pushed the tbg/prometheus branch 7 times, most recently from 46f55f9 to 1abc30b Compare September 9, 2024 16:26
Co-authored-by: Todd Gaugler <[email protected]>
Signed-off-by: Tim Paine <[email protected]>

Adding prometheus dashboard support

Co-authored-by: Todd Gaugler <[email protected]>
Signed-off-by: Tim Paine <[email protected]>

.

.

...

...

...

...

...
@ptomecek
Copy link
Collaborator

@gauglertodd @timkpaine What's the status of this PR?

@timkpaine timkpaine marked this pull request as draft August 14, 2025 20:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

type: feature Feature requests

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Add Support for Prometheus Metrics

3 participants