@DanielABrennand

Adds two metrics exporters (node_exporter for hardware, cAdvisor for Docker containers) to monitor the VM, plus Prometheus to collect their metrics and pass them to Grafana, where they are graphed in dashboards. Grafana also has alerts set up to notify us if either the API container or pgAdmin goes down for more than 3 minutes, or if the VM uses more than 95% of its RAM or CPU for more than 5 minutes.
Note if testing this locally: the environment variable GF_SERVER_ROOT_URL for Grafana must be commented out in the docker-compose file.
Additionally, the provisioned contact point for alerting is a dummy URL and will need to be changed manually in production.
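
For anyone testing locally, the change to the docker-compose file might look roughly like the sketch below. This is illustrative only: the service name, image, URL, and the second environment entry are placeholders and are not taken from this pull request; only the variable name GF_SERVER_ROOT_URL comes from the note above.

```yaml
# Hypothetical excerpt of a docker-compose file. The service name, image,
# and URL are placeholders, not values from this pull request.
services:
  grafana:
    image: grafana/grafana
    environment:
      # Commented out for local testing, so Grafana falls back to its
      # default root URL instead of the deployed one.
      # - GF_SERVER_ROOT_URL=https://monitoring.example.org/grafana/
      # Placeholder entry so the list is not empty once the line above
      # is commented out.
      - GF_SECURITY_ADMIN_USER=admin
```

Remember to restore the variable before deploying, since it is only meant to be disabled for local runs.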

@DanielABrennand DanielABrennand linked an issue Oct 14, 2025 that may be closed by this pull request
@NathanCummings NathanCummings merged commit 4b52e39 into develop Nov 27, 2025
2 checks passed
@NathanCummings NathanCummings deleted the Monitoring branch November 27, 2025 14:54

Development

Successfully merging this pull request may close these issues.

Monitoring and alerts