Observability
Fix
- Fix missing ingress slo in slo reporting.
Changed
- Upgrade to fluent-bit 3.0.6 to fix some recent CVEs.
Changed
- Upgraded upstream chart from 5.47.2 to 5.48.0 - see changelog for more information.
Added
- Add datasource variable to Prometheus dashboard
Fixed
- Adjust panel positions to fill width and move Mimir related panel under related section
Changed
- ServiceMonitors Overview dashboard: add RAM usage estimation
Changed
- Change “Node utilization” dashboard to use allocatable over capacity, to better reflect the available resources of the nodes.
Changed
- Upgraded chart dependency from kube-prometheus-stack-58.5.2 to kube-prometheus-stack-58.5.2
- prometheus upgraded from 2.51.2 to 2.52.0
- thanos ruler upgraded from 0.34.1 to 0.35.0
Changed
- Upgraded chart dependency from kube-prometheus-stack-58.5.2 to kube-prometheus-stack-58.5.2
- prometheus upgraded from 2.51.2 to 2.52.0
- thanos ruler upgraded from 0.34.1 to 0.35.0
Fixed
- Fix loki and mimir mixins recording rules
- Fix atlas dashboard tags.
- Fix storage related panes on zot’s dashboards
- prometheus: scraping info can now be filtered by cluster
- add some basic linting configuration so we can track down issues in dashboards.
Added
- ServiceMonitors overview dashboard
- ServiceMonitors details dashboard
Changed
- Add resources requests for the gateway.
- Add hpa resource and enable it by default for the querier and the distributor.
- Enable hpa for the gateway.