Dashboards

  • Added

    • Add datasource variable to Prometheus dashboard

    Fixed

    • Adjust panel positions to fill the width and move the Mimir-related panel under its related section
  • Changed

    • ServiceMonitors Overview dashboard: add RAM usage estimation
  • Changed

    • Change “Node utilization” dashboard to use allocatable resources instead of capacity, to better reflect the resources actually available on the nodes.
  • Fixed

    • Fix Loki and Mimir mixin recording rules
    • Fix atlas dashboard tags.
    • Fix storage-related panels on Zot’s dashboards
    • Prometheus: scraping info can now be filtered by cluster
    • Add some basic linting configuration so we can track down issues in dashboards.

    Added

    • ServiceMonitors overview dashboard
    • ServiceMonitors details dashboard
  • Changed

    • Change “Worker node utilization” dashboard to “Node utilization”, which also allows analyzing data for control plane nodes.
  • Added

    • Add dashboard “Worker node utilization”.
  • Changed

    • Updated “In-cluster container registry (Zot)” dashboard to use the metric kubelet_volume_stats_used_bytes for storage used.
  • Changed

    • Move node-problem-detector to be AWS only.

    Fixed

    • Fix Grafana Cloud service-level dashboard in case we have duplicate cluster names in different installations.
    • Fix invalid datasource variable name in Mimir cost estimate dashboard.
  • Fixed

    • Fix disk usage graphs on the “Mimir / Writes resources” dashboard.
  • Changed

    • Improved many details on the dashboard “In-cluster container registry (Zot)”.
    • Change net-exporter dashboard ownership from turtles to cabbage.
    • Change cluster-total.json dashboard ownership from turtles to cabbage.
    • Change namespace-by-pod.json dashboard ownership from turtles to cabbage.
    • Change namespace-by-workload.json dashboard ownership from turtles to cabbage.
    • Change pod-total.json dashboard ownership from turtles to cabbage.
    • Change workload-total.json dashboard ownership from turtles to cabbage.
    • Update “Ingress NGINX Controller Connection Distribution” dashboard file to schema version 39.
    • Update “Giant Swarm / Kubernetes Persistent Volumes” dashboard file to replace old graph panels with new time series panels.
    • Update “Security: Falco Dashboard” dashboard file to replace the old graph panel with the new time series panel and the old table with the new table panel.

    Fixed

    • Fix disk usage graphs on the “Mimir / Reads resources” dashboard.

    Removed

    • Remove “Microstorage” dashboard.

This part of our documentation refers to our vintage product. The content may no longer be valid for our current product. Please check our new documentation hub for the latest state of our docs.