Dashboards

  • Fixed

    • Make Nodes Overview dashboard work for Workload Clusters and assign to team phoenix.

    Changed

    • Adjusted the ENA dashboard to have an overview of the data across time
  • Changed

    • added link from prometheus to prometheus/availability dashboard
    • added link from prometheus/availability to prometheus dashboard
  • Fixed

    • prometheus availability over period
  • Added

    • Add Loki cost estimation dashboard

    Changed

    • rework of requests vs usage to use the same metrics as kube-mixins and change filtering options
    • requests vs usage dashboard made public
  • Changed

    • Move cilium dashboard to public dashboards.
  • Changed

    • Add AWS ENA Performance
    • Updated team labels for team-rocket
    • Add graph in Node Overview to identify emptydir growth
    • Update kube-mixins to 0.12
    • Added Etcd health for monitoring the Etcd key space status
    • Adjusted the K8s Api Perfomance master nodes memory dashoboard and using node_memory_MemAvailable_bytes instead of node_memory_MemFree_bytes
  • Added

    • Prometheus availability dashboard

    Changed

    • Small improvements for Prometheus dashboard
  • Changed

    • Change the name of metric ETCD Backend Quota Low Space to ETCD Keyspace usage.
  • Added

    • Prometheus - opsrecipe dashboard
    • Prometheus Overview dashboards - from prometheus-mixins
    • Add ETCD Backend Quota Low Space to K8s API Performance Dashboard.

    Changed

    • Make main prometheus dashboard public
  • Changed

    • Monitoring Dashboard Updated
    ?? helm/dashboards/dashboards/shared/public/alertmanager-overview.json
    

    Added

    • Alertmanager / Overview Dashboard

    Changed

    • Prometheus dashboard improvements: available node resources, scraped metrics info and rules info.