Highlights for the week ending 2024-09-26
Observability
dashboardsversion 3.24.0
- Updated Alertmanager dashboard to show related logs.
- Add Loki mixins dashboards update script.
- Update Mimir mixins dashboards via script.
- Fix Alloy mixin tags.
alloy-app version 0.5.2 introduces the following changes:
- Add a helm chart templating test to the ci pipeline.
- Add tests with ats in the CI pipeline.
- Push alloy as a gateway component in collections.
kyverno-policies-observability version 0.5.0
- Remove the policy for
ServiceMonitorandPodMonitorrelabelling schemas as we no longer need the enforcement.
- Remove the policy for
fluent-logshipping-app version 5.2.2
- Fix the Nginx Parser based on the upstream parser.
logging-operator version 0.12.1
- Fix usage of structured metadata for clusters before v20.
- Move high cardinality values into structured metadata.
- Add Kubernetes audit log
resourcelabel,filenamelabel, andoutput streamlabel. - Rename the
node_namelabel intonodeto match the metric label.
- Add “manual e2e” testing procedure.
- Add PR message template referring to the manual testing procedure.
observability-bundle version 1.6.2:
- Fixed
alloyMetricscatalog
- Fixed
observability-operator version 0.6.0:
- Require observability-bundle >= 1.6.2 for Alloy monitoring agent support; this is due to the incorrect
alloyMetricscatalogue in observability-bundle - Fix invalid Alloy config due to missing comma on external labels
- Disable logger development mode to avoid panicking; use zap as a logger.
- Fix CircleCI release pipeline.
- Add manual e2e testing procedure and script.
- Require observability-bundle >= 1.6.2 for Alloy monitoring agent support; this is due to the incorrect
prometheus-meta-operator version 4.79.0:
- Remove unused
#alertand#alert-test-installationslack integration.
- Remove unused
prometheus-rules version 4.15.2:
- Update
MimirHPAReachedMaxReplicasoperation recipe link - Fix aggregation rule of the
slo:current_burn_rate:ratioslo. - Remove aggregation of slo:period_error_budget_remaining:ratio` as this value can be easily computed and creates a lot of time series in Grafana Cloud
- Add aggregations for SLO metrics to export them to the Grafana cloud
- Add
MimirHPAReachedMaxReplicasalert to detect when Mimir’s HPAs have reached maximum capacity. - Added dashboards to several Mimir alerts
- Change
IRSAACMCertificateExpiringInLessThan60DaystoIRSAACMCertificateExpiringInLessThan45Days. The ACM certificate is renewed 60 days before expiration, and the alert can fire prematurely.
- Update
tekton-dashboard-loki-proxy version 0.4.0:
- Change
app.giantswarm.io/*labels toapplication.giantswarm.io/ - Update Golang to v1.23.1
- Change
Cluster management
aws-pod-identity-webhook version 1.17.0:
- Fix VPA being ineffective due to referring to a non-existing
Deploymentname
- Fix VPA being ineffective due to referring to a non-existing
aws-crossplane-cluster-config-operator version 0.3.0
- Configure the Crossplane
ProviderConfigto use the CAPA controller role directly without going through a middleman. For this to work, the CAPA controller must have the correct trust policy granting access to the Crossplane provider’s service account. - Write a value
oidcDomainsto the config map containing all service account issuer domains, as defined by the newaws.giantswarm.io/irsa-trust-domainsannotation on the AWSCluster. The primary domain is still written to valueoidcDomain.
- Configure the Crossplane
- Remove deprecation message for
customNodeLabelsandcustomNodeTaints, because they are not deprecated. - Allow configuring
kube-controller-manager--node-cidr-mask-sizeflag. - Chart: Support multiple service account issuers.\ Change
providerIntegration.controlPlane.kubeadmConfig.clusterConfiguration.apiServer.serviceAccountIssuerto pluralproviderIntegration.controlPlane.kubeadmConfig.clusterConfiguration.apiServer.serviceAccountIssuersand render them in the specified order as--service-account-issuerparameters for the API server. - Only add the
customNodeLabelsvalue to the kubeletnode-labelsargument in theKubeadmConfigwhencustomNodeLabelsis defined.
- Remove deprecation message for
Connectivity
- aws-load-balancer-controller-app version 2.1.0
- Update to upstream version 2.8.3.
Security
kyverno-policies-dx version 0.5.1
- Use
EnforceandAuditvalidationFailureAction.
- Use
kyverno-policies-ux version 0.7.3
cluster-namesnow targets Cluster by GVK- Use
EnforcevalidationFailureAction.
kyverno-app version 0.18.0
- Update
Kyvernoto the upstream version v1.12.5.
- Update
kyverno-crds version 1.12.0
- Update Kyverno CRDs to Kyverno v1.12.
kyverno-policies version 0.21.0
- Update to upstream
Kyverno Policiesversion 1.12.5. - Don’t push to vsphere-app-collection, capz-app-collection, capa-app-collection or cloud-director-app-collection. We started to consume kyverno-policies from security-bundle.
- Update to upstream