Highlights for the week ending 2024-10-31
General
- security-bundle version
1.9.0
introduces breaking changes. When upgrading to this version withFalco
enabled, theFalco App
may fail to upgrade due to a breaking change in the upstream chart. To complete the upgrade seamlessly, disable and then re-enable theFalco App
by settingapps.falco.enabled=[false|true]
in the security-bundle user values ConfigMap.
Observability
dashboards version
3.26.0
- Introduced “Loki - Slow Queries” dashboard for enhanced query performance insights.
- Transferred ownership from
BigMac
toShield
for better team alignment. - Resynced
alloy
,loki
, andmimir
mixins from upstream to ensure feature parity.
logging-operator version
0.14.0
- Default logging agent switched to
Alloy
, replacingPromtail
for improved performance.
- Default logging agent switched to
kube-prometheus-stack-app version
12.0.0
- Updated chart dependency to
kube-prometheus-stack-65.1.1
. - Upgraded
prometheus-operator
from0.75.0
to0.77.1
. - Prometheus upgraded from
2.53.0
to2.54.1
. - Grafana upgraded from
8.2.0
to8.5.0
. - Thanos ruler upgraded from
0.35.1
to0.36.1
. - Prometheus-node-exporter upgraded from
1.8.1
to1.8.2
. - Removed legacy in-house SLO framework to streamline integrations.
- Updated chart dependency to
prometheus-operator-crd version
12.0.0
- Upgraded CRDs chart from
13.0.2
(prometheus-operator0.75.2
) to15.0.0
(prometheus-operator0.77.1
). See upstream changelog for more details.
- Upgraded CRDs chart from
prometheus-meta-operator version
4.81.0
- Created new
monitoring-agent
inhibitions based on existingprometheus-agent
configurations for tool-agnostic monitoring. - Added
customer
label toOpsGenie
alerts to enhance alert specificity.
- Created new
prometheus-rules version
4.23.0
- Renamed all
prometheus-agent
related inhibitions tomonitoring-agent
inhibitions for clarity. - Standardized inhibition alert naming:
InhibitionPrometheusAgentFailing
andInhibitionPrometheusAgentShardsMissing
. - Corrected
statefulset.rules
naming to avoid overwritingdeployment.rules
. - Adjusted
KubeletVolumeSpaceTooLow
alert threshold to only trigger when space is critically low, relying onnode-problem-detector
otherwise. - Updated
aggregation:giantswarm:cluster_release_version
expression to include Cluster API clusters. - Updated
InhibitionControlPlaneUnhealthy
for all Cluster API clusters, not just MCs. - Added alert for
StatefulsetNotSatisfiedAtlas
. - Updated
alloy-app
to0.6.1
, including an upgrade to upstream version1.4.2
and aCiliumNetworkPolicy
fix for clustering.
- Renamed all
oauth2-proxy-app version
3.0.2
- Implemented
NetworkPolicy
to allow traffic tooauth2-proxy
. - Removed
cert-manager
ingress annotations to resolve ingress validation issues.
- Implemented
observability-bundle version
1.8.0
- Upgraded
prometheus-agent
fromv0.6.9
tov0.7.0
. - Added
extraArgs
to enable features like WAL truncation. - Upgraded
kube-prometheus-stack
from61.0.0
to65.1.1
. - Updated
prometheus-operator CRDs
from0.73.0
to0.75.0
. - Prometheus-operator upgraded from
0.75.0
to0.77.1
. - Prometheus upgraded from
2.53.0
to2.54.1
. - Grafana upgraded from
8.2.0
to8.5.0
. - Thanos ruler upgraded from
0.35.1
to0.36.1
. - Prometheus-node-exporter upgraded from
1.8.1
to1.8.2
. - Added missing
depends on
annotations foralloy-metrics
andalloy-logs
to ensure correct deployment order.
- Upgraded
Security
kyverno-policies-connectivity version
0.6.1
- Added
/tmp
emptyDir
volume to workload cluster IPJob
.
- Added
- Introduced feature gates for enabling/disabling individual
Falco
components.
- Introduced feature gates for enabling/disabling individual
starboard-exporter version
0.8.0
- Added Vertical Pod Autoscaler (VPA) configuration, enabled by default for optimized resource usage.
- Disabled logger development mode to enhance stability.
- Disabled
PodSecurityPolicy
by default. - Exposed port
8081
for health/liveness probes.
- Updated
Trivy
to upstream versionv0.56.1
for enhanced security scanning. - Disabled
PSPs
.
- Updated
trivy-operator-app version
0.10.2
- Aligned
Trivy
versions betweenTrivy
operator and the upstream project tov0.56.1
.
- Aligned
security-bundle version
1.9.0
- Updated
kyverno
(app) tov0.18.1
. - Updated
kyverno-crds
(app) tov1.12.0
. - Updated
kyverno-policies
(app) tov0.21.0
. - Updated
starboard-exporter
(app) tov0.8.0
. - Updated
trivy-operator
(app) tov0.10.2
. - Updated
trivy
(app) tov0.13.0
. - Updated
falco
(app) tov0.9.1
.
- Updated
Connectivity
- dns-operator-route53 version
0.10.0
- Added optional
--role-arn
flag to specify the roleARN
to assume when interacting withRoute53
.
- Added optional
Fleet management
app-admission-controller version
0.26.2
- Extended the
/healthz
endpoint to verify certificate validity and allowKubernetes
liveness probes to manage restarts if errors occur.
- Extended the
app-operator version
6.11.2
- Updated dependencies to ensure compatibility and security.