Commit ae76431

feat(module): add DVCR capacity alert
Signed-off-by: Pavel Tishkov <[email protected]>
1 parent b3d8865 commit ae76431

1 file changed: +29 -4 lines changed

monitoring/prometheus-rules/dvcr.yaml

Lines changed: 29 additions & 4 deletions
@@ -10,8 +10,8 @@
         plk_protocol_version: "1"
         plk_markup_format: "markdown"
         plk_labels_as_annotations: "pod"
-        plk_create_group_if_not_exists__d8_virtualization_health: "D8VirtualizationHealth,tier=~tier,prometheus=deckhouse,kubernetes=~kubernetes"
-        plk_grouped_by__d8_virtualization_health: "D8VirtualizationHealth,tier=~tier,prometheus=deckhouse,kubernetes=~kubernetes"
+        plk_create_group_if_not_exists__d8_virtualization_dvcr_health: "D8VirtualizationDVCRHealth,tier=~tier,prometheus=deckhouse,kubernetes=~kubernetes"
+        plk_grouped_by__d8_virtualization_dvcr_health: "D8VirtualizationDVCRHealth,tier=~tier,prometheus=deckhouse,kubernetes=~kubernetes"
         summary: The dvcr Pod is NOT Ready.
         description: |
           The recommended course of action:
@@ -27,10 +27,35 @@
       annotations:
         plk_protocol_version: "1"
         plk_markup_format: "markdown"
-        plk_create_group_if_not_exists__d8_virtualization_health: "D8VirtualizationHealth,tier=~tier,prometheus=deckhouse,kubernetes=~kubernetes"
-        plk_grouped_by__d8_virtualization_health: "D8VirtualizationHealth,tier=~tier,prometheus=deckhouse,kubernetes=~kubernetes"
+        plk_create_group_if_not_exists__d8_virtualization_dvcr_health: "D8VirtualizationDVCRHealth,tier=~tier,prometheus=deckhouse,kubernetes=~kubernetes"
+        plk_grouped_by__d8_virtualization_dvcr_health: "D8VirtualizationDVCRHealth,tier=~tier,prometheus=deckhouse,kubernetes=~kubernetes"
         summary: The dvcr Pod is NOT Running.
         description: |
           The recommended course of action:
           1. Retrieve details of the Deployment: `kubectl -n d8-virtualization describe deploy dvcr`
           2. View the status of the Pod and try to figure out why it is not running: `kubectl -n d8-virtualization describe pod -l app=dvcr`
+
+    - alert: D8VirtualizationDVCRLowFreeSpace
+      expr: |
+        min by (persistentvolumeclaim, namespace) (
+          (kubelet_volume_stats_available_bytes{namespace="d8-virtualization", persistentvolumeclaim="dvcr"} < 5 * 1024 * 1024 * 1024)
+          or
+          (kubelet_volume_stats_available_bytes{namespace="d8-virtualization", persistentvolumeclaim="dvcr"} / (kubelet_volume_stats_capacity_bytes / 100) < 20)
+        )
+      for: 10m
+      labels:
+        severity_level: "6"
+        tier: cluster
+      annotations:
+        plk_protocol_version: "1"
+        plk_markup_format: "markdown"
+        plk_create_group_if_not_exists__d8_virtualization_dvcr_health: "D8VirtualizationDVCRHealth,tier=~tier,prometheus=deckhouse,kubernetes=~kubernetes"
+        plk_grouped_by__d8_virtualization_dvcr_health: "D8VirtualizationDVCRHealth,tier=~tier,prometheus=deckhouse,kubernetes=~kubernetes"
+        summary: DVCR storage has low free space.
+        description: |
+          The DVCR storage has less than 5GB or 20% free space remaining.
+
+          The recommended course of action:
+
+          - Check PVC details: `kubectl -n d8-virtualization describe pvc dvcr`
+          - Extend DVCR storage size in the virtualization module settings or remove unnecessary images (ClusterVirtualImage\VirtualImage).
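
Note on the D8VirtualizationDVCRLowFreeSpace expression: the condition combines an absolute threshold (`5 * 1024 * 1024 * 1024` bytes, i.e. 5 GiB) with a relative one (less than 20% of capacity free). The sketch below restates that arithmetic in Python to make the crossover visible. It is an illustration, not part of the commit: the function name `dvcr_low_free_space` and the sample sizes are hypothetical, capacity is assumed to be positive, and the `for: 10m` hold period is not modeled.

```python
GIB = 1024 ** 3  # the rule's 5 * 1024 * 1024 * 1024 is 5 of these


def dvcr_low_free_space(available_bytes: float, capacity_bytes: float) -> bool:
    """Hypothetical helper mirroring the two branches of the alert expression.

    Assumes capacity_bytes > 0; the real rule is evaluated by Prometheus.
    """
    # Branch 1: kubelet_volume_stats_available_bytes < 5 GiB
    below_absolute = available_bytes < 5 * GIB
    # Branch 2: available / (capacity / 100) < 20, i.e. under 20% free
    percent_free = available_bytes / (capacity_bytes / 100)
    below_relative = percent_free < 20
    # PromQL `or` keeps a sample if either branch produced one
    return below_absolute or below_relative


# 50 GiB volume, 9 GiB free: 18% free, so the relative branch matches.
print(dvcr_low_free_space(9 * GIB, 50 * GIB))
# 10 GiB volume, 4 GiB free: 40% free, but under 5 GiB, so the absolute branch matches.
print(dvcr_low_free_space(4 * GIB, 10 * GIB))
# 100 GiB volume, 30 GiB free: neither branch matches.
print(dvcr_low_free_space(30 * GIB, 100 * GIB))
```

Under these assumptions the two thresholds meet at a 25 GiB volume, where 5 GiB is exactly 20%. For smaller volumes the 5 GiB branch is reached first; for larger ones the 20% branch is.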