OpenShift Container Storage is now OpenShift Data Foundation starting with version 4.9.
Chapter 4. Monitoring the OpenShift Data Foundation Logical Volume Manager Operator
When the OpenShift Data Foundation Logical Volume Manager Operator is installed using the OpenShift Web Console, you can monitor the cluster using the Block and File
dashboard in the console by default. However, when you use RHACM to install the OpenShift Data Foundation Logical Volume Manager Operator, you need to configure the RHACM Observability to monitor the all the SNO cluster from one place.
You can monitor the OpenShift Data Foundation Logical Volume Manager Operator by viewing the metrics exported by the operator on the RHACM dashboards and the alerts that are triggered. Enable RHACM Observability as described in the Observability guide.
- Metrics
-
Add the following
topolvm
metrics to the allow list as specified in the Adding custom metrics section:
topolvm_thinpool_data_percent topolvm_thinpool_metadata_percent topolvm_thinpool_size_bytes
topolvm_thinpool_data_percent topolvm_thinpool_metadata_percent topolvm_thinpool_size_bytes
Copy to Clipboard Copied! Toggle word wrap Toggle overflow -
Add the following
Metrics are updated every 10 minutes or when there is a change in the thin-pool, such as a new logical volume creation.
- Alerts
- When the thin pool and volume group are filled up, further operations fail and might lead to data loss. The Logical Volume Manager Operator sends the following alerts the usage of the thin pool and volume group crosses certain value:
Alert | Description |
---|---|
VolumeGroupUsageAtThresholdNearFull | This alert is triggered when both the volume group and thin pool utilization cross 75% on nodes. Data deletion or volume group expansion is required. |
VolumeGroupUsageAtThresholdCritical | This alert is triggered when both the volume group and thin pool utilization cross 85% on nodes.VolumeGroup is critically full. Data deletion or volume group expansion is required. |
ThinPoolDataUsageAtThresholdNearFull | This alert is triggered when the thin pool data utilization in the volume group crosses 75% on nodes. Data deletion or thin pool expansion is required. |
ThinPoolDataUsageAtThresholdCritical | This alert is triggered when the thin pool data utilization in the volume group crosses 85% on nodes. Data deletion or thin pool expansion is required. |
ThinPoolMetaDataUsageAtThresholdNearFull | This alert is triggered when the thin pool metadata utilization in the volume group crosses 75% on nodes. Data deletion or thin pool expansion is required. |
ThinPoolMetaDataUsageAtThresholdCritical | This alert is triggered when the thin pool metadata utilization in the volume group crosses 85% on nodes. Data deletion or thin pool expansion is required. |