Chapter 10. Scaling Cluster Monitoring Operator

10.1. Overview

OpenShift Container Platform exposes metrics that the cluster-monitoring-operator can collect and store in back ends. As an OpenShift Container Platform administrator, you can view system resource, container, and component metrics in a single dashboard interface, Grafana.

This topic provides information on scaling the cluster monitoring operator.

If you want to use Prometheus with persistent storage, you must set the openshift_cluster_monitoring_operator_prometheus_storage_enabled variable in your Ansible inventory file to true.

10.2. Recommendations for OpenShift Container Platform

Use at least three infrastructure (infra) nodes.
Use at least three openshift-container-storage nodes with non-volatile memory express (NVMe) drives.
Use persistent block storage, such as OpenShift Container Storage (OCS) Block.

10.3. Capacity Planning for Cluster Monitoring Operator

Tests were performed at several scale sizes. The table below shows how the Prometheus database grew at each size.

Note

The Prometheus storage requirements below are not prescriptive. Higher resource consumption might be observed in your cluster depending on workload activity and resource use.


Table 10.1. Prometheus Database storage requirements based on number of nodes/pods in the cluster
Number of Nodes	Number of Pods	Prometheus storage growth per day	Prometheus storage growth per 15 days	RAM Space (per scale size)	Network (per tsdb chunk)
50	1800	6.3 GB	94 GB	6 GB	16 MB
100	3600	13 GB	195 GB	10 GB	26 MB
150	5400	19 GB	283 GB	12 GB	36 MB
200	7200	25 GB	375 GB	14 GB	46 MB

In the above calculation, approximately 20 percent of the expected size was added as overhead to ensure that the storage requirements do not exceed the calculated value.
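The table can also be used as a lookup when planning for a node count that falls between the tested sizes. The following Python sketch is illustrative only: the function names are hypothetical, the figures are copied from Table 10.1, and it assumes that rounding up to the next tested size is an acceptable conservative estimate. The table values are used as-is, on the reading that they already include the 20 percent overhead.

```python
# Measured values copied from Table 10.1, keyed by number of nodes.
TABLE_10_1 = {
    50: {"pods": 1800, "gb_per_day": 6.3, "gb_per_15_days": 94, "ram_gb": 6},
    100: {"pods": 3600, "gb_per_day": 13, "gb_per_15_days": 195, "ram_gb": 10},
    150: {"pods": 5400, "gb_per_day": 19, "gb_per_15_days": 283, "ram_gb": 12},
    200: {"pods": 7200, "gb_per_day": 25, "gb_per_15_days": 375, "ram_gb": 14},
}


def smallest_tested_size(nodes: int) -> int:
    """Return the smallest tested scale size that covers the node count."""
    for size in sorted(TABLE_10_1):
        if nodes <= size:
            return size
    raise ValueError(f"{nodes} nodes is beyond the largest tested size")


def storage_for_retention(nodes: int, retention_days: int = 15) -> float:
    """Estimate Prometheus storage (GB) for one replica over a retention period.

    Assumption: growth is roughly constant per day, so the per-day figure
    from the table is multiplied by the number of retention days.
    """
    row = TABLE_10_1[smallest_tested_size(nodes)]
    return row["gb_per_day"] * retention_days


if __name__ == "__main__":
    size = smallest_tested_size(120)
    print(size, storage_for_retention(120))
```

For node counts beyond 200, the sketch raises an error instead of extrapolating, because the table provides no measurements at that scale.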

The above calculation was developed for the default OpenShift Container Platform cluster-monitoring-operator. For higher scale, edit the openshift_cluster_monitoring_operator_prometheus_storage_capacity variable in the Ansible inventory file, which defaults to 50Gi.

Note

CPU utilization has minor impact. The ratio is approximately 1 core out of 40 per 50 nodes and 1800 pods.
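As a rough illustration of that ratio, the following Python sketch estimates the CPU cores consumed for a given node count. It assumes that CPU use scales linearly with the number of nodes at the tested pod density of 36 pods per node, which the tests do not confirm. The function name is hypothetical.

```python
import math

# From the note above: about 1 core per 50 nodes and 1800 pods.
NODES_PER_CORE = 50


def estimated_cores(nodes: int) -> int:
    """Estimate CPU cores, rounded up, assuming linear scaling with nodes."""
    return math.ceil(nodes / NODES_PER_CORE)


if __name__ == "__main__":
    print(estimated_cores(200))
```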

10.3.1. Lab Environment

All experiments were performed in an OpenShift Container Platform on OpenStack environment:

Infra nodes (VMs) - 40 cores, 157 GB RAM.
CNS nodes (VMs) - 16 cores, 62 GB RAM, NVMe drives.

10.3.2. Prerequisites

Based on your target scale, compute and set the relevant PV size for the Prometheus data store. Because the default number of Prometheus pod replicas is 2, for 100 nodes with 3600 pods you need 390 GB in total.

For example:

195 GB (space per 15 days) * 2 (pods) = 390 GB free

Based on this equation, set openshift_cluster_monitoring_operator_prometheus_storage_capacity=195Gi.
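The same arithmetic can be sketched in Python for other scale sizes. The function name and the returned dictionary keys are hypothetical. The sketch follows the convention used above of writing the per-replica GB figure directly as a Gi value, and it assumes that each Prometheus replica gets its own PV of that size.

```python
import math

INVENTORY_VAR = "openshift_cluster_monitoring_operator_prometheus_storage_capacity"


def prometheus_pv_plan(gb_per_15_days: float, replicas: int = 2) -> dict:
    """Compute per-replica PV size, total free space, and the inventory line."""
    per_replica = math.ceil(gb_per_15_days)  # round up to a whole unit
    return {
        "per_replica_gb": per_replica,
        "total_gb": per_replica * replicas,  # space needed across all replicas
        "inventory_line": f"{INVENTORY_VAR}={per_replica}Gi",
    }


if __name__ == "__main__":
    plan = prometheus_pv_plan(195)  # 100 nodes, 3600 pods (Table 10.1)
    print(plan["total_gb"])
    print(plan["inventory_line"])
```

The variable holds the size for one replica, while the total tells you how much free space the storage back end needs across all replicas.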
