Home
Products
OpenShift Container Platform
4.14
Support
Chapter 4. Remote health monitoring with connected clusters

Chapter 4. Remote health monitoring with connected clusters

4.1. About remote health monitoring
Copy link

OpenShift Container Platform collects telemetry and configuration data about your cluster and reports it to Red Hat by using the Telemeter Client and the Insights Operator. The data that is provided to Red Hat enables the benefits outlined in this document.

A cluster that reports data to Red Hat through Telemetry and the Insights Operator is considered a connected cluster.

Telemetry is the term that Red Hat uses to describe the information being sent to Red Hat by the OpenShift Container Platform Telemeter Client. Lightweight attributes are sent from connected clusters to Red Hat to enable subscription management automation, monitor the health of clusters, assist with support, and improve customer experience.

The Insights Operator gathers OpenShift Container Platform configuration data and sends it to Red Hat. The data is used to produce insights about potential issues that a cluster might be exposed to. These insights are communicated to cluster administrators on OpenShift Cluster Manager.

More information is provided in this document about these two processes.

Telemetry and Insights Operator benefits

Telemetry and the Insights Operator enable the following benefits for end-users:

Enhanced identification and resolution of issues. Events that might seem normal to an end-user can be observed by Red Hat from a broader perspective across a fleet of clusters. Some issues can be more rapidly identified from this point of view and resolved without an end-user needing to open a support case or file a Jira issue.
Advanced release management. OpenShift Container Platform offers the candidate, fast, and stable release channels, which enable you to choose an update strategy. The graduation of a release from fast to stable is dependent on the success rate of updates and on the events seen during upgrades. With the information provided by connected clusters, Red Hat can improve the quality of releases to stable channels and react more rapidly to issues found in the fast channels.
Targeted prioritization of new features and functionality. The data collected provides insights about which areas of OpenShift Container Platform are used most. With this information, Red Hat can focus on developing the new features and functionality that have the greatest impact for our customers.
A streamlined support experience. You can provide a cluster ID for a connected cluster when creating a support ticket on the Red Hat Customer Portal. This enables Red Hat to deliver a streamlined support experience that is specific to your cluster, by using the connected information. This document provides more information about that enhanced support experience.
Predictive analytics. The insights displayed for your cluster on OpenShift Cluster Manager are enabled by the information collected from connected clusters. Red Hat is investing in applying deep learning, machine learning, and artificial intelligence automation to help identify issues that OpenShift Container Platform clusters are exposed to.

4.1.1. About Telemetry
Copy link

Telemetry sends a carefully chosen subset of the cluster monitoring metrics to Red Hat. The Telemeter Client fetches the metrics values every four minutes and thirty seconds and uploads the data to Red Hat. These metrics are described in this document.

This stream of data is used by Red Hat to monitor the clusters in real-time and to react as necessary to problems that impact our customers. It also allows Red Hat to roll out OpenShift Container Platform upgrades to customers to minimize service impact and continuously improve the upgrade experience.

This debugging information is available to Red Hat Support and Engineering teams with the same restrictions as accessing data reported through support cases. All connected cluster information is used by Red Hat to help make OpenShift Container Platform better and more intuitive to use.

4.1.1.1. Information collected by Telemetry
Copy link

The following information is collected by Telemetry:

4.1.1.1.1. System information
Copy link

Version information, including the OpenShift Container Platform cluster version and installed update details that are used to determine update version availability
Update information, including the number of updates available per cluster, the channel and image repository used for an update, update progress information, and the number of errors that occur in an update
The unique random identifier that is generated during an installation
Configuration details that help Red Hat Support to provide beneficial support for customers, including node configuration at the cloud infrastructure level, hostnames, IP addresses, Kubernetes pod names, namespaces, and services
The OpenShift Container Platform framework components installed in a cluster and their condition and status
Events for all namespaces listed as "related objects" for a degraded Operator
Information about degraded software
Information about the validity of certificates
The name of the provider platform that OpenShift Container Platform is deployed on and the data center location

4.1.1.1.2. Sizing Information
Copy link

Sizing information about clusters, machine types, and machines, including the number of CPU cores and the amount of RAM used for each
The number of etcd members and the number of objects stored in the etcd cluster
Number of application builds by build strategy type

4.1.1.1.3. Usage information
Copy link

Usage information about components, features, and extensions
Usage details about Technology Previews and unsupported configurations

Telemetry does not collect identifying information such as usernames or passwords. Red Hat does not intend to collect personal information. If Red Hat discovers that personal information has been inadvertently received, Red Hat will delete such information. To the extent that any telemetry data constitutes personal data, please refer to the Red Hat Privacy Statement for more information about Red Hat’s privacy practices.

4.1.2. About the Insights Operator
Copy link

Users of OpenShift Container Platform can display the report of each cluster in the Insights Advisor service on Red Hat Hybrid Cloud Console. If any issues have been identified, Insights provides further details and, if available, steps on how to solve a problem.

The Insights Operator does not collect identifying information, such as user names, passwords, or certificates. See Red Hat Insights Data & Application Security for information about Red Hat Insights data collection and controls.

Red Hat uses all connected cluster information to:

Identify potential cluster issues and provide a solution and preventive actions in the Insights Advisor service on Red Hat Hybrid Cloud Console
Improve OpenShift Container Platform by providing aggregated and critical information to product and support teams
Make OpenShift Container Platform more intuitive

4.1.2.1. Information collected by the Insights Operator
Copy link

The following information is collected by the Insights Operator:

General information about your cluster and its components to identify issues that are specific to your OpenShift Container Platform version and environment
Configuration files, such as the image registry configuration, of your cluster to determine incorrect settings and issues that are specific to parameters you set
Errors that occur in the cluster components
Progress information of running updates, and the status of any component upgrades
Details of the platform that OpenShift Container Platform is deployed on, such as Amazon Web Services, and the region that the cluster is located in
Cluster workload information transformed into discreet Secure Hash Algorithm (SHA) values, which allows Red Hat to assess workloads for security and version vulnerabilities without disclosing sensitive details
If an Operator reports an issue, information is collected about core OpenShift Container Platform pods in the openshift-* and kube-* projects. This includes state, resource, security context, volume information, and more.

4.1.3. Understanding Telemetry and Insights Operator data flow
Copy link

The Telemeter Client collects selected time series data from the Prometheus API. The time series data is uploaded to api.openshift.com every four minutes and thirty seconds for processing.

The Insights Operator gathers selected data from the Kubernetes API and the Prometheus API into an archive. The archive is uploaded to OpenShift Cluster Manager every two hours for processing. The Insights Operator also downloads the latest Insights analysis from OpenShift Cluster Manager. This is used to populate the Insights status pop-up that is included in the Overview page in the OpenShift Container Platform web console.

All of the communication with Red Hat occurs over encrypted channels by using Transport Layer Security (TLS) and mutual certificate authentication. All of the data is encrypted in transit and at rest.

Access to the systems that handle customer data is controlled through multi-factor authentication and strict authorization controls. Access is granted on a need-to-know basis and is limited to required operations.

Telemetry and Insights Operator data flow

Telemetry and Insights Operator data flow

4.1.4. Additional details about how remote health monitoring data is used
Copy link

The information collected to enable remote health monitoring is detailed in Information collected by Telemetry and Information collected by the Insights Operator.

As further described in the preceding sections of this document, Red Hat collects data about your use of the Red Hat Product(s) for purposes such as providing support and upgrades, optimizing performance or configuration, minimizing service impacts, identifying and remediating threats, troubleshooting, improving the offerings and user experience, responding to issues, and for billing purposes if applicable.

Collection safeguards

Red Hat employs technical and organizational measures designed to protect the telemetry and configuration data.

Sharing

Red Hat may share the data collected through Telemetry and the Insights Operator internally within Red Hat to improve your user experience. Red Hat may share telemetry and configuration data with its business partners in an aggregated form that does not identify customers to help the partners better understand their markets and their customers’ use of Red Hat offerings or to ensure the successful integration of products jointly supported by those partners.

Third parties

Red Hat may engage certain third parties to assist in the collection, analysis, and storage of the Telemetry and configuration data.

User control / enabling and disabling telemetry and configuration data collection

You may disable OpenShift Container Platform Telemetry and the Insights Operator by following the instructions in Remote health reporting.

4.2. Showing data collected by remote health monitoring
Copy link

As an administrator, you can review the metrics collected by Telemetry and the Insights Operator.

4.2.1. Showing data collected by Telemetry
Copy link

You can view the cluster and components time series data captured by Telemetry.

Prerequisites

You have installed the OpenShift Container Platform CLI (oc).
You have access to the cluster as a user with the cluster-admin role or the cluster-monitoring-view role.

Procedure

Run the following command, which queries a cluster’s Prometheus service and returns the full set of time series data captured by Telemetry:

curl -G -k -H "Authorization: Bearer $(oc whoami -t)" \
https://$(oc get route prometheus-k8s-federate -n \
openshift-monitoring -o jsonpath="{.spec.host}")/federate \
--data-urlencode 'match[]={__name__=~"cluster:usage:.*"}' \
--data-urlencode 'match[]={__name__="count:up0"}' \
--data-urlencode 'match[]={__name__="count:up1"}' \
--data-urlencode 'match[]={__name__="cluster_version"}' \
--data-urlencode 'match[]={__name__="cluster_version_available_updates"}' \
--data-urlencode 'match[]={__name__="cluster_version_capability"}' \
--data-urlencode 'match[]={__name__="cluster_operator_up"}' \
--data-urlencode 'match[]={__name__="cluster_operator_conditions"}' \
--data-urlencode 'match[]={__name__="cluster_version_payload"}' \
--data-urlencode 'match[]={__name__="cluster_installer"}' \
--data-urlencode 'match[]={__name__="cluster_infrastructure_provider"}' \
--data-urlencode 'match[]={__name__="cluster_feature_set"}' \
--data-urlencode 'match[]={__name__="instance:etcd_object_counts:sum"}' \
--data-urlencode 'match[]={__name__="ALERTS",alertstate="firing"}' \
--data-urlencode 'match[]={__name__="code:apiserver_request_total:rate:sum"}' \
--data-urlencode 'match[]={__name__="cluster:capacity_cpu_cores:sum"}' \
--data-urlencode 'match[]={__name__="cluster:capacity_memory_bytes:sum"}' \
--data-urlencode 'match[]={__name__="cluster:cpu_usage_cores:sum"}' \
--data-urlencode 'match[]={__name__="cluster:memory_usage_bytes:sum"}' \
--data-urlencode 'match[]={__name__="openshift:cpu_usage_cores:sum"}' \
--data-urlencode 'match[]={__name__="openshift:memory_usage_bytes:sum"}' \
--data-urlencode 'match[]={__name__="workload:cpu_usage_cores:sum"}' \
--data-urlencode 'match[]={__name__="workload:memory_usage_bytes:sum"}' \
--data-urlencode 'match[]={__name__="cluster:virt_platform_nodes:sum"}' \
--data-urlencode 'match[]={__name__="cluster:node_instance_type_count:sum"}' \
--data-urlencode 'match[]={__name__="cnv:vmi_status_running:count"}' \
--data-urlencode 'match[]={__name__="cluster:vmi_request_cpu_cores:sum"}' \
--data-urlencode 'match[]={__name__="node_role_os_version_machine:cpu_capacity_cores:sum"}' \
--data-urlencode 'match[]={__name__="node_role_os_version_machine:cpu_capacity_sockets:sum"}' \
--data-urlencode 'match[]={__name__="subscription_sync_total"}' \
--data-urlencode 'match[]={__name__="olm_resolution_duration_seconds"}' \
--data-urlencode 'match[]={__name__="csv_succeeded"}' \
--data-urlencode 'match[]={__name__="csv_abnormal"}' \
--data-urlencode 'match[]={__name__="cluster:kube_persistentvolumeclaim_resource_requests_storage_bytes:provisioner:sum"}' \
--data-urlencode 'match[]={__name__="cluster:kubelet_volume_stats_used_bytes:provisioner:sum"}' \
--data-urlencode 'match[]={__name__="ceph_cluster_total_bytes"}' \
--data-urlencode 'match[]={__name__="ceph_cluster_total_used_raw_bytes"}' \
--data-urlencode 'match[]={__name__="ceph_health_status"}' \
--data-urlencode 'match[]={__name__="odf_system_raw_capacity_total_bytes"}' \
--data-urlencode 'match[]={__name__="odf_system_raw_capacity_used_bytes"}' \
--data-urlencode 'match[]={__name__="odf_system_health_status"}' \
--data-urlencode 'match[]={__name__="job:ceph_osd_metadata:count"}' \
--data-urlencode 'match[]={__name__="job:kube_pv:count"}' \
--data-urlencode 'match[]={__name__="job:odf_system_pvs:count"}' \
--data-urlencode 'match[]={__name__="job:ceph_pools_iops:total"}' \
--data-urlencode 'match[]={__name__="job:ceph_pools_iops_bytes:total"}' \
--data-urlencode 'match[]={__name__="job:ceph_versions_running:count"}' \
--data-urlencode 'match[]={__name__="job:noobaa_total_unhealthy_buckets:sum"}' \
--data-urlencode 'match[]={__name__="job:noobaa_bucket_count:sum"}' \
--data-urlencode 'match[]={__name__="job:noobaa_total_object_count:sum"}' \
--data-urlencode 'match[]={__name__="odf_system_bucket_count", system_type="OCS", system_vendor="Red Hat"}' \
--data-urlencode 'match[]={__name__="odf_system_objects_total", system_type="OCS", system_vendor="Red Hat"}' \
--data-urlencode 'match[]={__name__="noobaa_accounts_num"}' \
--data-urlencode 'match[]={__name__="noobaa_total_usage"}' \
--data-urlencode 'match[]={__name__="console_url"}' \
--data-urlencode 'match[]={__name__="cluster:ovnkube_master_egress_routing_via_host:max"}' \
--data-urlencode 'match[]={__name__="cluster:network_attachment_definition_instances:max"}' \
--data-urlencode 'match[]={__name__="cluster:network_attachment_definition_enabled_instance_up:max"}' \
--data-urlencode 'match[]={__name__="cluster:ingress_controller_aws_nlb_active:sum"}' \
--data-urlencode 'match[]={__name__="cluster:route_metrics_controller_routes_per_shard:min"}' \
--data-urlencode 'match[]={__name__="cluster:route_metrics_controller_routes_per_shard:max"}' \
--data-urlencode 'match[]={__name__="cluster:route_metrics_controller_routes_per_shard:avg"}' \
--data-urlencode 'match[]={__name__="cluster:route_metrics_controller_routes_per_shard:median"}' \
--data-urlencode 'match[]={__name__="cluster:openshift_route_info:tls_termination:sum"}' \
--data-urlencode 'match[]={__name__="insightsclient_request_send_total"}' \
--data-urlencode 'match[]={__name__="cam_app_workload_migrations"}' \
--data-urlencode 'match[]={__name__="cluster:apiserver_current_inflight_requests:sum:max_over_time:2m"}' \
--data-urlencode 'match[]={__name__="cluster:alertmanager_integrations:max"}' \
--data-urlencode 'match[]={__name__="cluster:telemetry_selected_series:count"}' \
--data-urlencode 'match[]={__name__="openshift:prometheus_tsdb_head_series:sum"}' \
--data-urlencode 'match[]={__name__="openshift:prometheus_tsdb_head_samples_appended_total:sum"}' \
--data-urlencode 'match[]={__name__="monitoring:container_memory_working_set_bytes:sum"}' \
--data-urlencode 'match[]={__name__="namespace_job:scrape_series_added:topk3_sum1h"}' \
--data-urlencode 'match[]={__name__="namespace_job:scrape_samples_post_metric_relabeling:topk3"}' \
--data-urlencode 'match[]={__name__="monitoring:haproxy_server_http_responses_total:sum"}' \
--data-urlencode 'match[]={__name__="rhmi_status"}' \
--data-urlencode 'match[]={__name__="status:upgrading:version:rhoam_state:max"}' \
--data-urlencode 'match[]={__name__="state:rhoam_critical_alerts:max"}' \
--data-urlencode 'match[]={__name__="state:rhoam_warning_alerts:max"}' \
--data-urlencode 'match[]={__name__="rhoam_7d_slo_percentile:max"}' \
--data-urlencode 'match[]={__name__="rhoam_7d_slo_remaining_error_budget:max"}' \
--data-urlencode 'match[]={__name__="cluster_legacy_scheduler_policy"}' \
--data-urlencode 'match[]={__name__="cluster_master_schedulable"}' \
--data-urlencode 'match[]={__name__="che_workspace_status"}' \
--data-urlencode 'match[]={__name__="che_workspace_started_total"}' \
--data-urlencode 'match[]={__name__="che_workspace_failure_total"}' \
--data-urlencode 'match[]={__name__="che_workspace_start_time_seconds_sum"}' \
--data-urlencode 'match[]={__name__="che_workspace_start_time_seconds_count"}' \
--data-urlencode 'match[]={__name__="cco_credentials_mode"}' \
--data-urlencode 'match[]={__name__="cluster:kube_persistentvolume_plugin_type_counts:sum"}' \
--data-urlencode 'match[]={__name__="visual_web_terminal_sessions_total"}' \
--data-urlencode 'match[]={__name__="acm_managed_cluster_info"}' \
--data-urlencode 'match[]={__name__="cluster:vsphere_vcenter_info:sum"}' \
--data-urlencode 'match[]={__name__="cluster:vsphere_esxi_version_total:sum"}' \
--data-urlencode 'match[]={__name__="cluster:vsphere_node_hw_version_total:sum"}' \
--data-urlencode 'match[]={__name__="openshift:build_by_strategy:sum"}' \
--data-urlencode 'match[]={__name__="rhods_aggregate_availability"}' \
--data-urlencode 'match[]={__name__="rhods_total_users"}' \
--data-urlencode 'match[]={__name__="instance:etcd_disk_wal_fsync_duration_seconds:histogram_quantile",quantile="0.99"}' \
--data-urlencode 'match[]={__name__="instance:etcd_mvcc_db_total_size_in_bytes:sum"}' \
--data-urlencode 'match[]={__name__="instance:etcd_network_peer_round_trip_time_seconds:histogram_quantile",quantile="0.99"}' \
--data-urlencode 'match[]={__name__="instance:etcd_mvcc_db_total_size_in_use_in_bytes:sum"}' \
--data-urlencode 'match[]={__name__="instance:etcd_disk_backend_commit_duration_seconds:histogram_quantile",quantile="0.99"}' \
--data-urlencode 'match[]={__name__="jaeger_operator_instances_storage_types"}' \
--data-urlencode 'match[]={__name__="jaeger_operator_instances_strategies"}' \
--data-urlencode 'match[]={__name__="jaeger_operator_instances_agent_strategies"}' \
--data-urlencode 'match[]={__name__="appsvcs:cores_by_product:sum"}' \
--data-urlencode 'match[]={__name__="nto_custom_profiles:count"}' \
--data-urlencode 'match[]={__name__="openshift_csi_share_configmap"}' \
--data-urlencode 'match[]={__name__="openshift_csi_share_secret"}' \
--data-urlencode 'match[]={__name__="openshift_csi_share_mount_failures_total"}' \
--data-urlencode 'match[]={__name__="openshift_csi_share_mount_requests_total"}' \
--data-urlencode 'match[]={__name__="cluster:velero_backup_total:max"}' \
--data-urlencode 'match[]={__name__="cluster:velero_restore_total:max"}' \
--data-urlencode 'match[]={__name__="eo_es_storage_info"}' \
--data-urlencode 'match[]={__name__="eo_es_redundancy_policy_info"}' \
--data-urlencode 'match[]={__name__="eo_es_defined_delete_namespaces_total"}' \
--data-urlencode 'match[]={__name__="eo_es_misconfigured_memory_resources_info"}' \
--data-urlencode 'match[]={__name__="cluster:eo_es_data_nodes_total:max"}' \
--data-urlencode 'match[]={__name__="cluster:eo_es_documents_created_total:sum"}' \
--data-urlencode 'match[]={__name__="cluster:eo_es_documents_deleted_total:sum"}' \
--data-urlencode 'match[]={__name__="pod:eo_es_shards_total:max"}' \
--data-urlencode 'match[]={__name__="eo_es_cluster_management_state_info"}' \
--data-urlencode 'match[]={__name__="imageregistry:imagestreamtags_count:sum"}' \
--data-urlencode 'match[]={__name__="imageregistry:operations_count:sum"}' \
--data-urlencode 'match[]={__name__="log_logging_info"}' \
--data-urlencode 'match[]={__name__="log_collector_error_count_total"}' \
--data-urlencode 'match[]={__name__="log_forwarder_pipeline_info"}' \
--data-urlencode 'match[]={__name__="log_forwarder_input_info"}' \
--data-urlencode 'match[]={__name__="log_forwarder_output_info"}' \
--data-urlencode 'match[]={__name__="cluster:log_collected_bytes_total:sum"}' \
--data-urlencode 'match[]={__name__="cluster:log_logged_bytes_total:sum"}' \
--data-urlencode 'match[]={__name__="cluster:kata_monitor_running_shim_count:sum"}' \
--data-urlencode 'match[]={__name__="platform:hypershift_hostedclusters:max"}' \
--data-urlencode 'match[]={__name__="platform:hypershift_nodepools:max"}' \
--data-urlencode 'match[]={__name__="namespace:noobaa_unhealthy_bucket_claims:max"}' \
--data-urlencode 'match[]={__name__="namespace:noobaa_buckets_claims:max"}' \
--data-urlencode 'match[]={__name__="namespace:noobaa_unhealthy_namespace_resources:max"}' \
--data-urlencode 'match[]={__name__="namespace:noobaa_namespace_resources:max"}' \
--data-urlencode 'match[]={__name__="namespace:noobaa_unhealthy_namespace_buckets:max"}' \
--data-urlencode 'match[]={__name__="namespace:noobaa_namespace_buckets:max"}' \
--data-urlencode 'match[]={__name__="namespace:noobaa_accounts:max"}' \
--data-urlencode 'match[]={__name__="namespace:noobaa_usage:max"}' \
--data-urlencode 'match[]={__name__="namespace:noobaa_system_health_status:max"}' \
--data-urlencode 'match[]={__name__="ocs_advanced_feature_usage"}' \
--data-urlencode 'match[]={__name__="os_image_url_override:sum"}'

$ curl -G -k -H "Authorization: Bearer $(oc whoami -t)" \
https://$(oc get route prometheus-k8s-federate -n \
openshift-monitoring -o jsonpath="{.spec.host}")/federate \
--data-urlencode 'match[]={__name__=~"cluster:usage:.*"}' \
--data-urlencode 'match[]={__name__="count:up0"}' \
--data-urlencode 'match[]={__name__="count:up1"}' \
--data-urlencode 'match[]={__name__="cluster_version"}' \
--data-urlencode 'match[]={__name__="cluster_version_available_updates"}' \
--data-urlencode 'match[]={__name__="cluster_version_capability"}' \
--data-urlencode 'match[]={__name__="cluster_operator_up"}' \
--data-urlencode 'match[]={__name__="cluster_operator_conditions"}' \
--data-urlencode 'match[]={__name__="cluster_version_payload"}' \
--data-urlencode 'match[]={__name__="cluster_installer"}' \
--data-urlencode 'match[]={__name__="cluster_infrastructure_provider"}' \
--data-urlencode 'match[]={__name__="cluster_feature_set"}' \
--data-urlencode 'match[]={__name__="instance:etcd_object_counts:sum"}' \
--data-urlencode 'match[]={__name__="ALERTS",alertstate="firing"}' \
--data-urlencode 'match[]={__name__="code:apiserver_request_total:rate:sum"}' \
--data-urlencode 'match[]={__name__="cluster:capacity_cpu_cores:sum"}' \
--data-urlencode 'match[]={__name__="cluster:capacity_memory_bytes:sum"}' \
--data-urlencode 'match[]={__name__="cluster:cpu_usage_cores:sum"}' \
--data-urlencode 'match[]={__name__="cluster:memory_usage_bytes:sum"}' \
--data-urlencode 'match[]={__name__="openshift:cpu_usage_cores:sum"}' \
--data-urlencode 'match[]={__name__="openshift:memory_usage_bytes:sum"}' \
--data-urlencode 'match[]={__name__="workload:cpu_usage_cores:sum"}' \
--data-urlencode 'match[]={__name__="workload:memory_usage_bytes:sum"}' \
--data-urlencode 'match[]={__name__="cluster:virt_platform_nodes:sum"}' \
--data-urlencode 'match[]={__name__="cluster:node_instance_type_count:sum"}' \
--data-urlencode 'match[]={__name__="cnv:vmi_status_running:count"}' \
--data-urlencode 'match[]={__name__="cluster:vmi_request_cpu_cores:sum"}' \
--data-urlencode 'match[]={__name__="node_role_os_version_machine:cpu_capacity_cores:sum"}' \
--data-urlencode 'match[]={__name__="node_role_os_version_machine:cpu_capacity_sockets:sum"}' \
--data-urlencode 'match[]={__name__="subscription_sync_total"}' \
--data-urlencode 'match[]={__name__="olm_resolution_duration_seconds"}' \
--data-urlencode 'match[]={__name__="csv_succeeded"}' \
--data-urlencode 'match[]={__name__="csv_abnormal"}' \
--data-urlencode 'match[]={__name__="cluster:kube_persistentvolumeclaim_resource_requests_storage_bytes:provisioner:sum"}' \
--data-urlencode 'match[]={__name__="cluster:kubelet_volume_stats_used_bytes:provisioner:sum"}' \
--data-urlencode 'match[]={__name__="ceph_cluster_total_bytes"}' \
--data-urlencode 'match[]={__name__="ceph_cluster_total_used_raw_bytes"}' \
--data-urlencode 'match[]={__name__="ceph_health_status"}' \
--data-urlencode 'match[]={__name__="odf_system_raw_capacity_total_bytes"}' \
--data-urlencode 'match[]={__name__="odf_system_raw_capacity_used_bytes"}' \
--data-urlencode 'match[]={__name__="odf_system_health_status"}' \
--data-urlencode 'match[]={__name__="job:ceph_osd_metadata:count"}' \
--data-urlencode 'match[]={__name__="job:kube_pv:count"}' \
--data-urlencode 'match[]={__name__="job:odf_system_pvs:count"}' \
--data-urlencode 'match[]={__name__="job:ceph_pools_iops:total"}' \
--data-urlencode 'match[]={__name__="job:ceph_pools_iops_bytes:total"}' \
--data-urlencode 'match[]={__name__="job:ceph_versions_running:count"}' \
--data-urlencode 'match[]={__name__="job:noobaa_total_unhealthy_buckets:sum"}' \
--data-urlencode 'match[]={__name__="job:noobaa_bucket_count:sum"}' \
--data-urlencode 'match[]={__name__="job:noobaa_total_object_count:sum"}' \
--data-urlencode 'match[]={__name__="odf_system_bucket_count", system_type="OCS", system_vendor="Red Hat"}' \
--data-urlencode 'match[]={__name__="odf_system_objects_total", system_type="OCS", system_vendor="Red Hat"}' \
--data-urlencode 'match[]={__name__="noobaa_accounts_num"}' \
--data-urlencode 'match[]={__name__="noobaa_total_usage"}' \
--data-urlencode 'match[]={__name__="console_url"}' \
--data-urlencode 'match[]={__name__="cluster:ovnkube_master_egress_routing_via_host:max"}' \
--data-urlencode 'match[]={__name__="cluster:network_attachment_definition_instances:max"}' \
--data-urlencode 'match[]={__name__="cluster:network_attachment_definition_enabled_instance_up:max"}' \
--data-urlencode 'match[]={__name__="cluster:ingress_controller_aws_nlb_active:sum"}' \
--data-urlencode 'match[]={__name__="cluster:route_metrics_controller_routes_per_shard:min"}' \
--data-urlencode 'match[]={__name__="cluster:route_metrics_controller_routes_per_shard:max"}' \
--data-urlencode 'match[]={__name__="cluster:route_metrics_controller_routes_per_shard:avg"}' \
--data-urlencode 'match[]={__name__="cluster:route_metrics_controller_routes_per_shard:median"}' \
--data-urlencode 'match[]={__name__="cluster:openshift_route_info:tls_termination:sum"}' \
--data-urlencode 'match[]={__name__="insightsclient_request_send_total"}' \
--data-urlencode 'match[]={__name__="cam_app_workload_migrations"}' \
--data-urlencode 'match[]={__name__="cluster:apiserver_current_inflight_requests:sum:max_over_time:2m"}' \
--data-urlencode 'match[]={__name__="cluster:alertmanager_integrations:max"}' \
--data-urlencode 'match[]={__name__="cluster:telemetry_selected_series:count"}' \
--data-urlencode 'match[]={__name__="openshift:prometheus_tsdb_head_series:sum"}' \
--data-urlencode 'match[]={__name__="openshift:prometheus_tsdb_head_samples_appended_total:sum"}' \
--data-urlencode 'match[]={__name__="monitoring:container_memory_working_set_bytes:sum"}' \
--data-urlencode 'match[]={__name__="namespace_job:scrape_series_added:topk3_sum1h"}' \
--data-urlencode 'match[]={__name__="namespace_job:scrape_samples_post_metric_relabeling:topk3"}' \
--data-urlencode 'match[]={__name__="monitoring:haproxy_server_http_responses_total:sum"}' \
--data-urlencode 'match[]={__name__="rhmi_status"}' \
--data-urlencode 'match[]={__name__="status:upgrading:version:rhoam_state:max"}' \
--data-urlencode 'match[]={__name__="state:rhoam_critical_alerts:max"}' \
--data-urlencode 'match[]={__name__="state:rhoam_warning_alerts:max"}' \
--data-urlencode 'match[]={__name__="rhoam_7d_slo_percentile:max"}' \
--data-urlencode 'match[]={__name__="rhoam_7d_slo_remaining_error_budget:max"}' \
--data-urlencode 'match[]={__name__="cluster_legacy_scheduler_policy"}' \
--data-urlencode 'match[]={__name__="cluster_master_schedulable"}' \
--data-urlencode 'match[]={__name__="che_workspace_status"}' \
--data-urlencode 'match[]={__name__="che_workspace_started_total"}' \
--data-urlencode 'match[]={__name__="che_workspace_failure_total"}' \
--data-urlencode 'match[]={__name__="che_workspace_start_time_seconds_sum"}' \
--data-urlencode 'match[]={__name__="che_workspace_start_time_seconds_count"}' \
--data-urlencode 'match[]={__name__="cco_credentials_mode"}' \
--data-urlencode 'match[]={__name__="cluster:kube_persistentvolume_plugin_type_counts:sum"}' \
--data-urlencode 'match[]={__name__="visual_web_terminal_sessions_total"}' \
--data-urlencode 'match[]={__name__="acm_managed_cluster_info"}' \
--data-urlencode 'match[]={__name__="cluster:vsphere_vcenter_info:sum"}' \
--data-urlencode 'match[]={__name__="cluster:vsphere_esxi_version_total:sum"}' \
--data-urlencode 'match[]={__name__="cluster:vsphere_node_hw_version_total:sum"}' \
--data-urlencode 'match[]={__name__="openshift:build_by_strategy:sum"}' \
--data-urlencode 'match[]={__name__="rhods_aggregate_availability"}' \
--data-urlencode 'match[]={__name__="rhods_total_users"}' \
--data-urlencode 'match[]={__name__="instance:etcd_disk_wal_fsync_duration_seconds:histogram_quantile",quantile="0.99"}' \
--data-urlencode 'match[]={__name__="instance:etcd_mvcc_db_total_size_in_bytes:sum"}' \
--data-urlencode 'match[]={__name__="instance:etcd_network_peer_round_trip_time_seconds:histogram_quantile",quantile="0.99"}' \
--data-urlencode 'match[]={__name__="instance:etcd_mvcc_db_total_size_in_use_in_bytes:sum"}' \
--data-urlencode 'match[]={__name__="instance:etcd_disk_backend_commit_duration_seconds:histogram_quantile",quantile="0.99"}' \
--data-urlencode 'match[]={__name__="jaeger_operator_instances_storage_types"}' \
--data-urlencode 'match[]={__name__="jaeger_operator_instances_strategies"}' \
--data-urlencode 'match[]={__name__="jaeger_operator_instances_agent_strategies"}' \
--data-urlencode 'match[]={__name__="appsvcs:cores_by_product:sum"}' \
--data-urlencode 'match[]={__name__="nto_custom_profiles:count"}' \
--data-urlencode 'match[]={__name__="openshift_csi_share_configmap"}' \
--data-urlencode 'match[]={__name__="openshift_csi_share_secret"}' \
--data-urlencode 'match[]={__name__="openshift_csi_share_mount_failures_total"}' \
--data-urlencode 'match[]={__name__="openshift_csi_share_mount_requests_total"}' \
--data-urlencode 'match[]={__name__="cluster:velero_backup_total:max"}' \
--data-urlencode 'match[]={__name__="cluster:velero_restore_total:max"}' \
--data-urlencode 'match[]={__name__="eo_es_storage_info"}' \
--data-urlencode 'match[]={__name__="eo_es_redundancy_policy_info"}' \
--data-urlencode 'match[]={__name__="eo_es_defined_delete_namespaces_total"}' \
--data-urlencode 'match[]={__name__="eo_es_misconfigured_memory_resources_info"}' \
--data-urlencode 'match[]={__name__="cluster:eo_es_data_nodes_total:max"}' \
--data-urlencode 'match[]={__name__="cluster:eo_es_documents_created_total:sum"}' \
--data-urlencode 'match[]={__name__="cluster:eo_es_documents_deleted_total:sum"}' \
--data-urlencode 'match[]={__name__="pod:eo_es_shards_total:max"}' \
--data-urlencode 'match[]={__name__="eo_es_cluster_management_state_info"}' \
--data-urlencode 'match[]={__name__="imageregistry:imagestreamtags_count:sum"}' \
--data-urlencode 'match[]={__name__="imageregistry:operations_count:sum"}' \
--data-urlencode 'match[]={__name__="log_logging_info"}' \
--data-urlencode 'match[]={__name__="log_collector_error_count_total"}' \
--data-urlencode 'match[]={__name__="log_forwarder_pipeline_info"}' \
--data-urlencode 'match[]={__name__="log_forwarder_input_info"}' \
--data-urlencode 'match[]={__name__="log_forwarder_output_info"}' \
--data-urlencode 'match[]={__name__="cluster:log_collected_bytes_total:sum"}' \
--data-urlencode 'match[]={__name__="cluster:log_logged_bytes_total:sum"}' \
--data-urlencode 'match[]={__name__="cluster:kata_monitor_running_shim_count:sum"}' \
--data-urlencode 'match[]={__name__="platform:hypershift_hostedclusters:max"}' \
--data-urlencode 'match[]={__name__="platform:hypershift_nodepools:max"}' \
--data-urlencode 'match[]={__name__="namespace:noobaa_unhealthy_bucket_claims:max"}' \
--data-urlencode 'match[]={__name__="namespace:noobaa_buckets_claims:max"}' \
--data-urlencode 'match[]={__name__="namespace:noobaa_unhealthy_namespace_resources:max"}' \
--data-urlencode 'match[]={__name__="namespace:noobaa_namespace_resources:max"}' \
--data-urlencode 'match[]={__name__="namespace:noobaa_unhealthy_namespace_buckets:max"}' \
--data-urlencode 'match[]={__name__="namespace:noobaa_namespace_buckets:max"}' \
--data-urlencode 'match[]={__name__="namespace:noobaa_accounts:max"}' \
--data-urlencode 'match[]={__name__="namespace:noobaa_usage:max"}' \
--data-urlencode 'match[]={__name__="namespace:noobaa_system_health_status:max"}' \
--data-urlencode 'match[]={__name__="ocs_advanced_feature_usage"}' \
--data-urlencode 'match[]={__name__="os_image_url_override:sum"}'

Copy to Clipboard

Toggle word wrap

4.2.2. Showing data collected by the Insights Operator
Copy link

You can review the data that is collected by the Insights Operator.

Prerequisites

Access to the cluster as a user with the cluster-admin role.

Procedure

Find the name of the currently running pod for the Insights Operator:

INSIGHTS_OPERATOR_POD=$(oc get pods --namespace=openshift-insights -o custom-columns=:metadata.name --no-headers  --field-selector=status.phase=Running)

$ INSIGHTS_OPERATOR_POD=$(oc get pods --namespace=openshift-insights -o custom-columns=:metadata.name --no-headers  --field-selector=status.phase=Running)

Copy to Clipboard

Toggle word wrap

Copy the recent data archives collected by the Insights Operator:

oc cp openshift-insights/$INSIGHTS_OPERATOR_POD:/var/lib/insights-operator ./insights-data

$ oc cp openshift-insights/$INSIGHTS_OPERATOR_POD:/var/lib/insights-operator ./insights-data

Copy to Clipboard

Toggle word wrap

The recent Insights Operator archives are now available in the insights-data directory.

4.3. Remote health reporting
Copy link

You can opt in, enable, or opt out, disable, reporting health and usage data for your cluster.

4.3.1. Enabling remote health reporting
Copy link

If you or your organization have disabled remote health reporting, you can enable this feature again. You can see that remote health reporting is disabled from the message Insights not available in the Status tile on the OpenShift Container Platform web console Overview page.

To enable remote health reporting, you must change the global cluster pull secret with a new authorization token. Enabling remote health reporting enables both Insights Operator and Telemetry.

4.3.2. Changing your global cluster pull secret to enable remote health reporting
Copy link

You can change your existing global cluster pull secret to enable remote health reporting. If you have disabled remote health monitoring, you must download a new pull secret with your console.openshift.com access token from Red Hat OpenShift Cluster Manager.

Prerequisites

Access to the cluster as a user with the cluster-admin role.
Access to OpenShift Cluster Manager.

Procedure

Go to the Downloads page on the Red Hat Hybrid Cloud Console.

From Tokens Pull secret, click the Download button.

The pull-secret file contains your cloud.openshift.com access token in JSON format:

{
  "auths": {
    "cloud.openshift.com": {
      "auth": "<your_token>",
      "email": "<email_address>"
    }
  }
}

{
  "auths": {
    "cloud.openshift.com": {
      "auth": "<your_token>",
      "email": "<email_address>"
    }
  }
}

Copy to Clipboard

Toggle word wrap

Download the global cluster pull secret to your local file system.

oc get secret/pull-secret -n openshift-config --template='{{index .data ".dockerconfigjson" | base64decode}}' > pull-secret

$ oc get secret/pull-secret -n openshift-config --template='{{index .data ".dockerconfigjson" | base64decode}}' > pull-secret

Copy to Clipboard

Toggle word wrap

Make a backup copy of your pull secret.
```
cp pull-secret pull-secret-backup
```
```
$ cp pull-secret pull-secret-backup
```
Copy to Clipboard Toggle word wrap
Open the pull-secret file in a text editor.
Append the cloud.openshift.com JSON entry from the pull-secret file that you downloaded earlier into the auths file.
Save the file.
Update the secret in your cluster by running the following command:
```
oc set data secret/pull-secret -n openshift-config --from-file=.dockerconfigjson=pull-secret
```
```
oc set data secret/pull-secret -n openshift-config --from-file=.dockerconfigjson=pull-secret
```
Copy to Clipboard Toggle word wrap
You might need to wait several minutes for the secret to update and your cluster to begin reporting.

Verification

For a verification check from the OpenShift Container Platform web console, complete the following steps:
1. Go to the Overview page on the OpenShift Container Platform web console.
2. View the Insights section in the Status tile that reports the number of issues found.
For a verification check from the OpenShift CLI (oc), enter the following command and then check that the value of the status parameter states false:
```
oc get co insights -o jsonpath='{.status.conditions[?(@.type=="Disabled")]}'
```
```
$ oc get co insights -o jsonpath='{.status.conditions[?(@.type=="Disabled")]}'
```
Copy to Clipboard Toggle word wrap

4.3.3. Consequences of disabling remote health reporting
Copy link

In OpenShift Container Platform, customers can disable reporting usage information.

Before you disable remote health reporting, read the following benefits of a connected cluster:

Red Hat can react more quickly to problems and better support our customers.
Red Hat can better understand how product upgrades impact clusters.
Connected clusters help to simplify the subscription and entitlement process.
Connected clusters enable the OpenShift Cluster Manager service to offer an overview of your clusters and their subscription status.

Note

Consider leaving health and usage reporting enabled for pre-production, test, and production clusters. This means that Red Hat can participate in qualifying OpenShift Container Platform in your environments and react more rapidly to product issues.

The following lists some consequences of disabling remote health reporting on a connected cluster:

Red Hat cannot view the success of product upgrades or the health of your clusters without an open support case.
Red Hat cannot use configuration data to better triage customer support cases and identify which configurations our customers find important.
The OpenShift Cluster Manager cannot show data about your clusters, which includes health and usage information.
You must manually enter your subscription information in the console.redhat.com web console without the benefit of automatic usage reporting.

In restricted networks, Telemetry and Insights data still gets gathered through the appropriate configuration of your proxy.

4.3.4. Disabling remote health reporting
Copy link

You can change your existing global cluster pull secret to disable remote health reporting. This configuration disables both Telemetry and the Insights Operator.

Prerequisites

You have access to the cluster as a user with the cluster-admin role.

Procedure

Download the global cluster pull secret to your local file system:
```
oc extract secret/pull-secret -n openshift-config --to=.
```
```
$ oc extract secret/pull-secret -n openshift-config --to=.
```
Copy to Clipboard Toggle word wrap
In a text editor, edit the .dockerconfigjson file that you downloaded by removing the cloud.openshift.com JSON entry:
```
"cloud.openshift.com":{"auth":"<hash>","email":"<email_address>"}
```
```
"cloud.openshift.com":{"auth":"<hash>","email":"<email_address>"}
```
Copy to Clipboard Toggle word wrap
Save the file.
Update the secret in your cluster. For more information, see "Updating the global cluster pull secret".
You might need to wait several minutes for the secret to update in your cluster.

4.3.5. Registering your disconnected cluster
Copy link

Register your disconnected OpenShift Container Platform cluster on the Red Hat Hybrid Cloud Console so that your cluster does not get impacted by disabling remote health reporting. For more information, see "Consequences of disabling remote health reporting".

Important

By registering your disconnected cluster, you can continue to report your subscription usage to Red Hat. Red Hat can then return accurate usage and capacity trends associated with your subscription, so that you can use the returned information to better organize subscription allocations across all of your resources.

Prerequisites

You logged in to the OpenShift Container Platform web console as the cluster-admin role.
You can log in to the Red Hat Hybrid Cloud Console.

Procedure

Go to the Register disconnected cluster web page on the Red Hat Hybrid Cloud Console.
Optional: To access the Register disconnected cluster web page from the home page of the Red Hat Hybrid Cloud Console, go to the Clusters navigation menu item and then select the Register cluster button.
Enter your cluster’s details in the provided fields on the Register disconnected cluster page.
From the Subscription settings section of the page, select the subscription settings that apply to your Red Hat subscription offering.
To register your disconnected cluster, select the Register cluster button.

How does the subscriptions service show my subscription data?(Getting Started with the Subscription Service)

4.3.6. Updating the global cluster pull secret
Copy link

You can update the global pull secret for your cluster by either replacing the current pull secret or appending a new pull secret.

Use the procedure when you need a separate registry to store images than the registry used during installation.

Prerequisites

You have access to the cluster as a user with the cluster-admin role.

Procedure

Optional: To append a new pull secret to the existing pull secret, complete the following steps:
1. Enter the following command to download the pull secret:
  $ oc get secret/pull-secret -n openshift-config --template='{{index .data ".dockerconfigjson" | base64decode}}' > <pull_secret_location>
  1
  Copy to Clipboard Toggle word wrap
  1
  <pull_secret_location>: Include the path to the pull secret file.
2. Enter the following command to add the new pull secret:
  $ oc registry login --registry="<registry>" \
  1
  --auth-basic="<username>:<password>" \
  2
  --to=<pull_secret_location>
  3
  Copy to Clipboard Toggle word wrap
  1
  <registry>: Include the new registry. You can include many repositories within the same registry, for example: --registry="<registry/my-namespace/my-repository>.
  2
  <username>:<password>: Include the credentials of the new registry.
  3
  <pull_secret_location>: Include the path to the pull secret file.
  You can also perform a manual update to the pull secret file.
Enter the following command to update the global pull secret for your cluster:
```
oc set data secret/pull-secret -n openshift-config --from-file=.dockerconfigjson=<pull_secret_location>
```
```
$ oc set data secret/pull-secret -n openshift-config --from-file=.dockerconfigjson=<pull_secret_location> 
```
1
Copy to Clipboard Toggle word wrap
1
<pull_secret_location>: Include the path to the new pull secret file.
This update rolls out to all nodes, which can take some time depending on the size of your cluster.
Note
As of OpenShift Container Platform 4.7.4, changes to the global pull secret no longer trigger a node drain or reboot.

Additional resources

Transferring cluster ownership

4.4. Using Insights to identify issues with your cluster
Copy link

Insights repeatedly analyzes the data Insights Operator sends, which includes workload recommendations from Deployment Validation Operator (DVO). Users of OpenShift Container Platform can display the results in the Insights Advisor service on Red Hat Hybrid Cloud Console.

4.4.1. About Red Hat Insights Advisor for OpenShift Container Platform
Copy link

You can use the Insights advisor service to assess and monitor the health of your OpenShift Container Platform clusters. Whether you are concerned about individual clusters, or with your whole infrastructure, it is important to be aware of the exposure of your cluster infrastructure to issues that can affect service availability, fault tolerance, performance, or security.

If the cluster has the Deployment Validation Operator (DVO) installed the recommendations also highlight workloads whose configuration might lead to cluster health issues.

The results of the Insights analysis are available in the Insights advisor service on Red Hat Hybrid Cloud Console. In the Red Hat Hybrid Cloud Console, you can perform the following actions:

View clusters and workloads affected by specific recommendations.
Use robust filtering capabilities to refine your results to those recommendations.
Learn more about individual recommendations, details about the risks they present, and get resolutions tailored to your individual clusters.
Share results with other stakeholders.

Additional resources

Using the Deployment Validation Operator in your Red Hat Insights for OpenShift workflow

4.4.2. Understanding Insights Advisor recommendations
Copy link

The Insights advisor service bundles information about various cluster states and component configurations that can negatively affect the service availability, fault tolerance, performance, or security of your clusters and workloads. This information set is called a recommendation in the Insights advisor service. Recommendations for clusters includes the following information:

Name: A concise description of the recommendation
Added: When the recommendation was published to the Insights Advisor service archive
Category: Whether the issue has the potential to negatively affect service availability, fault tolerance, performance, or security
Total risk: A value derived from the likelihood that the condition will negatively affect your cluster or workload, and the impact on operations if that were to happen
Clusters: A list of clusters on which a recommendation is detected
Description: A brief synopsis of the issue, including how it affects your clusters

4.4.3. Displaying potential issues with your cluster
Copy link

This section describes how to display the Insights report in Insights Advisor on OpenShift Cluster Manager.

Note that Insights repeatedly analyzes your cluster and shows the latest results. These results can change, for example, if you fix an issue or a new issue has been detected.

Prerequisites

Your cluster is registered on OpenShift Cluster Manager.
Remote health reporting is enabled, which is the default.
You are logged in to OpenShift Cluster Manager.

Procedure

Navigate to Advisor Recommendations on OpenShift Cluster Manager.
Depending on the result, the Insights advisor service displays one of the following:
- No matching recommendations found, if Insights did not identify any issues.
- A list of issues Insights has detected, grouped by risk (low, moderate, important, and critical).
- No clusters yet, if Insights has not yet analyzed the cluster. The analysis starts shortly after the cluster has been installed, registered, and connected to the internet.
If any issues are displayed, click the > icon in front of the entry for more details.
Depending on the issue, the details can also contain a link to more information from Red Hat about the issue.

4.4.4. Displaying all Insights advisor service recommendations
Copy link

The Recommendations view, by default, only displays the recommendations that are detected on your clusters. However, you can view all of the recommendations in the advisor service’s archive.

Prerequisites

Remote health reporting is enabled, which is the default.
Your cluster is registered on Red Hat Hybrid Cloud Console.
You are logged in to OpenShift Cluster Manager.

Procedure

Navigate to Advisor Recommendations on OpenShift Cluster Manager.
Click the X icons next to the Clusters Impacted and Status filters.
You can now browse through all of the potential recommendations for your cluster.

4.4.5. Advisor recommendation filters
Copy link

The Insights advisor service can return a large number of recommendations. To focus on your most critical recommendations, you can apply filters to the Advisor recommendations list to remove low-priority recommendations.

By default, filters are set to only show enabled recommendations that are impacting one or more clusters. To view all or disabled recommendations in the Insights library, you can customize the filters.

To apply a filter, select a filter type and then set its value based on the options that are available in the drop-down list. You can apply multiple filters to the list of recommendations.

You can set the following filter types:

Name: Search for a recommendation by name.
Total risk: Select one or more values from Critical, Important, Moderate, and Low indicating the likelihood and the severity of a negative impact on a cluster.
Impact: Select one or more values from Critical, High, Medium, and Low indicating the potential impact to the continuity of cluster operations.
Likelihood: Select one or more values from Critical, High, Medium, and Low indicating the potential for a negative impact to a cluster if the recommendation comes to fruition.
Category: Select one or more categories from Service Availability, Performance, Fault Tolerance, Security, and Best Practice to focus your attention on.
Status: Click a radio button to show enabled recommendations (default), disabled recommendations, or all recommendations.
Clusters impacted: Set the filter to show recommendations currently impacting one or more clusters, non-impacting recommendations, or all recommendations.
Risk of change: Select one or more values from High, Moderate, Low, and Very low indicating the risk that the implementation of the resolution could have on cluster operations.

4.4.5.1. Filtering Insights advisor service recommendations
Copy link

As an OpenShift Container Platform cluster manager, you can filter the recommendations that are displayed on the recommendations list. By applying filters, you can reduce the number of reported recommendations and concentrate on your highest priority recommendations.

The following procedure demonstrates how to set and remove Category filters; however, the procedure is applicable to any of the filter types and respective values.

Prerequisites

You are logged in to the OpenShift Cluster Manager in the Hybrid Cloud Console.

Procedure

Go to OpenShift > Advisor > Recommendations.
In the main, filter-type drop-down list, select the Category filter type.
Expand the filter-value drop-down list and select the checkbox next to each category of recommendation you want to view. Leave the checkboxes for unnecessary categories clear.
Optional: Add additional filters to further refine the list.

Only recommendations from the selected categories are shown in the list.

Verification

After applying filters, you can view the updated recommendations list. The applied filters are added next to the default filters.

4.4.5.2. Removing filters from Insights advisor service recommendations
Copy link

You can apply multiple filters to the list of recommendations. When ready, you can remove them individually or completely reset them.

Removing filters individually

Click the X icon next to each filter, including the default filters, to remove them individually.

Removing all non-default filters

Click Reset filters to remove only the filters that you applied, leaving the default filters in place.

4.4.6. Disabling Insights advisor service recommendations
Copy link

You can disable specific recommendations that affect your clusters, so that they no longer appear in your reports. It is possible to disable a recommendation for a single cluster or all of your clusters.

Note

Disabling a recommendation for all of your clusters also applies to any future clusters.

Prerequisites

Remote health reporting is enabled, which is the default.
Your cluster is registered on OpenShift Cluster Manager.
You are logged in to OpenShift Cluster Manager.

Procedure

Navigate to Advisor Recommendations on OpenShift Cluster Manager.
Optional: Use the Clusters Impacted and Status filters as needed.
Disable an alert by using one of the following methods:
- To disable an alert:
  1. Click the Options menu for that alert, and then click Disable recommendation.
  2. Enter a justification note and click Save.
- To view the clusters affected by this alert before disabling the alert:
  1. Click the name of the recommendation to disable. You are directed to the single recommendation page.
  2. Review the list of clusters in the Affected clusters section.
  3. Click Actions Disable recommendation to disable the alert for all of your clusters.
  4. Enter a justification note and click Save.

4.4.7. Enabling a previously disabled Insights advisor service recommendation
Copy link

When a recommendation is disabled for all clusters, you no longer see the recommendation in the Insights advisor service. You can change this behavior.

Prerequisites

Remote health reporting is enabled, which is the default.
Your cluster is registered on OpenShift Cluster Manager.
You are logged in to OpenShift Cluster Manager.

Procedure

Navigate to Advisor Recommendations on OpenShift Cluster Manager.
Filter the recommendations to display on the disabled recommendations:
1. From the Status drop-down menu, select Status.
2. From the Filter by status drop-down menu, select Disabled.
3. Optional: Clear the Clusters impacted filter.
Locate the recommendation to enable.
Click the Options menu , and then click Enable recommendation.

4.4.8. About Insights advisor service recommendations for workloads
Copy link

You can use the Red Hat Insights for OpenShift advisor service to view and manage information about recommendations that affect not only your clusters, but also your workloads. The advisor service takes advantage of deployment validation and helps OpenShift cluster administrators to see all runtime violations of deployment policies. You can see recommendations for workloads at OpenShift > Advisor > Workloads on the Red Hat Hybrid Cloud Console. For more information, see these additional resources:

4.4.9. Displaying the Insights status in the web console
Copy link

Insights repeatedly analyzes your cluster and you can display the status of identified potential issues of your cluster in the OpenShift Container Platform web console. This status shows the number of issues in the different categories and, for further details, links to the reports in OpenShift Cluster Manager.

Prerequisites

Your cluster is registered in OpenShift Cluster Manager.
Remote health reporting is enabled, which is the default.
You are logged in to the OpenShift Container Platform web console.

Procedure

Navigate to Home Overview in the OpenShift Container Platform web console.
Click Insights on the Status card.
The pop-up window lists potential issues grouped by risk. Click the individual categories or View all recommendations in Insights Advisor to display more details.

4.5. Using the Insights Operator
Copy link

The Insights Operator periodically gathers configuration and component failure status and, by default, reports that data every two hours to Red Hat. This information enables Red Hat to assess configuration and deeper failure data than is reported through Telemetry. Users of OpenShift Container Platform can display the report in the Insights Advisor service on Red Hat Hybrid Cloud Console.

4.5.1. Understanding Insights Operator alerts
Copy link

The Insights Operator declares alerts through the Prometheus monitoring system to the Alertmanager. You can view these alerts in the Alerting UI in the OpenShift Container Platform web console by using one of the following methods:

In the Administrator perspective, click Observe Alerting.
In the Developer perspective, click Observe <project_name> Alerts tab.

Currently, Insights Operator sends the following alerts when the conditions are met:

Expand

Table 4.1. Insights Operator alerts
Alert	Description
`InsightsDisabled`	Insights Operator is disabled.
`SimpleContentAccessNotAvailable`	Simple content access is not enabled in Red Hat Subscription Management.
`InsightsRecommendationActive`	Insights has an active recommendation for the cluster.

4.5.2. Disabling Insights Operator alerts
Copy link

To prevent the Insights Operator from sending alerts to the cluster Prometheus instance, you edit the support secret. If the support secret doesn’t exist, you must create it when you first add custom configurations. Note that configurations within the support secret take precedence over the default settings defined in the pod.yaml file. To prevent the Insights Operator from sending alerts to the cluster Prometheus instance, you edit the support secret. Note that this secret is created by default. The configurations stored in the support secret take precedence over any default settings specified in the pod.yaml file.

Prerequisites

Remote health reporting is enabled, which is the default.
You are logged in to the OpenShift Container Platform web console as cluster-admin.

Procedure

Navigate to Workloads Secrets.
On the Secrets page, select All Projects from the Project list, and then set Show default projects to on.
Select the openshift-config project from the Projects list.
Search for the support secret by using the Search by name field.
- If the secret exists:
Click the Options menu , and then click Edit Secret.
Click Add Key/Value.
1. In the Key field, enter disableInsightsAlerts.
2. In the Value field, enter True.
  - If the secret does not exist:
3. Click Create Key/value secret.
  1. In the Secret name field, enter support.
  2. In the Key field, enter disableInsightsAlerts.
  3. In the Value field, enter True.
4. Click Create.

After you save the changes, Insights Operator no longer sends alerts to the cluster Prometheus instance.

4.5.3. Enabling Insights Operator alerts
Copy link

When alerts are disabled, the Insights Operator no longer sends alerts to the cluster Prometheus instance. You can change this behavior.

Prerequisites

Remote health reporting is enabled, which is the default.
You are logged in to the OpenShift Container Platform web console as cluster-admin.

Procedure

Navigate to Workloads Secrets.
On the Secrets page, select All Projects from the Project list, and then set Show default projects to ON.
Select the openshift-config project from the Projects list.
Search for the support secret by using the Search by name field.
Click the Options menu , and then click Edit Secret.
For the disableInsightsAlerts key, set the Value field to false.

After you save the changes, Insights Operator again sends alerts to the cluster Prometheus instance.

4.5.4. Downloading your Insights Operator archive
Copy link

Insights Operator stores gathered data in an archive located in the openshift-insights namespace of your cluster. You can download and review the data that is gathered by the Insights Operator.

Prerequisites

You have access to the cluster as a user with the cluster-admin role.

Procedure

Find the name of the running pod for the Insights Operator:

oc get pods --namespace=openshift-insights -o custom-columns=:metadata.name --no-headers  --field-selector=status.phase=Running

$ oc get pods --namespace=openshift-insights -o custom-columns=:metadata.name --no-headers  --field-selector=status.phase=Running

Copy to Clipboard

Toggle word wrap

Copy the recent data archives collected by the Insights Operator:
```
oc cp openshift-insights/<insights_operator_pod_name>:/var/lib/insights-operator ./insights-data
```
```
$ oc cp openshift-insights/<insights_operator_pod_name>:/var/lib/insights-operator ./insights-data 
```
1
Copy to Clipboard Toggle word wrap
1
Replace <insights_operator_pod_name> with the pod name output from the preceding command.

The recent Insights Operator archives are now available in the insights-data directory.

4.5.5. Viewing Insights Operator gather durations
Copy link

You can view the time it takes for the Insights Operator to gather the information contained in the archive. This helps you to understand Insights Operator resource usage and issues with Insights Advisor.

Prerequisites

A recent copy of your Insights Operator archive.

Procedure

From your archive, open /insights-operator/gathers.json.

The file contains a list of Insights Operator gather operations:

    {
      "name": "clusterconfig/authentication",
      "duration_in_ms": 730, 
      "records_count": 1,
      "errors": null,
      "panic": null
    }

    {
      "name": "clusterconfig/authentication",
      "duration_in_ms": 730,


      "records_count": 1,
      "errors": null,
      "panic": null
    }

Copy to Clipboard

Toggle word wrap

1: duration_in_ms is the amount of time in milliseconds for each gather operation.

Inspect each gather operation for abnormalities.

4.5.6. Disabling the Insights Operator gather operations
Copy link

You can disable the Insights Operator gather operations. Disabling the gather operations gives you the ability to increase privacy for your organization as Insights Operator will no longer gather and send Insights cluster reports to Red Hat. This will disable Insights analysis and recommendations for your cluster without affecting other core functions that require communication with Red Hat such as cluster transfers. You can view a list of attempted gather operations for your cluster from the /insights-operator/gathers.json file in your Insights Operator archive. Be aware that some gather operations only occur when certain conditions are met and might not appear in your most recent archive.

Important

The InsightsDataGather custom resource is a Technology Preview feature only. Technology Preview features are not supported with Red Hat production service level agreements (SLAs) and might not be functionally complete. Red Hat does not recommend using them in production. These features provide early access to upcoming product features, enabling customers to test functionality and provide feedback during the development process.

For more information about the support scope of Red Hat Technology Preview features, see Technology Preview Features Support Scope.

Note

If you enable Technology Preview in your cluster, the Insights Operator runs gather operations in individual pods. This is part of the Technology Preview feature set for the Insights Operator and supports the new data gathering features.

Prerequisites

You are logged in to the OpenShift Container Platform web console as a user with the cluster-admin role.

Procedure

Navigate to Administration CustomResourceDefinitions.
On the CustomResourceDefinitions page, use the Search by name field to find the InsightsDataGather resource definition and click it.
On the CustomResourceDefinition details page, click the Instances tab.
Click cluster, and then click the YAML tab.

Disable the gather operations by performing one of the following edits to the InsightsDataGather configuration file:

To disable all the gather operations, enter all under the disabledGatherers key:

apiVersion: config.openshift.io/v1alpha1
kind: InsightsDataGather
metadata:
....

spec: 
  gatherConfig:
    disabledGatherers:
      - all

apiVersion: config.openshift.io/v1alpha1
kind: InsightsDataGather
metadata:
....

spec:


  gatherConfig:
    disabledGatherers:
      - all

Copy to Clipboard

Toggle word wrap

1: The spec parameter specifies gather configurations.
2: The all value disables all gather operations.

To disable individual gather operations, enter their values under the disabledGatherers key:

spec:
  gatherConfig:
    disabledGatherers:
      - clusterconfig/container_images 
      - clusterconfig/host_subnets
      - workloads/workload_info

spec:
  gatherConfig:
    disabledGatherers:
      - clusterconfig/container_images


      - clusterconfig/host_subnets
      - workloads/workload_info

Copy to Clipboard

Toggle word wrap

1: Example individual gather operation

Click Save.
After you save the changes, the Insights Operator gather configurations are updated and the operations will no longer occur.

Note

Disabling gather operations degrades the Insights advisor service’s ability to offer effective recommendations for your cluster.

4.5.7. Enabling the Insights Operator gather operations
Copy link

You can enable the Insights Operator gather operations, if the gather operations have been disabled.

Important

For more information about the support scope of Red Hat Technology Preview features, see Technology Preview Features Support Scope.

Prerequisites

You are logged in to the OpenShift Container Platform web console as a user with the cluster-admin role.

Procedure

Navigate to Administration CustomResourceDefinitions.
On the CustomResourceDefinitions page, use the Search by name field to find the InsightsDataGather resource definition and click it.
On the CustomResourceDefinition details page, click the Instances tab.
Click cluster, and then click the YAML tab.

Enable the gather operations by performing one of the following edits:

To enable all disabled gather operations, remove the gatherConfig stanza:

apiVersion: config.openshift.io/v1alpha1
kind: InsightsDataGather
metadata:
....

spec:
  gatherConfig: 
    disabledGatherers: all

apiVersion: config.openshift.io/v1alpha1
kind: InsightsDataGather
metadata:
....

spec:
  gatherConfig:


    disabledGatherers: all

Copy to Clipboard

Toggle word wrap

1: Remove the gatherConfig stanza to enable all gather operations.

To enable individual gather operations, remove their values under the disabledGatherers key:

spec:
  gatherConfig:
    disabledGatherers:
      - clusterconfig/container_images 
      - clusterconfig/host_subnets
      - workloads/workload_info

spec:
  gatherConfig:
    disabledGatherers:
      - clusterconfig/container_images


      - clusterconfig/host_subnets
      - workloads/workload_info

Copy to Clipboard

Toggle word wrap

1: Remove one or more gather operations.

Click Save.
After you save the changes, the Insights Operator gather configurations are updated and the affected gather operations start.

Note

Disabling gather operations degrades Insights Advisor’s ability to offer effective recommendations for your cluster.

4.5.8. Running an Insights Operator gather operation
Copy link

You can run Insights Operator data gather operations on demand. The following procedures describe how to run the default list of gather operations using the OpenShift web console or CLI. You can customize the on demand gather function to exclude any gather operations you choose. Disabling gather operations from the default list degrades Insights Advisor’s ability to offer effective recommendations for your cluster. If you have previously disabled Insights Operator gather operations in your cluster, this procedure will override those parameters.

Important

The DataGather custom resource is a Technology Preview feature only. Technology Preview features are not supported with Red Hat production service level agreements (SLAs) and might not be functionally complete. Red Hat does not recommend using them in production. These features provide early access to upcoming product features, enabling customers to test functionality and provide feedback during the development process.

For more information about the support scope of Red Hat Technology Preview features, see Technology Preview Features Support Scope.

Note

4.5.8.1. Running an Insights Operator gather operation using the web console
Copy link

You can run an Insights Operator gather operation using the OpenShift Container Platform web console.

Prerequisites

You are logged in to the OpenShift Container Platform web console as a user with the cluster-admin role.

Procedure

Navigate to Administration CustomResourceDefinitions.
On the CustomResourceDefinitions page, use the Search by name field to find the DataGather resource definition and click it.
On the CustomResourceDefinition details page, click the Instances tab.
Click Create DataGather.
To create a new DataGather operation, edit the configuration file:
```
apiVersion: insights.openshift.io/v1alpha1
kind: DataGather
metadata:
  name: <your_data_gather> 
spec:
 gatherers: 
   - name: workloads
     state: Disabled
```
```
apiVersion: insights.openshift.io/v1alpha1
kind: DataGather
metadata:
  name: <your_data_gather> 
```
1
```
spec:
 gatherers: 
```
2
```
   - name: workloads
     state: Disabled
```
Copy to Clipboard Toggle word wrap
1
Replace the <your_data_gather> with a unique name for your gather operation.
2
Enter individual gather operations to disable under the gatherers parameter. This example disables the workloads data gather operation and will run the remainder of the default operations. To run the complete list of default gather operations, leave the spec parameter empty. You can find the complete list of gather operations in the Insights Operator documentation.
Click Save.

Verification

Navigate to Workloads Pods.
On the Pods page, select the Project pulldown menu, and then turn on Show default projects.
Select the openshift-insights project from the Project pulldown menu.
Check that your new gather operation is prefixed with your chosen name under the list of pods in the openshift-insights project. Upon completion, the Insights Operator automatically uploads the data to Red Hat for processing.

4.5.8.2. Running an Insights Operator gather operation using the OpenShift CLI
Copy link

You can run an Insights Operator gather operation using the OpenShift Container Platform command-line interface.

Prerequisites

You are logged in to OpenShift Container Platform as a user with the cluster-admin role.

Procedure

Enter the following command to run the gather operation:
```
oc apply -f <your_datagather_definition>.yaml
```
```
$ oc apply -f <your_datagather_definition>.yaml
```
Copy to Clipboard Toggle word wrap
Replace <your_datagather_definition>.yaml with a configuration file using the following parameters:
```
apiVersion: insights.openshift.io/v1alpha1
kind: DataGather
metadata:
  name: <your_data_gather> 
spec:
 gatherers: 
   - name: workloads
     state: Disabled
```
```
apiVersion: insights.openshift.io/v1alpha1
kind: DataGather
metadata:
  name: <your_data_gather> 
```
1
```
spec:
 gatherers: 
```
2
```
   - name: workloads
     state: Disabled
```
Copy to Clipboard Toggle word wrap
1
Replace the <your_data_gather> with a unique name for your gather operation.
2
Enter individual gather operations to disable under the gatherers parameter. This example disables the workloads data gather operation and will run the remainder of the default operations. To run the complete list of default gather operations, leave the spec parameter empty. You can find the complete list of gather operations in the Insights Operator documentation.

Verification

Check that your new gather operation is prefixed with your chosen name under the list of pods in the openshift-insights project. Upon completion, the Insights Operator automatically uploads the data to Red Hat for processing.

4.5.9. Configuring Insights Operator
Copy link

You can configure Insights Operator to meet the needs of your organization. The Insights Operator is configured using a combination of the default configurations in the pod.yaml file in the Insights Operator Config directory and the configurations stored in the support secret in the openshift-config namespace. The support secret does not exist by default and must be created when adding custom configurations for the first time. Configurations in the support secret override the defaults set in the pod.yaml file.

The table below describes the available configuration attributes:

Expand

Table 4.2. Insights Operator configurable attributes
Attribute name	Description	Value type	Default value
`enableGlobalObfuscation` Copy to Clipboard Toggle word wrap	Enables the global obfuscation of IP addresses and the cluster domain name.	Boolean	`false`
`endpoint` Copy to Clipboard Toggle word wrap	Sets the upload endpoint.	URL	https://console.redhat.com/api/ingress/v1/upload
`clusterTransferEndpoint` Copy to Clipboard Toggle word wrap	The endpoint for checking and downloading cluster transfer data.	URL	https://api.openshift.com/api/accounts_mgmt/v1/cluster_transfers/
`clusterTransferInterval` Copy to Clipboard Toggle word wrap	Sets the frequency for checking available cluster transfers.	Time interval	`24h`
`conditionalGathererEndpoint` Copy to Clipboard Toggle word wrap	Sets the endpoint for providing conditional gathering rule definitions.	URL	https://console.redhat.com/api/gathering/gathering_rules
`httpProxy, httpsProxy, noProxy` Copy to Clipboard Toggle word wrap	Set custom proxy for Insights Operator.	URL	No default
`interval` Copy to Clipboard Toggle word wrap	Sets the data gathering and upload frequency.	Time interval	`2h`
`reportEndpoint` Copy to Clipboard Toggle word wrap	Specifies the endpoint for downloading the latest Insights analysis. For the specified default value, ensure that you replace `<cluster_id>` with the unique identified, cluster ID, of your cluster.	URL	https://console.redhat.com/api/insights-results-aggregator/v2/cluster/<cluster_id>/reports
`reportPullingDelay` Copy to Clipboard Toggle word wrap	Sets the delay between data upload and data download.	Time interval	`60s`
`reportPullingTimeout` Copy to Clipboard Toggle word wrap	Sets the timeout value for the Insights download request.	Time interval	`30s`
`reportMinRetryTime` Copy to Clipboard Toggle word wrap	Sets the time for when to retry a request.	Time interval	`30s`
`scaEndpoint` Copy to Clipboard Toggle word wrap	Specifies the endpoint for downloading the simple content access (SCA) entitlements.	URL	https://api.openshift.com/api/accounts_mgmt/v1/entitlement_certificates
`scaInterval` Copy to Clipboard Toggle word wrap	Specifies the frequency of the simple content access entitlements download.	Time interval	`8h`
`scaPullDisabled` Copy to Clipboard Toggle word wrap	Disables the simple content access entitlements download.	Boolean	`false`

4.5.9.1. Creating the insights-config ConfigMap object
Copy link

This procedure describes how to create the insights-config ConfigMap object for the Insights Operator to set custom configurations.

Important

Red Hat recommends you consult Red Hat Support before making changes to the default Insights Operator configuration.

Prerequisites

You are logged in to the OpenShift Container Platform web console as a user with cluster-admin role.

Procedure

Navigate to Workloads Secrets.
On the Secrets page, select All Projects from the Project list, and then set Show default projects to on.
Select the openshift-config project from the Project list.
Search for the support secret using the Search by name field. If it does not exist, click Create Key/value secret to create it.
Click the Options menu for the secret, and then click Edit Secret.
Click Add Key/Value.
Enter an attribute name with an appropriate value (see table above), and click Save.
Repeat the above steps for any additional configurations.

4.6. Using remote health reporting in a restricted network
Copy link

You can manually gather and upload Insights Operator archives to diagnose issues from a restricted network.

To use the Insights Operator in a restricted network, you must:

Create a copy of your Insights Operator archive.
Upload the Insights Operator archive to console.redhat.com.

Additionally, you can select to obfuscate the Insights Operator data before upload.

4.6.1. Running an Insights Operator gather operation
Copy link

You must run a gather operation to create an Insights Operator archive.

Prerequisites

You are logged in to OpenShift Container Platform as cluster-admin.

Procedure

Create a file named gather-job.yaml using this template:

apiVersion: batch/v1
kind: Job
metadata:
  name: insights-operator-job
  annotations:
    config.openshift.io/inject-proxy: insights-operator
spec:
  backoffLimit: 6
  ttlSecondsAfterFinished: 600
  template:
    spec:
      restartPolicy: OnFailure
      serviceAccountName: operator
      nodeSelector:
        beta.kubernetes.io/os: linux
        node-role.kubernetes.io/master: ""
      tolerations:
      - effect: NoSchedule
        key: node-role.kubernetes.io/master
        operator: Exists
      - effect: NoExecute
        key: node.kubernetes.io/unreachable
        operator: Exists
        tolerationSeconds: 900
      - effect: NoExecute
        key: node.kubernetes.io/not-ready
        operator: Exists
        tolerationSeconds: 900
      volumes:
      - name: snapshots
        emptyDir: {}
      - name: service-ca-bundle
        configMap:
          name: service-ca-bundle
          optional: true
      initContainers:
      - name: insights-operator
        image: quay.io/openshift/origin-insights-operator:latest
        terminationMessagePolicy: FallbackToLogsOnError
        volumeMounts:
        - name: snapshots
          mountPath: /var/lib/insights-operator
        - name: service-ca-bundle
          mountPath: /var/run/configmaps/service-ca-bundle
          readOnly: true
        ports:
        - containerPort: 8443
          name: https
        resources:
          requests:
            cpu: 10m
            memory: 70Mi
        args:
        - gather
        - -v=4
        - --config=/etc/insights-operator/server.yaml
      containers:
        - name: sleepy
          image: quay.io/openshift/origin-base:latest
          args:
            - /bin/sh
            - -c
            - sleep 10m
          volumeMounts: [{name: snapshots, mountPath: /var/lib/insights-operator}]

apiVersion: batch/v1
kind: Job
metadata:
  name: insights-operator-job
  annotations:
    config.openshift.io/inject-proxy: insights-operator
spec:
  backoffLimit: 6
  ttlSecondsAfterFinished: 600
  template:
    spec:
      restartPolicy: OnFailure
      serviceAccountName: operator
      nodeSelector:
        beta.kubernetes.io/os: linux
        node-role.kubernetes.io/master: ""
      tolerations:
      - effect: NoSchedule
        key: node-role.kubernetes.io/master
        operator: Exists
      - effect: NoExecute
        key: node.kubernetes.io/unreachable
        operator: Exists
        tolerationSeconds: 900
      - effect: NoExecute
        key: node.kubernetes.io/not-ready
        operator: Exists
        tolerationSeconds: 900
      volumes:
      - name: snapshots
        emptyDir: {}
      - name: service-ca-bundle
        configMap:
          name: service-ca-bundle
          optional: true
      initContainers:
      - name: insights-operator
        image: quay.io/openshift/origin-insights-operator:latest
        terminationMessagePolicy: FallbackToLogsOnError
        volumeMounts:
        - name: snapshots
          mountPath: /var/lib/insights-operator
        - name: service-ca-bundle
          mountPath: /var/run/configmaps/service-ca-bundle
          readOnly: true
        ports:
        - containerPort: 8443
          name: https
        resources:
          requests:
            cpu: 10m
            memory: 70Mi
        args:
        - gather
        - -v=4
        - --config=/etc/insights-operator/server.yaml
      containers:
        - name: sleepy
          image: quay.io/openshift/origin-base:latest
          args:
            - /bin/sh
            - -c
            - sleep 10m
          volumeMounts: [{name: snapshots, mountPath: /var/lib/insights-operator}]

Copy to Clipboard

Toggle word wrap

Copy your insights-operator image version:

oc get -n openshift-insights deployment insights-operator -o yaml

$ oc get -n openshift-insights deployment insights-operator -o yaml

Copy to Clipboard

Toggle word wrap

Example output

apiVersion: apps/v1
kind: Deployment
metadata:
  name: insights-operator
  namespace: openshift-insights
# ...
spec:
  template:
# ...
    spec:
      containers:
      - args:
# ...
        image: registry.ci.openshift.org/ocp/4.15-2023-10-12-212500@sha256:a0aa581400805ad0... 
# ...

apiVersion: apps/v1
kind: Deployment
metadata:
  name: insights-operator
  namespace: openshift-insights
# ...
spec:
  template:
# ...
    spec:
      containers:
      - args:
# ...
        image: registry.ci.openshift.org/ocp/4.15-2023-10-12-212500@sha256:a0aa581400805ad0...


# ...

Copy to Clipboard

Toggle word wrap

1: Specifies your insights-operator image version.

Paste your image version in gather-job.yaml:

apiVersion: batch/v1
kind: Job
metadata:
  name: insights-operator-job
# ...
spec:
# ...
  template:
    spec:
    initContainers:
    - name: insights-operator
      image: image: registry.ci.openshift.org/ocp/4.15-2023-10-12-212500@sha256:a0aa581400805ad0... 
      terminationMessagePolicy: FallbackToLogsOnError
      volumeMounts:

apiVersion: batch/v1
kind: Job
metadata:
  name: insights-operator-job
# ...
spec:
# ...
  template:
    spec:
    initContainers:
    - name: insights-operator
      image: image: registry.ci.openshift.org/ocp/4.15-2023-10-12-212500@sha256:a0aa581400805ad0...


      terminationMessagePolicy: FallbackToLogsOnError
      volumeMounts:

Copy to Clipboard

Toggle word wrap

1: Replace any existing value with your insights-operator image version.

Create the gather job:

oc apply -n openshift-insights -f gather-job.yaml

$ oc apply -n openshift-insights -f gather-job.yaml

Copy to Clipboard

Toggle word wrap

Find the name of the job pod:

oc describe -n openshift-insights job/insights-operator-job

$ oc describe -n openshift-insights job/insights-operator-job

Copy to Clipboard

Toggle word wrap

Example output

Name:             insights-operator-job
Namespace:        openshift-insights
# ...
Events:
  Type    Reason            Age    From            Message
  ----    ------            ----   ----            -------
  Normal  SuccessfulCreate  7m18s  job-controller  Created pod: insights-operator-job-<your_job>

Name:             insights-operator-job
Namespace:        openshift-insights
# ...
Events:
  Type    Reason            Age    From            Message
  ----    ------            ----   ----            -------
  Normal  SuccessfulCreate  7m18s  job-controller  Created pod: insights-operator-job-<your_job>

Copy to Clipboard

Toggle word wrap

where: insights-operator-job-<your_job> is the name of the pod.

Verify that the operation has finished:

oc logs -n openshift-insights insights-operator-job-<your_job> insights-operator

$ oc logs -n openshift-insights insights-operator-job-<your_job> insights-operator

Copy to Clipboard

Toggle word wrap

Example output

I0407 11:55:38.192084       1 diskrecorder.go:34] Wrote 108 records to disk in 33ms

I0407 11:55:38.192084       1 diskrecorder.go:34] Wrote 108 records to disk in 33ms

Copy to Clipboard

Toggle word wrap

Save the created archive:

oc cp openshift-insights/insights-operator-job-<your_job>:/var/lib/insights-operator ./insights-data

$ oc cp openshift-insights/insights-operator-job-<your_job>:/var/lib/insights-operator ./insights-data

Copy to Clipboard

Toggle word wrap

Clean up the job:

oc delete -n openshift-insights job insights-operator-job

$ oc delete -n openshift-insights job insights-operator-job

Copy to Clipboard

Toggle word wrap

4.6.2. Uploading an Insights Operator archive
Copy link

You can manually upload an Insights Operator archive to console.redhat.com to diagnose potential issues.

Prerequisites

You are logged in to OpenShift Container Platform as cluster-admin.
You have a workstation with unrestricted internet access.
You have created a copy of the Insights Operator archive.

Procedure

Download the dockerconfig.json file:

oc extract secret/pull-secret -n openshift-config --to=.

$ oc extract secret/pull-secret -n openshift-config --to=.

Copy to Clipboard

Toggle word wrap

Copy your "cloud.openshift.com" "auth" token from the dockerconfig.json file:

{
  "auths": {
    "cloud.openshift.com": {
      "auth": "<your_token>",
      "email": "asd@redhat.com"
    }
}

{
  "auths": {
    "cloud.openshift.com": {
      "auth": "<your_token>",
      "email": "asd@redhat.com"
    }
}

Copy to Clipboard

Toggle word wrap

Upload the archive to console.redhat.com:

curl -v -H "User-Agent: insights-operator/one10time200gather184a34f6a168926d93c330 cluster/<cluster_id>" -H "Authorization: Bearer <your_token>" -F "upload=@<path_to_archive>; type=application/vnd.redhat.openshift.periodic+tar" https://console.redhat.com/api/ingress/v1/upload

$ curl -v -H "User-Agent: insights-operator/one10time200gather184a34f6a168926d93c330 cluster/<cluster_id>" -H "Authorization: Bearer <your_token>" -F "upload=@<path_to_archive>; type=application/vnd.redhat.openshift.periodic+tar" https://console.redhat.com/api/ingress/v1/upload

Copy to Clipboard

Toggle word wrap

where <cluster_id> is your cluster ID, <your_token> is the token from your pull secret, and <path_to_archive> is the path to the Insights Operator archive.

If the operation is successful, the command returns a "request_id" and "account_number":

Example output

* Connection #0 to host console.redhat.com left intact
{"request_id":"393a7cf1093e434ea8dd4ab3eb28884c","upload":{"account_number":"6274079"}}%

* Connection #0 to host console.redhat.com left intact
{"request_id":"393a7cf1093e434ea8dd4ab3eb28884c","upload":{"account_number":"6274079"}}%

Copy to Clipboard

Toggle word wrap

Verification steps

Log in to https://console.redhat.com/openshift.
Click the Clusters menu in the left pane.
To display the details of the cluster, click the cluster name.
Open the Insights Advisor tab of the cluster.
If the upload was successful, the tab displays one of the following:
- Your cluster passed all recommendations, if Insights Advisor did not identify any issues.
- A list of issues that Insights Advisor has detected, prioritized by risk (low, moderate, important, and critical).

4.6.3. Enabling Insights Operator data obfuscation
Copy link

You can enable obfuscation to mask sensitive and identifiable IPv4 addresses and cluster base domains that the Insights Operator sends to console.redhat.com.

Warning

Although this feature is available, Red Hat recommends keeping obfuscation disabled for a more effective support experience.

Obfuscation assigns non-identifying values to cluster IPv4 addresses, and uses a translation table that is retained in memory to change IP addresses to their obfuscated versions throughout the Insights Operator archive before uploading the data to console.redhat.com.

For cluster base domains, obfuscation changes the base domain to a hardcoded substring. For example, cluster-api.openshift.example.com becomes cluster-api.<CLUSTER_BASE_DOMAIN>.

The following procedure enables obfuscation using the support secret in the openshift-config namespace.

Prerequisites

You are logged in to the OpenShift Container Platform web console as cluster-admin.

Procedure

Navigate to Workloads Secrets.
Select the openshift-config project.
Search for the support secret using the Search by name field. If it does not exist, click Create Key/value secret to create it.
Click the Options menu , and then click Edit Secret.
Click Add Key/Value.
Create a key named enableGlobalObfuscation with a value of true, and click Save.
Navigate to Workloads Pods
Select the openshift-insights project.
Find the insights-operator pod.
To restart the insights-operator pod, click the Options menu , and then click Delete Pod.

Verification

Navigate to Workloads Secrets.
Select the openshift-insights project.
Search for the obfuscation-translation-table secret using the Search by name field.

If the obfuscation-translation-table secret exists, then obfuscation is enabled and working.

Alternatively, you can inspect /insights-operator/gathers.json in your Insights Operator archive for the value "is_global_obfuscation_enabled": true.

4.7. Importing simple content access entitlements with Insights Operator
Copy link

Insights Operator periodically imports your simple content access entitlements from OpenShift Cluster Manager and stores them in the etc-pki-entitlement secret in the openshift-config-managed namespace. Simple content access is a capability in Red Hat subscription tools which simplifies the behavior of the entitlement tooling. This feature makes it easier to consume the content provided by your Red Hat subscriptions without the complexity of configuring subscription tooling.

Insights Operator imports simple content access entitlements every eight hours, but can be configured or disabled using the support secret in the openshift-config namespace.

Note

Simple content access must be enabled in Red Hat Subscription Management for the importing to function.

4.7.1. Configuring simple content access import interval
Copy link

You can configure how often the Insights Operator imports the simple content access entitlements by using the support secret in the openshift-config namespace. The entitlement import normally occurs every eight hours, but you can shorten this interval if you update your simple content access configuration in Red Hat Subscription Management.

This procedure describes how to update the import interval to one hour.

Prerequisites

You are logged in to the OpenShift Container Platform web console as a user with the cluster-admin role.

Procedure

Navigate to Workloads Secrets.
Select the openshift-config project.
Search for the support secret by using the Search by name field. If it does not exist, click Create Key/value secret to create it.
- If the secret exists:
  1. Click the Options menu , and then click Edit Secret.
  2. Click Add Key/Value.
    In the Key field, enter scaInterval.
    In the Value field, enter 1h.
- If the secret does not exist:
  1. Click Create Key/value secret.
    In the Secret name field, enter support.
    In the Key field, enter scaInterval.
    In the Value field, enter 1h.
  2. Click Create.
Note
The interval 1h can also be entered as 60m for 60 minutes.

4.7.2. Disabling simple content access import
Copy link

You can disable the importing of simple content access entitlements by using the support secret in the openshift-config namespace.

Prerequisites

You are logged in to the OpenShift Container Platform web console as cluster-admin.

Procedure

Navigate to Workloads Secrets.
Select the openshift-config project.
Search for the support secret using the Search by name field.
- If the secret exists:
  1. Click the Options menu , and then click Edit Secret.
  2. Click Add Key/Value.
    In the Key field, enter scaPullDisabled.
    In the Value field, enter true.
- If the secret does not exist:
  1. Click Create Key/value secret.
    In the Secret name field, enter support.
    In the Key field, enter scaPullDisabled.
    In the Value field, enter true.
  2. Click Create.
The simple content access entitlement import is now disabled.
Note
To enable the simple content access import again, edit the support secret and delete the scaPullDisabled key.

4.7.3. Enabling a previously disabled simple content access import
Copy link

If the importing of simple content access entitlements is disabled, the Insights Operator does not import simple content access entitlements. You can change this behavior.

Prerequisites

You are logged in to the OpenShift Container Platform web console as a user with the cluster-admin role.

Procedure

Navigate to Workloads Secrets.
Select the openshift-config project.
Search for the support secret by using the Search by name field.
Click the Options menu , and then click Edit Secret.
For the scaPullDisabled key, set the Value field to false.
The simple content access entitlement import is now disabled.

Chapter 4. Remote health monitoring with connected clusters

4.1. About remote health monitoringCopy linkLink copied to clipboard!

4.1.1. About TelemetryCopy linkLink copied to clipboard!

4.1.1.1. Information collected by TelemetryCopy linkLink copied to clipboard!

4.1.1.1.1. System informationCopy linkLink copied to clipboard!

4.1.1.1.2. Sizing InformationCopy linkLink copied to clipboard!

4.1.1.1.3. Usage informationCopy linkLink copied to clipboard!

4.1.2. About the Insights OperatorCopy linkLink copied to clipboard!

4.1.2.1. Information collected by the Insights OperatorCopy linkLink copied to clipboard!

4.1.3. Understanding Telemetry and Insights Operator data flowCopy linkLink copied to clipboard!

4.1.4. Additional details about how remote health monitoring data is usedCopy linkLink copied to clipboard!

4.2. Showing data collected by remote health monitoringCopy linkLink copied to clipboard!

4.2.1. Showing data collected by TelemetryCopy linkLink copied to clipboard!

4.2.2. Showing data collected by the Insights OperatorCopy linkLink copied to clipboard!

4.3. Remote health reportingCopy linkLink copied to clipboard!

4.3.1. Enabling remote health reportingCopy linkLink copied to clipboard!

4.3.2. Changing your global cluster pull secret to enable remote health reportingCopy linkLink copied to clipboard!

4.3.3. Consequences of disabling remote health reportingCopy linkLink copied to clipboard!

4.3.4. Disabling remote health reportingCopy linkLink copied to clipboard!

4.3.5. Registering your disconnected clusterCopy linkLink copied to clipboard!

4.3.6. Updating the global cluster pull secretCopy linkLink copied to clipboard!

4.4. Using Insights to identify issues with your clusterCopy linkLink copied to clipboard!

4.4.1. About Red Hat Insights Advisor for OpenShift Container PlatformCopy linkLink copied to clipboard!

4.4.2. Understanding Insights Advisor recommendationsCopy linkLink copied to clipboard!

4.4.3. Displaying potential issues with your clusterCopy linkLink copied to clipboard!

4.4.4. Displaying all Insights advisor service recommendationsCopy linkLink copied to clipboard!

4.4.5. Advisor recommendation filtersCopy linkLink copied to clipboard!

4.4.5.1. Filtering Insights advisor service recommendationsCopy linkLink copied to clipboard!

4.4.5.2. Removing filters from Insights advisor service recommendationsCopy linkLink copied to clipboard!

4.4.6. Disabling Insights advisor service recommendationsCopy linkLink copied to clipboard!

4.4.7. Enabling a previously disabled Insights advisor service recommendationCopy linkLink copied to clipboard!

4.4.8. About Insights advisor service recommendations for workloadsCopy linkLink copied to clipboard!

4.4.9. Displaying the Insights status in the web consoleCopy linkLink copied to clipboard!

4.5. Using the Insights OperatorCopy linkLink copied to clipboard!

4.5.1. Understanding Insights Operator alertsCopy linkLink copied to clipboard!

4.5.2. Disabling Insights Operator alertsCopy linkLink copied to clipboard!

4.5.3. Enabling Insights Operator alertsCopy linkLink copied to clipboard!

4.5.4. Downloading your Insights Operator archiveCopy linkLink copied to clipboard!

4.5.5. Viewing Insights Operator gather durationsCopy linkLink copied to clipboard!

4.5.6. Disabling the Insights Operator gather operationsCopy linkLink copied to clipboard!

4.5.7. Enabling the Insights Operator gather operationsCopy linkLink copied to clipboard!

4.5.8. Running an Insights Operator gather operationCopy linkLink copied to clipboard!

4.5.8.1. Running an Insights Operator gather operation using the web consoleCopy linkLink copied to clipboard!

4.5.8.2. Running an Insights Operator gather operation using the OpenShift CLICopy linkLink copied to clipboard!

4.5.9. Configuring Insights OperatorCopy linkLink copied to clipboard!

4.5.9.1. Creating the insights-config ConfigMap objectCopy linkLink copied to clipboard!

4.6. Using remote health reporting in a restricted networkCopy linkLink copied to clipboard!

4.6.1. Running an Insights Operator gather operationCopy linkLink copied to clipboard!

4.6.2. Uploading an Insights Operator archiveCopy linkLink copied to clipboard!

4.6.3. Enabling Insights Operator data obfuscationCopy linkLink copied to clipboard!

4.7. Importing simple content access entitlements with Insights OperatorCopy linkLink copied to clipboard!

4.7.1. Configuring simple content access import intervalCopy linkLink copied to clipboard!

4.7.2. Disabling simple content access importCopy linkLink copied to clipboard!

4.7.3. Enabling a previously disabled simple content access importCopy linkLink copied to clipboard!

Learn

Try, buy, & sell

Communities

About Red Hat Documentation

Making open source more inclusive

About Red Hat

Theme

Red Hat legal and privacy links

Red Hat legal and privacy links

4.1. About remote health monitoring
Copy link

4.1.1. About Telemetry
Copy link

4.1.1.1. Information collected by Telemetry
Copy link

4.1.1.1.1. System information
Copy link

4.1.1.1.2. Sizing Information
Copy link

4.1.1.1.3. Usage information
Copy link

4.1.2. About the Insights Operator
Copy link

4.1.2.1. Information collected by the Insights Operator
Copy link

4.1.3. Understanding Telemetry and Insights Operator data flow
Copy link

4.1.4. Additional details about how remote health monitoring data is used
Copy link

4.2. Showing data collected by remote health monitoring
Copy link

4.2.1. Showing data collected by Telemetry
Copy link

4.2.2. Showing data collected by the Insights Operator
Copy link

4.3. Remote health reporting
Copy link

4.3.1. Enabling remote health reporting
Copy link

4.3.2. Changing your global cluster pull secret to enable remote health reporting
Copy link

4.3.3. Consequences of disabling remote health reporting
Copy link

4.3.4. Disabling remote health reporting
Copy link

4.3.5. Registering your disconnected cluster
Copy link

4.3.6. Updating the global cluster pull secret
Copy link

4.4. Using Insights to identify issues with your cluster
Copy link

4.4.1. About Red Hat Insights Advisor for OpenShift Container Platform
Copy link

4.4.2. Understanding Insights Advisor recommendations
Copy link

4.4.3. Displaying potential issues with your cluster
Copy link

4.4.4. Displaying all Insights advisor service recommendations
Copy link

4.4.5. Advisor recommendation filters
Copy link

4.4.5.1. Filtering Insights advisor service recommendations
Copy link

4.4.5.2. Removing filters from Insights advisor service recommendations
Copy link

4.4.6. Disabling Insights advisor service recommendations
Copy link

4.4.7. Enabling a previously disabled Insights advisor service recommendation
Copy link

4.4.8. About Insights advisor service recommendations for workloads
Copy link

4.4.9. Displaying the Insights status in the web console
Copy link

4.5. Using the Insights Operator
Copy link

4.5.1. Understanding Insights Operator alerts
Copy link

4.5.2. Disabling Insights Operator alerts
Copy link

4.5.3. Enabling Insights Operator alerts
Copy link

4.5.4. Downloading your Insights Operator archive
Copy link

4.5.5. Viewing Insights Operator gather durations
Copy link

4.5.6. Disabling the Insights Operator gather operations
Copy link

4.5.7. Enabling the Insights Operator gather operations
Copy link

4.5.8. Running an Insights Operator gather operation
Copy link

4.5.8.1. Running an Insights Operator gather operation using the web console
Copy link

4.5.8.2. Running an Insights Operator gather operation using the OpenShift CLI
Copy link

4.5.9. Configuring Insights Operator
Copy link

4.5.9.1. Creating the insights-config ConfigMap object
Copy link

4.6. Using remote health reporting in a restricted network
Copy link

4.6.1. Running an Insights Operator gather operation
Copy link

4.6.2. Uploading an Insights Operator archive
Copy link

4.6.3. Enabling Insights Operator data obfuscation
Copy link

4.7. Importing simple content access entitlements with Insights Operator
Copy link

4.7.1. Configuring simple content access import interval
Copy link

4.7.2. Disabling simple content access import
Copy link

4.7.3. Enabling a previously disabled simple content access import
Copy link