This documentation is for a release that is no longer maintained
See documentation for the latest supported version 3 or the latest supported version 4.Este contenido no está disponible en el idioma seleccionado.
Logging
OpenShift Logging installation, usage, and release notes
Abstract
Chapter 1. Release notes for Red Hat OpenShift Logging
1.1. Making open source more inclusive
Red Hat is committed to replacing problematic language in our code, documentation, and web properties. We are beginning with these four terms: master, slave, blacklist, and whitelist. Because of the enormity of this endeavor, these changes will be implemented gradually over several upcoming releases. For more details, see Red Hat CTO Chris Wright’s message.
1.2. Supported Versions
| 4.7 | 4.8 | 4.9 | |
|---|---|---|---|
| RHOL 5.1 | X | X | |
| RHOL 5.2 | X | X | X | 
| RHOL 5.3 | X | X | 
1.2.1. OpenShift Logging 5.1.0
This release includes RHSA-2021:2112 OpenShift Logging Bug Fix Release 5.1.0.
1.2.1.1. New features and enhancements
OpenShift Logging 5.1 now supports OpenShift Container Platform 4.7 and later running on:
- IBM Power Systems
- IBM Z and LinuxONE
This release adds improvements related to the following components and concepts.
- 
								As a cluster administrator, you can use Kubernetes pod labels to gather log data from an application and send it to a specific log store. You can gather log data by configuring the inputs[].application.selector.matchLabelselement in theClusterLogForwardercustom resource (CR) YAML file. You can also filter the gathered log data by namespace. (LOG-883)
- This release adds the following new - ElasticsearchNodeDiskWatermarkReachedwarnings to the OpenShift Elasticsearch Operator (EO):- Elasticsearch Node Disk Low Watermark Reached
- Elasticsearch Node Disk High Watermark Reached
- Elasticsearch Node Disk Flood Watermark Reached
 - The alert applies the past several warnings when it predicts that an Elasticsearch node will reach the - Disk Low Watermark,- Disk High Watermark, or- Disk Flood Stage Watermarkthresholds in the next 6 hours. This warning period gives you time to respond before the node reaches the disk watermark thresholds. The warning messages also provide links to the troubleshooting steps, which you can follow to help mitigate the issue. The EO applies the past several hours of disk space data to a linear model to generate these warnings. (LOG-1100)
- JSON logs can now be forwarded as JSON objects, rather than quoted strings, to either Red Hat’s managed Elasticsearch cluster or any of the other supported third-party systems. Additionally, you can now query individual fields from a JSON log message inside Kibana which increases the discoverability of specific logs. (LOG-785, LOG-1148)
1.2.1.2. Deprecated and removed features
Some features available in previous releases have been deprecated or removed.
Deprecated functionality is still included in OpenShift Logging and continues to be supported; however, it will be removed in a future release of this product and is not recommended for new deployments.
1.2.1.2.1. Elasticsearch Curator has been removed
With this update, the Elasticsearch Curator has been removed and is no longer supported. Elasticsearch Curator helped you curate or manage your indices on OpenShift Container Platform 4.4 and earlier. Instead of using Elasticsearch Curator, configure the log retention time.
1.2.1.2.2. Forwarding logs using the legacy Fluentd and legacy syslog methods have been deprecated
From OpenShift Container Platform 4.6 to the present, forwarding logs by using the legacy Fluentd and legacy syslog methods have been deprecated and will be removed in a future release. Use the standard non-legacy methods instead.
1.2.1.3. Bug fixes
- 
								Before this update, the ClusterLogForwarderCR did not show theinput[].selectorelement after it had been created. With this update, when you specify aselectorin theClusterLogForwarderCR, it remains. Fixing this bug was necessary for LOG-883, which enables using pod label selectors to forward application log data. (LOG-1338)
- Before this update, an update in the cluster service version (CSV) accidentally introduced resources and limits for the OpenShift Elasticsearch Operator container. Under specific conditions, this caused an out-of-memory condition that terminated the Elasticsearch Operator pod. This update fixes the issue by removing the CSV resources and limits for the Operator container. The Operator gets scheduled without issues. (LOG-1254)
- Before this update, forwarding logs to Kafka using chained certificates failed with the following error message: - state=error: certificate verify failed (unable to get local issuer certificate)- Logs could not be forwarded to a Kafka broker with a certificate signed by an intermediate CA. This happened because the Fluentd Kafka plug-in could only handle a single CA certificate supplied in the - ca-bundle.crtentry of the corresponding secret. This update fixes the issue by enabling the Fluentd Kafka plug-in to handle multiple CA certificates supplied in the- ca-bundle.crtentry of the corresponding secret. Now, logs can be forwarded to a Kafka broker with a certificate signed by an intermediate CA. (LOG-1218, LOG-1216)
- Before this update, while under load, Elasticsearch responded to some requests with an HTTP 500 error, even though there was nothing wrong with the cluster. Retrying the request was successful. This update fixes the issue by updating the index management cron jobs to be more resilient when they encounter temporary HTTP 500 errors. The updated index management cron jobs will first retry a request multiple times before failing. (LOG-1215)
- 
								Before this update, if you did not set the .proxyvalue in the cluster installation configuration, and then configured a global proxy on the installed cluster, a bug prevented Fluentd from forwarding logs to Elasticsearch. To work around this issue, in the proxy or cluster configuration, set theno_proxyvalue to.svc.cluster.localso it skips internal traffic. This update fixes the proxy configuration issue. If you configure the global proxy after installing an OpenShift Container Platform cluster, Fluentd forwards logs to Elasticsearch. (LOG-1187, BZ#1915448)
- Before this update, the logging collector created more socket connections than necessary. With this update, the logging collector reuses the existing socket connection to send logs. (LOG-1186)
- Before this update, if a cluster administrator tried to add or remove storage from an Elasticsearch cluster, the OpenShift Elasticsearch Operator (EO) incorrectly tried to upgrade the Elasticsearch cluster, displaying - scheduledUpgrade: "True",- shardAllocationEnabled: primaries, and change the volumes. With this update, the EO does not try to upgrade the Elasticsearch cluster.- The EO status displays the following new status information to indicate when you have tried to make an unsupported change to the Elasticsearch storage that it has ignored: - 
										StorageStructureChangeIgnoredwhen you try to change between using ephemeral and persistent storage structures.
- 
										StorageClassNameChangeIgnoredwhen you try to change the storage class name.
- 
										StorageSizeChangeIgnoredwhen you try to change the storage size.
 Note- If you configure the - ClusterLoggingcustom resource (CR) to switch from ephemeral to persistent storage, the EO creates a persistent volume claim (PVC) but does not create a persistent volume (PV). To clear the- StorageStructureChangeIgnoredstatus, you must revert the change to the- ClusterLoggingCR and delete the persistent volume claim (PVC).- (LOG-1351) 
- 
										
- Before this update, if you redeployed a full Elasticsearch cluster, it got stuck in an unhealthy state, with one non-data node running and all other data nodes shut down. This issue happened because new certificates prevented the Elasticsearch Operator from scaling down the non-data nodes of the Elasticsearch cluster. With this update, Elasticsearch Operator can scale all the data and non-data nodes down and then back up again, so they load the new certificates. The Elasticsearch Operator can reach the new nodes after they load the new certificates. (LOG-1536)
1.2.2. OpenShift Logging 5.0.9
This release includes RHBA-2021:3705 - Bug Fix Advisory. OpenShift Logging Bug Fix Release (5.0.9).
1.2.2.1. Bug fixes
This release includes the following bug fixes:
- Before this update, some log entries had unrecognized UTF-8 bytes, which caused Elasticsearch to reject messages and block the entire buffered payload. This update resolves the issue: rejected payloads drop the invalid log entries and resubmit the remaining entries. (LOG-1574)
- 
								Before this update, editing the ClusterLoggingcustom resource (CR) did not apply the value oftotalLimitSizeto the Fluentdtotal_limit_sizefield, which limits the size of the buffer plugin instance. As a result, Fluentd applied the default values. With this update, the CR applies the value oftotalLimitSizeto the Fluentdtotal_limit_sizefield. Fluentd uses the value of thetotal_limit_sizefield or the default value, whichever is less. (LOG-1736)
1.2.2.2. CVEs
1.2.3. OpenShift Logging 5.0.8
This release includes RHBA-2021:3526 - Bug Fix Advisory. OpenShift Logging Bug Fix Release (5.0.8).
1.2.3.1. Bug fixes
This release also includes the following bug fixes:
- 
								Due to an issue in the release pipeline scripts, the value of the olm.skipRangefield remained unchanged at5.2.0and was not updated when the z-stream number,0, increased. The current release fixes the pipeline scripts to update the value of this field when the release numbers change. (LOG-1741)
1.2.4. OpenShift Logging 5.0.7
This release includes RHBA-2021:2884 - Bug Fix Advisory. OpenShift Logging Bug Fix Release (5.0.7).
1.2.4.1. Bug fixes
This release also includes the following bug fixes:
- LOG-1594 - Vendored viaq/logerr dependency is missing a license file
1.2.4.2. CVEs
- CVE-2016-10228
- CVE-2017-14502
- CVE-2018-25011
- CVE-2019-2708
- CVE-2019-3842
- CVE-2019-9169
- CVE-2019-13012
- CVE-2019-18276
- CVE-2019-18811
- CVE-2019-19523
- CVE-2019-19528
- CVE-2019-25013
- CVE-2020-0431
- CVE-2020-8231
- CVE-2020-8284
- CVE-2020-8285
- CVE-2020-8286
- CVE-2020-8927
- CVE-2020-9948
- CVE-2020-9951
- CVE-2020-9983
- CVE-2020-10543
- CVE-2020-10878
- CVE-2020-11608
- CVE-2020-12114
- CVE-2020-12362
- CVE-2020-12363
- CVE-2020-12364
- CVE-2020-12464
- CVE-2020-13434
- CVE-2020-13543
- CVE-2020-13584
- CVE-2020-13776
- CVE-2020-14314
- CVE-2020-14344
- CVE-2020-14345
- CVE-2020-14346
- CVE-2020-14347
- CVE-2020-14356
- CVE-2020-14360
- CVE-2020-14361
- CVE-2020-14362
- CVE-2020-14363
- CVE-2020-15358
- CVE-2020-15437
- CVE-2020-24394
- CVE-2020-24977
- CVE-2020-25212
- CVE-2020-25284
- CVE-2020-25285
- CVE-2020-25643
- CVE-2020-25704
- CVE-2020-25712
- CVE-2020-26116
- CVE-2020-26137
- CVE-2020-26541
- CVE-2020-27618
- CVE-2020-27619
- CVE-2020-27786
- CVE-2020-27835
- CVE-2020-28196
- CVE-2020-28974
- CVE-2020-29361
- CVE-2020-29362
- CVE-2020-29363
- CVE-2020-35508
- CVE-2020-36322
- CVE-2020-36328
- CVE-2020-36329
- CVE-2021-0342
- CVE-2021-0605
- CVE-2021-3177
- CVE-2021-3326
- CVE-2021-3501
- CVE-2021-3516
- CVE-2021-3517
- CVE-2021-3518
- CVE-2021-3520
- CVE-2021-3537
- CVE-2021-3541
- CVE-2021-3543
- CVE-2021-20271
- CVE-2021-23336
- CVE-2021-27219
- CVE-2021-33034
1.2.5. OpenShift Logging 5.0.6
This release includes RHBA-2021:2655 - Bug Fix Advisory. OpenShift Logging Bug Fix Release (5.0.6).
1.2.5.1. Bug fixes
This release also includes the following bug fixes:
- LOG-1451 - [1927249] fieldmanager.go:186] [SHOULD NOT HAPPEN] failed to update managedFields…duplicate entries for key [name="POLICY_MAPPING"] (LOG-1451)
- LOG-1537 - Full Cluster Cert Redeploy is broken when the ES clusters includes non-data nodes(LOG-1537)
- LOG-1430 - eventrouter raising "Observed a panic: &runtime.TypeAssertionError" (LOG-1430)
- 
								LOG-1461 - The index management job status is always Completedeven when there has an error in the job log. (LOG-1461)
- LOG-1459 - Operators missing disconnected annotation (LOG-1459)
- LOG-1572 - Bug 1981579: Fix built-in application behavior to collect all of logs (LOG-1572)
1.2.5.2. CVEs
- CVE-2016-10228
- CVE-2017-14502
- CVE-2018-25011
- CVE-2019-2708
- CVE-2019-9169
- CVE-2019-25013
- CVE-2020-8231
- CVE-2020-8284
- CVE-2020-8285
- CVE-2020-8286
- CVE-2020-8927
- CVE-2020-10543
- CVE-2020-10878
- CVE-2020-13434
- CVE-2020-14344
- CVE-2020-14345
- CVE-2020-14346
- CVE-2020-14347
- CVE-2020-14360
- CVE-2020-14361
- CVE-2020-14362
- CVE-2020-14363
- CVE-2020-15358
- CVE-2020-25712
- CVE-2020-26116
- CVE-2020-26137
- CVE-2020-26541
- CVE-2020-27618
- CVE-2020-27619
- CVE-2020-28196
- CVE-2020-29361
- CVE-2020-29362
- CVE-2020-29363
- CVE-2020-36328
- CVE-2020-36329
- CVE-2021-3177
- CVE-2021-3326
- CVE-2021-3516
- CVE-2021-3517
- CVE-2021-3518
- CVE-2021-3520
- CVE-2021-3537
- CVE-2021-3541
- CVE-2021-20271
- CVE-2021-23336
- CVE-2021-27219
- CVE-2021-33034
1.2.6. OpenShift Logging 5.0.5
This release includes RHSA-2021:2374 - Security Advisory. Moderate: Openshift Logging Bug Fix Release (5.0.5).
1.2.6.1. Security fixes
- gogo/protobuf: plugin/unmarshal/unmarshal.go lacks certain index validation. (CVE-2021-3121)
- glib: integer overflow in g_bytes_new function on 64-bit platforms due to an implicit cast from 64 bits to 32 bits(CVE-2021-27219)
The following issues relate to the above CVEs:
- BZ#1921650 gogo/protobuf: plugin/unmarshal/unmarshal.go lacks certain index validation(BZ#1921650)
- LOG-1361 CVE-2021-3121 elasticsearch-operator-container: gogo/protobuf: plugin/unmarshal/unmarshal.go lacks certain index validation [openshift-logging-5](LOG-1361)
- LOG-1362 CVE-2021-3121 elasticsearch-proxy-container: gogo/protobuf: plugin/unmarshal/unmarshal.go lacks certain index validation [openshift-logging-5](LOG-1362)
- LOG-1363 CVE-2021-3121 logging-eventrouter-container: gogo/protobuf: plugin/unmarshal/unmarshal.go lacks certain index validation [openshift-logging-5](LOG-1363)
1.2.7. OpenShift Logging 5.0.4
This release includes RHSA-2021:2136 - Security Advisory. Moderate: Openshift Logging security and bugs update (5.0.4).
1.2.7.1. Security fixes
- gogo/protobuf: plugin/unmarshal/unmarshal.go lacks certain index validation. (CVE-2021-3121)
The following Jira issues contain the above CVEs:
- LOG-1364 CVE-2021-3121 cluster-logging-operator-container: gogo/protobuf: plugin/unmarshal/unmarshal.go lacks certain index validation [openshift-logging-5]. (LOG-1364)
1.2.7.2. Bug fixes
This release also includes the following bug fixes:
- LOG-1328 Port fix to 5.0.z for BZ-1945168. (LOG-1328)
1.2.8. OpenShift Logging 5.0.3
This release includes RHSA-2021:1515 - Security Advisory. Important OpenShift Logging Bug Fix Release (5.0.3).
1.2.8.1. Security fixes
- jackson-databind: arbitrary code execution in slf4j-ext class (CVE-2018-14718)
- jackson-databind: arbitrary code execution in blaze-ds-opt and blaze-ds-core classes (CVE-2018-14719)
- jackson-databind: exfiltration/XXE in some JDK classes (CVE-2018-14720)
- jackson-databind: server-side request forgery (SSRF) in axis2-jaxws class (CVE-2018-14721)
- jackson-databind: improper polymorphic deserialization in axis2-transport-jms class (CVE-2018-19360)
- jackson-databind: improper polymorphic deserialization in openjpa class (CVE-2018-19361)
- jackson-databind: improper polymorphic deserialization in jboss-common-core class (CVE-2018-19362)
- jackson-databind: default typing mishandling leading to remote code execution (CVE-2019-14379)
- jackson-databind: serialization gadgets in com.pastdev.httpcomponents.configuration.JndiConfiguration (CVE-2020-24750)
- jackson-databind: mishandles the interaction between serialization gadgets and typing, related to org.apache.commons.dbcp2.datasources.PerUserPoolDataSource (CVE-2020-35490)
- jackson-databind: mishandles the interaction between serialization gadgets and typing, related to org.apache.commons.dbcp2.datasources.SharedPoolDataSource (CVE-2020-35491)
- jackson-databind: mishandles the interaction between serialization gadgets and typing, related to com.oracle.wls.shaded.org.apache.xalan.lib.sql.JNDIConnectionPool (CVE-2020-35728)
- jackson-databind: mishandles the interaction between serialization gadgets and typing, related to oadd.org.apache.commons.dbcp.cpdsadapter.DriverAdapterCPDS (CVE-2020-36179)
- jackson-databind: mishandles the interaction between serialization gadgets and typing, related to org.apache.commons.dbcp2.cpdsadapter.DriverAdapterCPDS (CVE-2020-36180)
- jackson-databind: mishandles the interaction between serialization gadgets and typing, related to org.apache.tomcat.dbcp.dbcp.cpdsadapter.DriverAdapterCPDS (CVE-2020-36181)
- jackson-databind: mishandles the interaction between serialization gadgets and typing, related to org.apache.tomcat.dbcp.dbcp2.cpdsadapter.DriverAdapterCPDS (CVE-2020-36182)
- jackson-databind: mishandles the interaction between serialization gadgets and typing, related to org.docx4j.org.apache.xalan.lib.sql.JNDIConnectionPool (CVE-2020-36183)
- jackson-databind: mishandles the interaction between serialization gadgets and typing, related to org.apache.tomcat.dbcp.dbcp2.datasources.PerUserPoolDataSource (CVE-2020-36184)
- jackson-databind: mishandles the interaction between serialization gadgets and typing, related to org.apache.tomcat.dbcp.dbcp2.datasources.SharedPoolDataSource (CVE-2020-36185)
- jackson-databind: mishandles the interaction between serialization gadgets and typing, related to org.apache.tomcat.dbcp.dbcp.datasources.PerUserPoolDataSource (CVE-2020-36186)
- jackson-databind: mishandles the interaction between serialization gadgets and typing, related to org.apache.tomcat.dbcp.dbcp.datasources.SharedPoolDataSource (CVE-2020-36187)
- jackson-databind: mishandles the interaction between serialization gadgets and typing, related to com.newrelic.agent.deps.ch.qos.logback.core.db.JNDIConnectionSource (CVE-2020-36188)
- jackson-databind: mishandles the interaction between serialization gadgets and typing, related to com.newrelic.agent.deps.ch.qos.logback.core.db.DriverManagerConnectionSource (CVE-2020-36189)
- jackson-databind: mishandles the interaction between serialization gadgets and typing, related to javax.swing (CVE-2021-20190)
- golang: data race in certain net/http servers including ReverseProxy can lead to DoS (CVE-2020-15586)
- golang: ReadUvarint and ReadVarint can read an unlimited number of bytes from invalid inputs (CVE-2020-16845)
- OpenJDK: Incomplete enforcement of JAR signing disabled algorithms (Libraries, 8249906) (CVE-2021-2163)
The following Jira issues contain the above CVEs:
- LOG-1234 CVE-2020-15586 CVE-2020-16845 openshift-eventrouter: various flaws [openshift-4]. (LOG-1234)
- LOG-1243 CVE-2018-14718 CVE-2018-14719 CVE-2018-14720 CVE-2018-14721 CVE-2018-19360 CVE-2018-19361 CVE-2018-19362 CVE-2019-14379 CVE-2020-35490 CVE-2020-35491 CVE-2020-35728… logging-elasticsearch6-container: various flaws [openshift-logging-5.0]. (LOG-1243)
1.2.8.2. Bug fixes
This release also includes the following bug fixes:
- LOG-1224 Release 5.0 - ClusterLogForwarder namespace-specific log forwarding does not work as expected. (LOG-1224)
- LOG-1232 5.0 - Bug 1859004 - Sometimes the eventrouter couldn’t gather event logs. (LOG-1232)
- LOG-1299 Release 5.0 - Forwarding logs to Kafka using Chained certificates fails with error "state=error: certificate verify failed (unable to get local issuer certificate)". (LOG-1299)
1.2.9. OpenShift Logging 5.0.2
This release includes RHBA-2021:1167 - Bug Fix Advisory. OpenShift Logging Bug Fix Release (5.0.2).
1.2.9.1. Bug fixes
- 
								If you did not set .proxyin the cluster installation configuration, and then configured a global proxy on the installed cluster, a bug prevented Fluentd from forwarding logs to Elasticsearch. To work around this issue, in the proxy/cluster configuration, setno_proxyto.svc.cluster.localso it skips internal traffic. The current release fixes the proxy configuration issue. Now, if you configure the global proxy after installing an OpenShift cluster, Fluentd forwards logs to Elasticsearch. (LOG-1187)
- Previously, forwarding logs to Kafka using chained certificates failed with error "state=error: certificate verify failed (unable to get local issuer certificate)." Logs could not be forwarded to a Kafka broker with a certificate signed by an intermediate CA. This happened because fluentd Kafka plugin could only handle a single CA certificate supplied in the ca-bundle.crt entry of the corresponding secret. The current release fixes this issue by enabling the fluentd Kafka plugin to handle multiple CA certificates supplied in the ca-bundle.crt entry of the corresponding secret. Now, logs can be forwarded to a Kafka broker with a certificate signed by an intermediate CA. (LOG-1216, LOG-1218)
- Previously, an update in the cluster service version (CSV) accidentally introduced resources and limits for the OpenShift Elasticsearch operator container. Under specific conditions, this caused an out-of-memory condition that terminated the Elasticsearch operator pod. The current release fixes this issue by removing the CSV resources and limits for the operator container. Now, the operator gets scheduled without issues. (LOG-1254)
1.2.10. OpenShift Logging 5.0.1
This release includes RHBA-2021:0963 - Bug Fix Advisory. OpenShift Logging Bug Fix Release (5.0.1).
1.2.10.1. Bug fixes
- 
								Previously, if you enabled legacy log forwarding, logs were not sent to managed storage. This issue occurred because the generated log forwarding configuration improperly chose between either log forwarding or legacy log forwarding. The current release fixes this issue. If the ClusterLoggingCR defines alogstore, logs are sent to managed storage. Additionally, if legacy log forwarding is enabled, logs are sent to legacy log forwarding regardless of whether managed storage is enabled. (LOG-1172)
- Previously, while under load, Elasticsearch responded to some requests with an HTTP 500 error, even though there was nothing wrong with the cluster. Retrying the request was successful. This release fixes the issue by updating the cron jobs to be more resilient when encountering temporary HTTP 500 errors. Now, they will retry a request multiple times first before failing. (LOG-1215)
1.2.11. OpenShift Logging 5.0.0
This release includes RHBA-2021:0652 - Bug Fix Advisory. Errata Advisory for Openshift Logging 5.0.0.
1.2.11.1. New features and enhancements
This release adds improvements related to the following concepts.
Cluster Logging becomes Red Hat OpenShift Logging
With this release, Cluster Logging becomes Red Hat OpenShift Logging 5.0.
Maximum five primary shards per index
With this release, the OpenShift Elasticsearch Operator (EO) sets the number of primary shards for an index between one and five, depending on the number of data nodes defined for a cluster.
Previously, the EO set the number of shards for an index to the number of data nodes. When an index in Elasticsearch was configured with a number of replicas, it created that many replicas for each primary shard, not per index. Therefore, as the index sharded, a greater number of replica shards existed in the cluster, which created a lot of overhead for the cluster to replicate and keep in sync.
Updated OpenShift Elasticsearch Operator name and maturity level
This release updates the display name of the OpenShift Elasticsearch Operator and operator maturity level. The new display name and clarified specific use for the OpenShift Elasticsearch Operator are updated in Operator Hub.
OpenShift Elasticsearch Operator reports on CSV success
						This release adds reporting metrics to indicate that installing or upgrading the ClusterServiceVersion (CSV) object for the OpenShift Elasticsearch Operator was successful. Previously, there was no way to determine, or generate an alert, if the installing or upgrading the CSV failed. Now, an alert is provided as part of the OpenShift Elasticsearch Operator.
					
Reduce Elasticsearch pod certificate permission warnings
Previously, when the Elasticsearch pod started, it generated certificate permission warnings, which misled some users to troubleshoot their clusters. The current release fixes these permissions issues to reduce these types of notifications.
New links from alerts to explanations and troubleshooting
This release adds a link from the alerts that an Elasticsearch cluster generates to a page of explanations and troubleshooting steps for that alert.
New connection timeout for deletion jobs
The current release adds a connection timeout for deletion jobs, which helps prevent pods from occasionally hanging when they query Elasticsearch to delete indices. Now, if the underlying 'curl' call does not connect before the timeout period elapses, the timeout terminates the call.
Minimize updates to rollover index templates
With this enhancement, the OpenShift Elasticsearch Operator only updates its rollover index templates if they have different field values. Index templates have a higher priority than indices. When the template is updated, the cluster prioritizes distributing them over the index shards, impacting performance. To minimize Elasticsearch cluster operations, the operator only updates the templates when the number of primary shards or replica shards changes from what is currently configured.
1.2.11.2. Technology Preview features
Some features in this release are currently in Technology Preview. These experimental features are not intended for production use. Note the following scope of support on the Red Hat Customer Portal for these features:
Technology Preview Features Support Scope
In the table below, features are marked with the following statuses:
- TP: Technology Preview
- GA: General Availability
- -: Not Available
| Feature | OCP 4.5 | OCP 4.6 | Logging 5.0 | 
|---|---|---|---|
| Log forwarding | TP | GA | GA | 
1.2.11.3. Deprecated and removed features
Some features available in previous releases have been deprecated or removed.
Deprecated functionality is still included in OpenShift Logging and continues to be supported; however, it will be removed in a future release of this product and is not recommended for new deployments.
1.2.11.3.1. Elasticsearch Curator has been deprecated
The Elasticsearch Curator has been deprecated and will be removed in a future release. Elasticsearch Curator helped you curate or manage your indices on OpenShift Container Platform 4.4 and earlier. Instead of using Elasticsearch Curator, configure the log retention time.
1.2.11.3.2. Forwarding logs using the legacy Fluentd and legacy syslog methods have been deprecated
From OpenShift Container Platform 4.6 to the present, forwarding logs by using the legacy Fluentd and legacy syslog methods have been deprecated and will be removed in a future release. Use the standard non-legacy methods instead.
1.2.11.4. Bug fixes
- Previously, Elasticsearch rejected HTTP requests whose headers exceeded the default max header size, 8 KB. Now, the max header size is 128 KB, and Elasticsearch no longer rejects HTTP requests for exceeding the max header size. (BZ#1845293)
- 
								Previously, nodes did not recover from Pendingstatus because a software bug did not correctly update their statuses in the Elasticsearch custom resource (CR). The current release fixes this issue, so the nodes can recover when their status isPending.(BZ#1887357)
- 
								Previously, when the Cluster Logging Operator (CLO) scaled down the number of Elasticsearch nodes in the clusterloggingCR to three nodes, it omitted previously-created nodes that had unique IDs. The OpenShift Elasticsearch Operator rejected the update because it has safeguards that prevent nodes with unique IDs from being removed. Now, when the CLO scales down the number of nodes and updates the Elasticsearch CR, it marks nodes with unique IDs as count 0 instead of omitting them. As a result, users can scale down their cluster to 3 nodes by using theclusterloggingCR. (BZ#1879150)
In OpenShift Logging 5.0 and later, the Cluster Logging Operator is called Red Hat OpenShift Logging Operator.
- 
								Previously, the Fluentd collector pod went into a crash loop when the ClusterLogForwarderhad an incorrectly-configured secret. The current release fixes this issue. Now, theClusterLogForwardervalidates the secrets and reports any errors in its status field. As a result, it does not cause the Fluentd collector pod to crash. (BZ#1888943)
- 
								Previously, if you updated the Kibana resource configuration in the clusterlogginginstance toresource{}, the resulting nil map caused a panic and changed the status of the OpenShift Elasticsearch Operator toCrashLoopBackOff. The current release fixes this issue by initializing the map. (BZ#1889573)
- Previously, the fluentd collector pod went into a crash loop when the ClusterLogForwarder had multiple outputs using the same secret. The current release fixes this issue. Now, multiple outputs can share a secret. (BZ#1890072)
- Previously, if you deleted a Kibana route, the Cluster Logging Operator (CLO) could not recover or recreate it. Now, the CLO watches the route, and if you delete the route, the OpenShift Elasticsearch Operator can reconcile or recreate it. (BZ#1890825)
- Previously, the Cluster Logging Operator (CLO) would attempt to reconcile the Elasticsearch resource, which depended upon the Red Hat-provided Elastic Custom Resource Definition (CRD). Attempts to list an unknown kind caused the CLO to exit its reconciliation loop. This happened because the CLO tried to reconcile all of its managed resources whether they were defined or not. The current release fixes this issue. The CLO only reconciles types provided by the OpenShift Elasticsearch Operator if a user defines managed storage. As a result, users can create collector-only deployments of cluster logging by deploying the CLO. (BZ#1891738)
- Previously, because of an LF GA syslog implementation for RFC 3164, logs sent to remote syslog were not compatible with the legacy behavior. The current release fixes this issue. AddLogSource adds details about log’s source details to the "message" field. Now, logs sent to remote syslog are compatible with the legacy behavior. (BZ#1891886)
- Previously, the Elasticsearch rollover pods failed with a - resource_already_exists_exceptionerror. Within the Elasticsearch rollover API, when the next index was created, the- *-writealias was not updated to point to it. As a result, the next time the rollover API endpoint was triggered for that particular index, it received an error that the resource already existed.- The current release fixes this issue. Now, when a rollover occurs in the - indexmanagementcronjobs, if a new index was created, it verifies that the alias points to the new index. This behavior prevents the error. If the cluster is already receiving this error, a cronjob fixes the issue so that subsequent runs work as expected. Now, performing rollovers no longer produces the exception. (BZ#1893992)
- Previously, Fluent stopped sending logs even though the logging stack seemed functional. Logs were not shipped to an endpoint for an extended period even when an endpoint came back up. This happened if the max backoff time was too long and the endpoint was down. The current release fixes this issue by lowering the max backoff time, so the logs are shipped sooner. (BZ#1894634)
- 
								Previously, omitting the Storage size of the Elasticsearch node caused panic in the OpenShift Elasticsearch Operator code. This panic appeared in the logs as: Observed a panic: "invalid memory address or nil pointer dereference"The panic happened because although Storage size is a required field, the software didn’t check for it. The current release fixes this issue, so there is no panic if the storage size is omitted. Instead, the storage defaults to ephemeral storage and generates a log message for the user. (BZ#1899589)
- 
								Previously, elasticsearch-rolloverandelasticsearch-deletepods remained in theInvalid JSON:orValueError: No JSON object could be decodederror states. This exception was raised because there was no exception handler for invalid JSON input. The current release fixes this issue by providing a handler for invalid JSON input. As a result, the handler outputs an error message instead of an exception traceback, and theelasticsearch-rolloverandelasticsearch-deletejobs do not remain those error states. (BZ#1899905)
- 
								Previously, when deploying Fluentd as a stand-alone, a Kibana pod was created even if the value of replicaswas0. This happened because Kibana defaulted to1pod even when there were no Elasticsearch nodes. The current release fixes this. Now, a Kibana only defaults to1when there are one or more Elasticsearch nodes. (BZ#1901424)
- Previously, if you deleted the secret, it was not recreated. Even though the certificates were on a disk local to the operator, they weren’t rewritten because they hadn’t changed. That is, certificates were only written if they changed. The current release fixes this issue. It rewrites the secret if the certificate changes or is not found. Now, if you delete the master-certs, they are replaced. (BZ#1901869)
- Previously, if a cluster had multiple custom resources with the same name, the resource would get selected alphabetically when not fully qualified with the API group. As a result, if you installed both Red Hat’s OpenShift Elasticsearch Operator alongside the OpenShift Elasticsearch Operator, you would see failures when collected data via a must-gather report. The current release fixes this issue by ensuring must-gathers now use the full API group when gathering information about the cluster’s custom resources. (BZ#1897731)
- An earlier bug fix to address issues related to certificate generation introduced an error. Trying to read the certificates caused them to be regenerated because they were recognized as missing. This, in turn, triggered the OpenShift Elasticsearch Operator to perform a rolling upgrade on the Elasticsearch cluster and, potentially, to have mismatched certificates. This bug was caused by the operator incorrectly writing certificates to the working directory. The current release fixes this issue. Now the operator consistently reads and writes certificates to the same working directory, and the certificates are only regenerated if needed. (BZ#1905910)
- 
								Previously, queries to the root endpoint to retrieve the Elasticsearch version received a 403 response. The 403 response broke any services that used this endpoint in prior releases. This error happened because non-administrative users did not have the monitorpermission required to query the root endpoint and retrieve the Elasticsearch version. Now, non-administrative users can query the root endpoint for the deployed version of Elasticsearch. (BZ#1906765)
- 
								Previously, in some bulk insertion situations, the Elasticsearch proxy timed out connections between fluentd and Elasticsearch. As a result, fluentd failed to deliver messages and logged a Server returned nothing (no headers, no data)error. The current release fixes this issue: It increases the default HTTP read and write timeouts in the Elasticsearch proxy from five seconds to one minute. It also provides command-line options in the Elasticsearch proxy to control HTTP timeouts in the field. (BZ#1908707)
- Previously, in some cases, the {ProductName}/Elasticsearch dashboard was missing from the OpenShift Container Platform monitoring dashboard because the dashboard configuration resource referred to a different namespace owner and caused the OpenShift Container Platform to garbage-collect that resource. Now, the ownership reference is removed from the OpenShift Elasticsearch Operator reconciler configuration, and the logging dashboard appears in the console. (BZ#1910259)
- Previously, the code that uses environment variables to replace values in the Kibana configuration file did not consider commented lines. This prevented users from overriding the default value of server.maxPayloadBytes. The current release fixes this issue by uncommenting the default value of server.maxPayloadByteswithin. Now, users can override the value by using environment variables, as documented. (BZ#1918876)
- Previously, the Kibana log level was increased not to suppress instructions to delete indices that failed to migrate, which also caused the display of GET requests at the INFO level that contained the Kibana user’s email address and OAuth token. The current release fixes this issue by masking these fields, so the Kibana logs do not display them. (BZ#1925081)
1.2.11.5. Known issues
- Fluentd pods with the - ruby-kafka-1.1.0and- fluent-plugin-kafka-0.13.1gems are not compatible with Apache Kafka version 0.10.1.0.- As a result, log forwarding to Kafka fails with a message: - error_class=Kafka::DeliveryFailed error="Failed to send messages to flux-openshift-v4/1"- The - ruby-kafka-0.7gem dropped support for Kafka 0.10 in favor of native support for Kafka 0.11. The- ruby-kafka-1.0.0gem added support for Kafka 2.3 and 2.4. The current version of OpenShift Logging tests and therefore supports Kafka version 2.4.1.- To work around this issue, upgrade to a supported version of Apache Kafka. 
Chapter 2. Understanding Red Hat OpenShift Logging
As a cluster administrator, you can deploy OpenShift Logging to aggregate all the logs from your OpenShift Container Platform cluster, such as node system audit logs, application container logs, and infrastructure logs. OpenShift Logging aggregates these logs from throughout your cluster and stores them in a default log store. You can use the Kibana web console to visualize log data.
OpenShift Logging aggregates the following types of logs:
- 
					application- Container logs generated by user applications running in the cluster, except infrastructure container applications.
- 
					infrastructure- Logs generated by infrastructure components running in the cluster and OpenShift Container Platform nodes, such as journal logs. Infrastructure components are pods that run in theopenshift*,kube*, ordefaultprojects.
- 
					audit- Logs generated by the node audit system (auditd), which are stored in the /var/log/audit/audit.log file, and the audit logs from the Kubernetes apiserver and the OpenShift apiserver.
Because the internal OpenShift Container Platform Elasticsearch log store does not provide secure storage for audit logs, audit logs are not stored in the internal Elasticsearch instance by default. If you want to send the audit logs to the internal log store, for example to view the audit logs in Kibana, you must use the Log Forwarding API as described in Forward audit logs to the log store.
2.1. About deploying OpenShift Logging
				OpenShift Container Platform cluster administrators can deploy OpenShift Logging using the OpenShift Container Platform web console or CLI to install the OpenShift Elasticsearch Operator and Red Hat OpenShift Logging Operator. When the operators are installed, you create a ClusterLogging custom resource (CR) to schedule OpenShift Logging pods and other resources necessary to support OpenShift Logging. The operators are responsible for deploying, upgrading, and maintaining OpenShift Logging.
			
				The ClusterLogging CR defines a complete OpenShift Logging environment that includes all the components of the logging stack to collect, store and visualize logs. The Red Hat OpenShift Logging Operator watches the OpenShift Logging CR and adjusts the logging deployment accordingly.
			
Administrators and application developers can view the logs of the projects for which they have view access.
For information, see Configuring the log collector.
2.1.1. About JSON OpenShift Container Platform Logging
You can use JSON logging to configure the Log Forwarding API to parse JSON strings into a structured object. You can perform the following tasks:
- Parse JSON logs
- Configure JSON log data for Elasticsearch
- Forward JSON logs to the Elasticsearch log store
For information, see About JSON Logging.
2.1.2. About collecting and storing Kubernetes events
The OpenShift Container Platform Event Router is a pod that watches Kubernetes events and logs them for collection by OpenShift Container Platform Logging. You must manually deploy the Event Router.
For information, see About collecting and storing Kubernetes events.
2.1.3. About updating OpenShift Container Platform Logging
OpenShift Container Platform allows you to update OpenShift Container Platform logging. You must update the following operators while updating OpenShift Container Platform Logging:
- Elasticsearch Operator
- Cluster Logging Operator
For information, see About updating OpenShift Container Platform Logging.
2.1.4. About viewing the cluster dashboard
The OpenShift Container Platform Logging dashboard contains charts that show details about your Elasticsearch instance at the cluster level. These charts help you diagnose and anticipate problems.
For information, see About viewing the cluster dashboard.
2.1.5. About troubleshooting OpenShift Container Platform Logging
You can troubleshoot the logging issues by performing the following tasks:
- Viewing logging status
- Viewing the status of the log store
- Understanding logging alerts
- Collecting logging data for Red Hat Support
- Troubleshooting for critical alerts
2.1.6. About uninstalling OpenShift Container Platform Logging
You can stop log aggregation by deleting the ClusterLogging custom resource (CR). After deleting the CR, there are other cluster logging components that remain, which you can optionally remove.
For information, see About uninstalling OpenShift Container Platform Logging.
2.1.7. About exporting fields
The logging system exports fields. Exported fields are present in the log records and are available for searching from Elasticsearch and Kibana.
For information, see About exporting fields.
2.1.8. About OpenShift Logging components
The OpenShift Logging components include a collector deployed to each node in the OpenShift Container Platform cluster that collects all node and container logs and writes them to a log store. You can use a centralized web UI to create rich visualizations and dashboards with the aggregated data.
The major components of OpenShift Logging are:
- collection - This is the component that collects logs from the cluster, formats them, and forwards them to the log store. The current implementation is Fluentd.
- log store - This is where the logs are stored. The default implementation is Elasticsearch. You can use the default Elasticsearch log store or forward logs to external log stores. The default log store is optimized and tested for short-term storage.
- visualization - This is the UI component you can use to view logs, graphs, charts, and so forth. The current implementation is Kibana.
This document might refer to log store or Elasticsearch, visualization or Kibana, collection or Fluentd, interchangeably, except where noted.
2.1.9. About the logging collector
OpenShift Container Platform uses Fluentd to collect container and node logs.
By default, the log collector uses the following sources:
- journald for all system logs
- 
							/var/log/containers/*.logfor all container logs
					If you configure the log collector to collect audit logs, it gets them from /var/log/audit/audit.log.
				
The logging collector is a daemon set that deploys pods to each OpenShift Container Platform node. System and infrastructure logs are generated by journald log messages from the operating system, the container runtime, and OpenShift Container Platform. Application logs are generated by the CRI-O container engine. Fluentd collects the logs from these sources and forwards them internally or externally as you configure in OpenShift Container Platform.
The container runtimes provide minimal information to identify the source of log messages: project, pod name, and container ID. This information is not sufficient to uniquely identify the source of the logs. If a pod with a given name and project is deleted before the log collector begins processing its logs, information from the API server, such as labels and annotations, might not be available. There might not be a way to distinguish the log messages from a similarly named pod and project or trace the logs to their source. This limitation means that log collection and normalization are considered best effort.
The available container runtimes provide minimal information to identify the source of log messages and do not guarantee unique individual log messages or that these messages can be traced to their source.
For information, see Configuring the log collector.
2.1.10. About the log store
By default, OpenShift Container Platform uses Elasticsearch (ES) to store log data. Optionally, you can use the log forwarding features to forward logs to external log stores using Fluentd protocols, syslog protocols, or the OpenShift Container Platform Log Forwarding API.
The OpenShift Logging Elasticsearch instance is optimized and tested for short term storage, approximately seven days. If you want to retain your logs over a longer term, it is recommended you move the data to a third-party storage system.
					Elasticsearch organizes the log data from Fluentd into datastores, or indices, then subdivides each index into multiple pieces called shards, which it spreads across a set of Elasticsearch nodes in an Elasticsearch cluster. You can configure Elasticsearch to make copies of the shards, called replicas, which Elasticsearch also spreads across the Elasticsearch nodes. The ClusterLogging custom resource (CR) allows you to specify how the shards are replicated to provide data redundancy and resilience to failure. You can also specify how long the different types of logs are retained using a retention policy in the ClusterLogging CR.
				
The number of primary shards for the index templates is equal to the number of Elasticsearch data nodes.
					The Red Hat OpenShift Logging Operator and companion OpenShift Elasticsearch Operator ensure that each Elasticsearch node is deployed using a unique deployment that includes its own storage volume. You can use a ClusterLogging custom resource (CR) to increase the number of Elasticsearch nodes, as needed. Refer to the Elasticsearch documentation for considerations involved in configuring storage.
				
A highly-available Elasticsearch environment requires at least three Elasticsearch nodes, each on a different host.
Role-based access control (RBAC) applied on the Elasticsearch indices enables the controlled access of the logs to the developers. Administrators can access all logs and developers can access only the logs in their projects.
For information, see Configuring the log store.
2.1.11. About logging visualization
OpenShift Container Platform uses Kibana to display the log data collected by Fluentd and indexed by Elasticsearch.
Kibana is a browser-based console interface to query, discover, and visualize your Elasticsearch data through histograms, line graphs, pie charts, and other visualizations.
For information, see Configuring the log visualizer.
2.1.12. About event routing
					The Event Router is a pod that watches OpenShift Container Platform events so they can be collected by OpenShift Logging. The Event Router collects events from all projects and writes them to STDOUT. Fluentd collects those events and forwards them into the OpenShift Container Platform Elasticsearch instance. Elasticsearch indexes the events to the infra index.
				
You must manually deploy the Event Router.
For information, see Collecting and storing Kubernetes events.
2.1.13. About log forwarding
					By default, OpenShift Logging sends logs to the default internal Elasticsearch log store, defined in the ClusterLogging custom resource (CR). If you want to forward logs to other log aggregators, you can use the log forwarding features to send logs to specific endpoints within or outside your cluster.
				
For information, see Forwarding logs to third-party systems.
Chapter 3. Installing OpenShift Logging
You can install OpenShift Logging by deploying the OpenShift Elasticsearch and Red Hat OpenShift Logging Operators. The OpenShift Elasticsearch Operator creates and manages the Elasticsearch cluster used by OpenShift Logging. The Red Hat OpenShift Logging Operator creates and manages the components of the logging stack.
The process for deploying OpenShift Logging to OpenShift Container Platform involves:
- Reviewing the OpenShift Logging storage considerations.
- Installing the OpenShift Elasticsearch Operator and Red Hat OpenShift Logging Operator using the OpenShift Container Platform web console or CLI.
3.1. Installing OpenShift Logging using the web console
You can use the OpenShift Container Platform web console to install the OpenShift Elasticsearch and Red Hat OpenShift Logging Operators.
					If you do not want to use the default Elasticsearch log store, you can remove the internal Elasticsearch logStore and Kibana visualization components from the ClusterLogging custom resource (CR). Removing these components is optional but saves resources. For more information, see Removing unused components if you do not use the default Elasticsearch log store.
				
Prerequisites
- Ensure that you have the necessary persistent storage for Elasticsearch. Note that each Elasticsearch node requires its own storage volume. Note- If you use a local volume for persistent storage, do not use a raw block volume, which is described with - volumeMode: blockin the- LocalVolumeobject. Elasticsearch cannot use raw block volumes.- Elasticsearch is a memory-intensive application. By default, OpenShift Container Platform installs three Elasticsearch nodes with memory requests and limits of 16 GB. This initial set of three OpenShift Container Platform nodes might not have enough memory to run Elasticsearch within your cluster. If you experience memory issues that are related to Elasticsearch, add more Elasticsearch nodes to your cluster rather than increasing the memory on existing nodes. 
Procedure
To install the OpenShift Elasticsearch Operator and Red Hat OpenShift Logging Operator using the OpenShift Container Platform web console:
- Install the OpenShift Elasticsearch Operator: - In the OpenShift Container Platform web console, click Operators → OperatorHub.
- Choose OpenShift Elasticsearch Operator from the list of available Operators, and click Install.
- Ensure that the All namespaces on the cluster is selected under Installation Mode.
- Ensure that openshift-operators-redhat is selected under Installed Namespace. - You must specify the - openshift-operators-redhatnamespace. The- openshift-operatorsnamespace might contain Community Operators, which are untrusted and could publish a metric with the same name as an OpenShift Container Platform metric, which would cause conflicts.
- Select Enable operator recommended cluster monitoring on this namespace. - This option sets the - openshift.io/cluster-monitoring: "true"label in the Namespace object. You must select this option to ensure that cluster monitoring scrapes the- openshift-operators-redhatnamespace.
- Select stable-5.x as the Update Channel.
- Select an Approval Strategy. - The Automatic strategy allows Operator Lifecycle Manager (OLM) to automatically update the Operator when a new version is available.
- The Manual strategy requires a user with appropriate credentials to approve the Operator update.
 
- Click Install.
- Verify that the OpenShift Elasticsearch Operator installed by switching to the Operators → Installed Operators page.
- Ensure that OpenShift Elasticsearch Operator is listed in all projects with a Status of Succeeded.
 
- Install the Red Hat OpenShift Logging Operator: - In the OpenShift Container Platform web console, click Operators → OperatorHub.
- Choose Red Hat OpenShift Logging from the list of available Operators, and click Install.
- Ensure that the A specific namespace on the cluster is selected under Installation Mode.
- Ensure that Operator recommended namespace is openshift-logging under Installed Namespace.
- Select Enable operator recommended cluster monitoring on this namespace. - This option sets the - openshift.io/cluster-monitoring: "true"label in the Namespace object. You must select this option to ensure that cluster monitoring scrapes the- openshift-loggingnamespace.
- Select stable-5.x as the Update Channel.
- Select an Approval Strategy. - The Automatic strategy allows Operator Lifecycle Manager (OLM) to automatically update the Operator when a new version is available.
- The Manual strategy requires a user with appropriate credentials to approve the Operator update.
 
- Click Install.
- Verify that the Red Hat OpenShift Logging Operator installed by switching to the Operators → Installed Operators page.
- Ensure that Red Hat OpenShift Logging is listed in the openshift-logging project with a Status of Succeeded. - If the Operator does not appear as installed, to troubleshoot further: - Switch to the Operators → Installed Operators page and inspect the Status column for any errors or failures.
- 
										Switch to the Workloads → Pods page and check the logs in any pods in the openshift-loggingproject that are reporting issues.
 
 
- Create an OpenShift Logging instance: - Switch to the Administration → Custom Resource Definitions page.
- On the Custom Resource Definitions page, click ClusterLogging.
- On the Custom Resource Definition details page, select View Instances from the Actions menu.
- On the ClusterLoggings page, click Create ClusterLogging. - You might have to refresh the page to load the data. 
- In the YAML field, replace the code with the following: Note- This default OpenShift Logging configuration should support a wide array of environments. Review the topics on tuning and configuring OpenShift Logging components for information on modifications you can make to your OpenShift Logging cluster. - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- The name must beinstance.
- 2
- The OpenShift Logging management state. In some cases, if you change the OpenShift Logging defaults, you must set this toUnmanaged. However, an unmanaged deployment does not receive updates until OpenShift Logging is placed back into a managed state.
- 3
- Settings for configuring Elasticsearch. Using the CR, you can configure shard replication policy and persistent storage.
- 4
- Specify the length of time that Elasticsearch should retain each log source. Enter an integer and a time designation: weeks(w), hours(h/H), minutes(m) and seconds(s). For example,7dfor seven days. Logs older than themaxAgeare deleted. You must specify a retention policy for each log source or the Elasticsearch indices will not be created for that source.
- 5
- Specify the number of Elasticsearch nodes. See the note that follows this list.
- 6
- Enter the name of an existing storage class for Elasticsearch storage. For best performance, specify a storage class that allocates block storage. If you do not specify a storage class, OpenShift Logging uses ephemeral storage.
- 7
- Specify the CPU and memory requests for Elasticsearch as needed. If you leave these values blank, the OpenShift Elasticsearch Operator sets default values that should be sufficient for most deployments. The default values are16Gifor the memory request and1for the CPU request.
- 8
- Specify the CPU and memory requests for the Elasticsearch proxy as needed. If you leave these values blank, the OpenShift Elasticsearch Operator sets default values that should be sufficient for most deployments. The default values are256Mifor the memory request and100mfor the CPU request.
- 9
- Settings for configuring Kibana. Using the CR, you can scale Kibana for redundancy and configure the CPU and memory for your Kibana nodes. For more information, see Configuring the log visualizer.
- 10
- Settings for configuring Fluentd. Using the CR, you can configure Fluentd CPU and memory limits. For more information, see Configuring Fluentd.
 Note- The maximum number of Elasticsearch control plane nodes (also known as the master nodes) is three. If you specify a - nodeCountgreater than- 3, OpenShift Container Platform creates three Elasticsearch nodes that are Master-eligible nodes, with the master, client, and data roles. The additional Elasticsearch nodes are created as Data-only nodes, using client and data roles. Control plane nodes perform cluster-wide actions such as creating or deleting an index, shard allocation, and tracking nodes. Data nodes hold the shards and perform data-related operations such as CRUD, search, and aggregations. Data-related operations are I/O-, memory-, and CPU-intensive. It is important to monitor these resources and to add more Data nodes if the current nodes are overloaded.- For example, if - nodeCount=4, the following nodes are created:- oc get deployment - $ oc get deployment- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - cluster-logging-operator 1/1 1 1 18h elasticsearch-cd-x6kdekli-1 0/1 1 0 6m54s elasticsearch-cdm-x6kdekli-1 1/1 1 1 18h elasticsearch-cdm-x6kdekli-2 0/1 1 0 6m49s elasticsearch-cdm-x6kdekli-3 0/1 1 0 6m44s - cluster-logging-operator 1/1 1 1 18h elasticsearch-cd-x6kdekli-1 0/1 1 0 6m54s elasticsearch-cdm-x6kdekli-1 1/1 1 1 18h elasticsearch-cdm-x6kdekli-2 0/1 1 0 6m49s elasticsearch-cdm-x6kdekli-3 0/1 1 0 6m44s- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - The number of primary shards for the index templates is equal to the number of Elasticsearch data nodes. 
- 
								Click Create. This creates the OpenShift Logging components, the Elasticsearchcustom resource and components, and the Kibana interface.
 
- Verify the install: - Switch to the Workloads → Pods page.
- Select the openshift-logging project. - You should see several pods for OpenShift Logging, Elasticsearch, Fluentd, and Kibana similar to the following list: - cluster-logging-operator-cb795f8dc-xkckc
- elasticsearch-cdm-b3nqzchd-1-5c6797-67kfz
- elasticsearch-cdm-b3nqzchd-2-6657f4-wtprv
- elasticsearch-cdm-b3nqzchd-3-588c65-clg7g
- fluentd-2c7dg
- fluentd-9z7kk
- fluentd-br7r2
- fluentd-fn2sb
- fluentd-pb2f8
- fluentd-zqgqx
- kibana-7fb4fd4cc9-bvt4p
 
 
3.2. Post-installation tasks
If you plan to use Kibana, you must manually create your Kibana index patterns and visualizations to explore and visualize data in Kibana.
If your cluster network provider enforces network isolation, allow network traffic between the projects that contain the OpenShift Logging operators.
3.3. Installing OpenShift Logging using the CLI
You can use the OpenShift Container Platform CLI to install the OpenShift Elasticsearch and Red Hat OpenShift Logging Operators.
Prerequisites
- Ensure that you have the necessary persistent storage for Elasticsearch. Note that each Elasticsearch node requires its own storage volume. Note- If you use a local volume for persistent storage, do not use a raw block volume, which is described with - volumeMode: blockin the- LocalVolumeobject. Elasticsearch cannot use raw block volumes.- Elasticsearch is a memory-intensive application. By default, OpenShift Container Platform installs three Elasticsearch nodes with memory requests and limits of 16 GB. This initial set of three OpenShift Container Platform nodes might not have enough memory to run Elasticsearch within your cluster. If you experience memory issues that are related to Elasticsearch, add more Elasticsearch nodes to your cluster rather than increasing the memory on existing nodes. 
Procedure
To install the OpenShift Elasticsearch Operator and Red Hat OpenShift Logging Operator using the CLI:
- Create a namespace for the OpenShift Elasticsearch Operator. - Create a namespace object YAML file (for example, - eo-namespace.yaml) for the OpenShift Elasticsearch Operator:- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- You must specify theopenshift-operators-redhatnamespace. To prevent possible conflicts with metrics, you should configure the Prometheus Cluster Monitoring stack to scrape metrics from theopenshift-operators-redhatnamespace and not theopenshift-operatorsnamespace. Theopenshift-operatorsnamespace might contain community Operators, which are untrusted and could publish a metric with the same name as an OpenShift Container Platform metric, which would cause conflicts.
- 2
- String. You must specify this label as shown to ensure that cluster monitoring scrapes theopenshift-operators-redhatnamespace.
 
- Create the namespace: - oc create -f <file-name>.yaml - $ oc create -f <file-name>.yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - For example: - oc create -f eo-namespace.yaml - $ oc create -f eo-namespace.yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
 
- Create a namespace for the Red Hat OpenShift Logging Operator: - Create a namespace object YAML file (for example, - olo-namespace.yaml) for the Red Hat OpenShift Logging Operator:- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Create the namespace: - oc create -f <file-name>.yaml - $ oc create -f <file-name>.yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - For example: - oc create -f olo-namespace.yaml - $ oc create -f olo-namespace.yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
 
- Install the OpenShift Elasticsearch Operator by creating the following objects: - Create an Operator Group object YAML file (for example, - eo-og.yaml) for the OpenShift Elasticsearch Operator:- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- You must specify theopenshift-operators-redhatnamespace.
 
- Create an Operator Group object: - oc create -f <file-name>.yaml - $ oc create -f <file-name>.yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - For example: - oc create -f eo-og.yaml - $ oc create -f eo-og.yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Create a Subscription object YAML file (for example, - eo-sub.yaml) to subscribe a namespace to the OpenShift Elasticsearch Operator.- Example Subscription - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- You must specify theopenshift-operators-redhatnamespace.
- 2
- Specify5.0,stable, orstable-5.<x>as the channel. See the following note.
- 3
- Specifyredhat-operators. If your OpenShift Container Platform cluster is installed on a restricted network, also known as a disconnected cluster, specify the name of the CatalogSource object created when you configured the Operator Lifecycle Manager (OLM).
 Note- Specifying - stableinstalls the current version of the latest stable release. Using- stablewith- installPlanApproval: "Automatic", will automatically upgrade your operators to the latest stable major and minor release.- Specifying - stable-5.<x>installs the current minor version of a specific major release. Using- stable-5.<x>with- installPlanApproval: "Automatic", will automatically upgrade your operators to the latest stable minor release within the major release you specify with- x.
- Create the Subscription object: - oc create -f <file-name>.yaml - $ oc create -f <file-name>.yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - For example: - oc create -f eo-sub.yaml - $ oc create -f eo-sub.yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - The OpenShift Elasticsearch Operator is installed to the - openshift-operators-redhatnamespace and copied to each project in the cluster.
- Verify the Operator installation: - oc get csv --all-namespaces - $ oc get csv --all-namespaces- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - There should be an OpenShift Elasticsearch Operator in each namespace. The version number might be different than shown. 
 
- Install the Red Hat OpenShift Logging Operator by creating the following objects: - Create an Operator Group object YAML file (for example, - olo-og.yaml) for the Red Hat OpenShift Logging Operator:- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Create the OperatorGroup object: - oc create -f <file-name>.yaml - $ oc create -f <file-name>.yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - For example: - oc create -f olo-og.yaml - $ oc create -f olo-og.yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Create a Subscription object YAML file (for example, - olo-sub.yaml) to subscribe a namespace to the Red Hat OpenShift Logging Operator.- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- You must specify theopenshift-loggingnamespace.
- 2
- Specify5.0,stable, orstable-5.<x>as the channel.
- 3
- Specifyredhat-operators. If your OpenShift Container Platform cluster is installed on a restricted network, also known as a disconnected cluster, specify the name of the CatalogSource object you created when you configured the Operator Lifecycle Manager (OLM).
 - oc create -f <file-name>.yaml - $ oc create -f <file-name>.yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - For example: - oc create -f olo-sub.yaml - $ oc create -f olo-sub.yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - The Red Hat OpenShift Logging Operator is installed to the - openshift-loggingnamespace.
- Verify the Operator installation. - There should be a Red Hat OpenShift Logging Operator in the - openshift-loggingnamespace. The Version number might be different than shown.- oc get csv -n openshift-logging - $ oc get csv -n openshift-logging- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - NAMESPACE NAME DISPLAY VERSION REPLACES PHASE ... openshift-logging clusterlogging.5.1.0-202007012112.p0 OpenShift Logging 5.1.0-202007012112.p0 Succeeded ... - NAMESPACE NAME DISPLAY VERSION REPLACES PHASE ... openshift-logging clusterlogging.5.1.0-202007012112.p0 OpenShift Logging 5.1.0-202007012112.p0 Succeeded ...- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
 
- Create an OpenShift Logging instance: - Create an instance object YAML file (for example, - olo-instance.yaml) for the Red Hat OpenShift Logging Operator:Note- This default OpenShift Logging configuration should support a wide array of environments. Review the topics on tuning and configuring OpenShift Logging components for information on modifications you can make to your OpenShift Logging cluster. - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- The name must beinstance.
- 2
- The OpenShift Logging management state. In some cases, if you change the OpenShift Logging defaults, you must set this toUnmanaged. However, an unmanaged deployment does not receive updates until OpenShift Logging is placed back into a managed state. Placing a deployment back into a managed state might revert any modifications you made.
- 3
- Settings for configuring Elasticsearch. Using the custom resource (CR), you can configure shard replication policy and persistent storage.
- 4
- Specify the length of time that Elasticsearch should retain each log source. Enter an integer and a time designation: weeks(w), hours(h/H), minutes(m) and seconds(s). For example,7dfor seven days. Logs older than themaxAgeare deleted. You must specify a retention policy for each log source or the Elasticsearch indices will not be created for that source.
- 5
- Specify the number of Elasticsearch nodes. See the note that follows this list.
- 6
- Enter the name of an existing storage class for Elasticsearch storage. For best performance, specify a storage class that allocates block storage. If you do not specify a storage class, OpenShift Container Platform deploys OpenShift Logging with ephemeral storage only.
- 7
- Specify the CPU and memory requests for Elasticsearch as needed. If you leave these values blank, the OpenShift Elasticsearch Operator sets default values that are sufficient for most deployments. The default values are16Gifor the memory request and1for the CPU request.
- 8
- Specify the CPU and memory requests for the Elasticsearch proxy as needed. If you leave these values blank, the OpenShift Elasticsearch Operator sets default values that should be sufficient for most deployments. The default values are256Mifor the memory request and100mfor the CPU request.
- 9
- Settings for configuring Kibana. Using the CR, you can scale Kibana for redundancy and configure the CPU and memory for your Kibana pods. For more information, see Configuring the log visualizer.
- 10
- Settings for configuring Fluentd. Using the CR, you can configure Fluentd CPU and memory limits. For more information, see Configuring Fluentd.
 Note- The maximum number of Elasticsearch control plane nodes is three. If you specify a - nodeCountgreater than- 3, OpenShift Container Platform creates three Elasticsearch nodes that are Master-eligible nodes, with the master, client, and data roles. The additional Elasticsearch nodes are created as Data-only nodes, using client and data roles. Control plane nodes perform cluster-wide actions such as creating or deleting an index, shard allocation, and tracking nodes. Data nodes hold the shards and perform data-related operations such as CRUD, search, and aggregations. Data-related operations are I/O-, memory-, and CPU-intensive. It is important to monitor these resources and to add more Data nodes if the current nodes are overloaded.- For example, if - nodeCount=4, the following nodes are created:- oc get deployment - $ oc get deployment- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - cluster-logging-operator 1/1 1 1 18h elasticsearch-cd-x6kdekli-1 1/1 1 0 6m54s elasticsearch-cdm-x6kdekli-1 1/1 1 1 18h elasticsearch-cdm-x6kdekli-2 1/1 1 0 6m49s elasticsearch-cdm-x6kdekli-3 1/1 1 0 6m44s - cluster-logging-operator 1/1 1 1 18h elasticsearch-cd-x6kdekli-1 1/1 1 0 6m54s elasticsearch-cdm-x6kdekli-1 1/1 1 1 18h elasticsearch-cdm-x6kdekli-2 1/1 1 0 6m49s elasticsearch-cdm-x6kdekli-3 1/1 1 0 6m44s- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - The number of primary shards for the index templates is equal to the number of Elasticsearch data nodes. 
- Create the instance: - oc create -f <file-name>.yaml - $ oc create -f <file-name>.yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - For example: - oc create -f olo-instance.yaml - $ oc create -f olo-instance.yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - This creates the OpenShift Logging components, the - Elasticsearchcustom resource and components, and the Kibana interface.
 
- Verify the installation by listing the pods in the openshift-logging project. - You should see several pods for OpenShift Logging, Elasticsearch, Fluentd, and Kibana similar to the following list: - oc get pods -n openshift-logging - $ oc get pods -n openshift-logging- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
3.4. Post-installation tasks
If you plan to use Kibana, you must manually create your Kibana index patterns and visualizations to explore and visualize data in Kibana.
If your cluster network provider enforces network isolation, allow network traffic between the projects that contain the OpenShift Logging operators.
3.4.1. Defining Kibana index patterns
An index pattern defines the Elasticsearch indices that you want to visualize. To explore and visualize data in Kibana, you must create an index pattern.
Prerequisites
- A user must have the - cluster-adminrole, the- cluster-readerrole, or both roles to view the infra and audit indices in Kibana. The default- kubeadminuser has proper permissions to view these indices.- If you can view the pods and logs in the - default,- kube-and- openshift-projects, you should be able to access these indices. You can use the following command to check if the current user has appropriate permissions:- oc auth can-i get pods/log -n <project> - $ oc auth can-i get pods/log -n <project>- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - yes - yes- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow Note- The audit logs are not stored in the internal OpenShift Container Platform Elasticsearch instance by default. To view the audit logs in Kibana, you must use the Log Forwarding API to configure a pipeline that uses the - defaultoutput for audit logs.
- Elasticsearch documents must be indexed before you can create index patterns. This is done automatically, but it might take a few minutes in a new or updated cluster.
Procedure
To define index patterns and create visualizations in Kibana:
- 
							In the OpenShift Container Platform console, click the Application Launcher 
							 and select Logging. and select Logging.
- Create your Kibana index patterns by clicking Management → Index Patterns → Create index pattern: - 
									Each user must manually create index patterns when logging into Kibana the first time to see logs for their projects. Users must create an index pattern named appand use the@timestamptime field to view their container logs.
- 
									Each admin user must create index patterns when logged into Kibana the first time for the app,infra, andauditindices using the@timestamptime field.
 
- 
									Each user must manually create index patterns when logging into Kibana the first time to see logs for their projects. Users must create an index pattern named 
- Create Kibana Visualizations from the new index patterns.
3.4.2. Allowing traffic between projects when network isolation is enabled
Your cluster network provider might enforce network isolation. If so, you must allow network traffic between the projects that contain the operators deployed by OpenShift Logging.
					Network isolation blocks network traffic between pods or services that are in different projects. OpenShift Logging installs the OpenShift Elasticsearch Operator in the openshift-operators-redhat project and the Red Hat OpenShift Logging Operator in the openshift-logging project. Therefore, you must allow traffic between these two projects.
				
OpenShift Container Platform offers two supported choices for the default Container Network Interface (CNI) network provider, OpenShift SDN and OVN-Kubernetes. These two providers implement various network isolation policies.
OpenShift SDN has three modes:
- network policy
- This is the default mode. If no policy is defined, it allows all traffic. However, if a user defines a policy, they typically start by denying all traffic and then adding exceptions. This process might break applications that are running in different projects. Therefore, explicitly configure the policy to allow traffic to egress from one logging-related project to the other.
- multitenant
- This mode enforces network isolation. You must join the two logging-related projects to allow traffic between them.
- subnet
- This mode allows all traffic. It does not enforce network isolation. No action is needed.
OVN-Kubernetes always uses a network policy. Therefore, as with OpenShift SDN, you must configure the policy to allow traffic to egress from one logging-related project to the other.
Procedure
- If you are using OpenShift SDN in multitenant mode, join the two projects. For example: - oc adm pod-network join-projects --to=openshift-operators-redhat openshift-logging - $ oc adm pod-network join-projects --to=openshift-operators-redhat openshift-logging- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Otherwise, for OpenShift SDN in network policy mode and OVN-Kubernetes, perform the following actions: - Set a label on the - openshift-operators-redhatnamespace. For example:- oc label namespace openshift-operators-redhat project=openshift-operators-redhat - $ oc label namespace openshift-operators-redhat project=openshift-operators-redhat- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Create a network policy object in the - openshift-loggingnamespace that allows ingress from the- openshift-operators-redhat,- openshift-monitoringand- openshift-ingressprojects to the openshift-logging project. For example:- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
 
Chapter 4. Configuring your Logging deployment
4.1. About the Cluster Logging custom resource
				To configure OpenShift Logging, you customize the ClusterLogging custom resource (CR).
			
4.1.1. About the ClusterLogging custom resource
					To make changes to your OpenShift Logging environment, create and modify the ClusterLogging custom resource (CR).
				
Instructions for creating or modifying a CR are provided in this documentation as appropriate.
The following example shows a typical custom resource for OpenShift Logging.
Sample ClusterLogging custom resource (CR)
- 1
- The CR name must beinstance.
- 2
- The CR must be installed to theopenshift-loggingnamespace.
- 3
- The Red Hat OpenShift Logging Operator management state. When set tounmanagedthe operator is in an unsupported state and will not get updates.
- 4
- Settings for the log store, including retention policy, the number of nodes, the resource requests and limits, and the storage class.
- 5
- Settings for the visualizer, including the resource requests and limits, and the number of pod replicas.
- 6
- Settings for the log collector, including the resource requests and limits.
4.2. Configuring the logging collector
OpenShift Container Platform uses Fluentd to collect operations and application logs from your cluster and enriches the data with Kubernetes pod and project metadata.
				You can configure the CPU and memory limits for the log collector and move the log collector pods to specific nodes. All supported modifications to the log collector can be performed though the spec.collection.log.fluentd stanza in the ClusterLogging custom resource (CR).
			
4.2.1. About unsupported configurations
The supported way of configuring OpenShift Logging is by configuring it using the options described in this documentation. Do not use other configurations, as they are unsupported. Configuration paradigms might change across OpenShift Container Platform releases, and such cases can only be handled gracefully if all configuration possibilities are controlled. If you use configurations other than those described in this documentation, your changes will disappear because the OpenShift Elasticsearch Operator and Red Hat OpenShift Logging Operator reconcile any differences. The Operators reverse everything to the defined state by default and by design.
If you must perform configurations not described in the OpenShift Container Platform documentation, you must set your Red Hat OpenShift Logging Operator or OpenShift Elasticsearch Operator to Unmanaged. An unmanaged OpenShift Logging environment is not supported and does not receive updates until you return OpenShift Logging to Managed.
4.2.2. Viewing logging collector pods
					You can view the Fluentd logging collector pods and the corresponding nodes that they are running on. The Fluentd logging collector pods run only in the openshift-logging project.
				
Procedure
- 
							Run the following command in the openshift-loggingproject to view the Fluentd logging collector pods and their details:
oc get pods --selector component=fluentd -o wide -n openshift-logging
$ oc get pods --selector component=fluentd -o wide -n openshift-loggingExample output
4.2.3. Configure log collector CPU and memory limits
The log collector allows for adjustments to both the CPU and memory limits.
Procedure
- Edit the - ClusterLoggingcustom resource (CR) in the- openshift-loggingproject:- oc -n openshift-logging edit ClusterLogging instance - $ oc -n openshift-logging edit ClusterLogging instance- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- Specify the CPU and memory limits and requests as needed. The values shown are the default values.
 
4.2.4. Advanced configuration for the log forwarder
OpenShift Logging includes multiple Fluentd parameters that you can use for tuning the performance of the Fluentd log forwarder. With these parameters, you can change the following Fluentd behaviors:
- the size of Fluentd chunks and chunk buffer
- the Fluentd chunk flushing behavior
- the Fluentd chunk forwarding retry behavior
Fluentd collects log data in a single blob called a chunk. When Fluentd creates a chunk, the chunk is considered to be in the stage, where the chunk gets filled with data. When the chunk is full, Fluentd moves the chunk to the queue, where chunks are held before being flushed, or written out to their destination. Fluentd can fail to flush a chunk for a number of reasons, such as network issues or capacity issues at the destination. If a chunk cannot be flushed, Fluentd retries flushing as configured.
By default in OpenShift Container Platform, Fluentd uses the exponential backoff method to retry flushing, where Fluentd doubles the time it waits between attempts to retry flushing again, which helps reduce connection requests to the destination. You can disable exponential backoff and use the periodic retry method instead, which retries flushing the chunks at a specified interval. By default, Fluentd retries chunk flushing indefinitely. In OpenShift Container Platform, you cannot change the indefinite retry behavior.
These parameters can help you determine the trade-offs between latency and throughput.
- To optimize Fluentd for throughput, you could use these parameters to reduce network packet count by configuring larger buffers and queues, delaying flushes, and setting longer times between retries. Be aware that larger buffers require more space on the node file system.
- To optimize for low latency, you could use the parameters to send data as soon as possible, avoid the build-up of batches, have shorter queues and buffers, and use more frequent flush and retries.
					You can configure the chunking and flushing behavior using the following parameters in the ClusterLogging custom resource (CR). The parameters are then automatically added to the Fluentd config map for use by Fluentd.
				
These parameters are:
- Not relevant to most users. The default settings should give good general performance.
- Only for advanced users with detailed knowledge of Fluentd configuration and performance.
- Only for performance tuning. They have no effect on functional aspects of logging.
| Parmeter | Description | Default | 
|---|---|---|
| 
									 | The maximum size of each chunk. Fluentd stops writing data to a chunk when it reaches this size. Then, Fluentd sends the chunk to the queue and opens a new chunk. | 
									 | 
| 
									 | The maximum size of the buffer, which is the total size of the stage and the queue. If the buffer size exceeds this value, Fluentd stops adding data to chunks and fails with an error. All data not in chunks is lost. | 
									 | 
| 
									 | 
									The interval between chunk flushes. You can use  | 
									 | 
| 
									 | The method to perform flushes: 
 | 
									 | 
| 
									 | The number of threads that perform chunk flushing. Increasing the number of threads improves the flush throughput, which hides network latency. | 
									 | 
| 
									 | The chunking behavior when the queue is full: 
 | 
									 | 
| 
									 | 
									The maximum time in seconds for the  | 
									 | 
| 
									 | The retry method when flushing fails: 
 | 
									 | 
| 
									 | The time in seconds before the next chunk flush. | 
									 | 
For more information on the Fluentd chunk lifecycle, see Buffer Plugins in the Fluentd documentation.
Procedure
- Edit the - ClusterLoggingcustom resource (CR) in the- openshift-loggingproject:- oc edit ClusterLogging instance - $ oc edit ClusterLogging instance- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Add or modify any of the following parameters: - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- Specify the maximum size of each chunk before it is queued for flushing.
- 2
- Specify the interval between chunk flushes.
- 3
- Specify the method to perform chunk flushes:lazy,interval, orimmediate.
- 4
- Specify the number of threads to use for chunk flushes.
- 5
- Specify the chunking behavior when the queue is full:throw_exception,block, ordrop_oldest_chunk.
- 6
- Specify the maximum interval in seconds for theexponential_backoffchunk flushing method.
- 7
- Specify the retry type when chunk flushing fails:exponential_backofforperiodic.
- 8
- Specify the time in seconds before the next chunk flush.
- 9
- Specify the maximum size of the chunk buffer.
 
- Verify that the Fluentd pods are redeployed: - oc get pods -n openshift-logging - $ oc get pods -n openshift-logging- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Check that the new values are in the - fluentdconfig map:- oc extract configmap/fluentd --confirm - $ oc extract configmap/fluentd --confirm- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example fluentd.conf - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
4.2.5. Removing unused components if you do not use the default Elasticsearch log store
As an administrator, in the rare case that you forward logs to a third-party log store and do not use the default Elasticsearch log store, you can remove several unused components from your logging cluster.
					In other words, if you do not use the default Elasticsearch log store, you can remove the internal Elasticsearch logStore and Kibana visualization components from the ClusterLogging custom resource (CR). Removing these components is optional but saves resources.
				
Prerequisites
- Verify that your log forwarder does not send log data to the default internal Elasticsearch cluster. Inspect the - ClusterLogForwarderCR YAML file that you used to configure log forwarding. Verify that it does not have an- outputRefselement that specifies- default. For example:- outputRefs: - default - outputRefs: - default- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
						Suppose the ClusterLogForwarder CR forwards log data to the internal Elasticsearch cluster, and you remove the logStore component from the ClusterLogging CR. In that case, the internal Elasticsearch cluster will not be present to store the log data. This absence can cause data loss.
					
Procedure
- Edit the - ClusterLoggingcustom resource (CR) in the- openshift-loggingproject:- oc edit ClusterLogging instance - $ oc edit ClusterLogging instance- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- 
							If they are present, remove the logStoreandvisualizationstanzas from theClusterLoggingCR.
- Preserve the - collectionstanza of the- ClusterLoggingCR. The result should look similar to the following example:- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Verify that the Fluentd pods are redeployed: - oc get pods -n openshift-logging - $ oc get pods -n openshift-logging- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
4.3. Configuring the log store
OpenShift Container Platform uses Elasticsearch 6 (ES) to store and organize the log data.
You can make modifications to your log store, including:
- storage for your Elasticsearch cluster
- shard replication across data nodes in the cluster, from full replication to no replication
- external access to Elasticsearch data
				Elasticsearch is a memory-intensive application. Each Elasticsearch node needs at least 16G of memory for both memory requests and limits, unless you specify otherwise in the ClusterLogging custom resource. The initial set of OpenShift Container Platform nodes might not be large enough to support the Elasticsearch cluster. You must add additional nodes to the OpenShift Container Platform cluster to run with the recommended or higher memory, up to a maximum of 64G for each Elasticsearch node.
			
Each Elasticsearch node can operate with a lower memory setting, though this is not recommended for production environments.
4.3.1. Forward audit logs to the log store
Because the internal OpenShift Container Platform Elasticsearch log store does not provide secure storage for audit logs, by default audit logs are not stored in the internal Elasticsearch instance.
If you want to send the audit logs to the internal log store, for example to view the audit logs in Kibana, you must use the Log Forward API.
The internal OpenShift Container Platform Elasticsearch log store does not provide secure storage for audit logs. We recommend you ensure that the system to which you forward audit logs is compliant with your organizational and governmental regulations and is properly secured. OpenShift Logging does not comply with those regulations.
Procedure
To use the Log Forward API to forward audit logs to the internal Elasticsearch instance:
- Create a - ClusterLogForwarderCR YAML file or edit your existing CR:- Create a CR to send all log types to the internal Elasticsearch instance. You can use the following example without making any changes: - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- A pipeline defines the type of logs to forward using the specified output. The default output forwards logs to the internal Elasticsearch instance.
 Note- You must specify all three types of logs in the pipeline: application, infrastructure, and audit. If you do not specify a log type, those logs are not stored and will be lost. 
- If you have an existing - ClusterLogForwarderCR, add a pipeline to the default output for the audit logs. You do not need to define the default output. For example:- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- This pipeline sends the audit logs to the internal Elasticsearch instance in addition to an external instance.
 
 
4.3.2. Configuring log retention time
You can configure a retention policy that specifies how long the default Elasticsearch log store keeps indices for each of the three log sources: infrastructure logs, application logs, and audit logs.
					To configure the retention policy, you set a maxAge parameter for each log source in the ClusterLogging custom resource (CR). The CR applies these values to the Elasticsearch rollover schedule, which determines when Elasticsearch deletes the rolled-over indices.
				
Elasticsearch rolls over an index, moving the current index and creating a new index, when an index matches any of the following conditions:
- 
							The index is older than the rollover.maxAgevalue in theElasticsearchCR.
- The index size is greater than 40 GB × the number of primary shards.
- The index doc count is greater than 40960 KB × the number of primary shards.
Elasticsearch deletes the rolled-over indices based on the retention policy you configure. If you do not create a retention policy for any log sources, logs are deleted after seven days by default.
Prerequisites
- OpenShift Logging and the OpenShift Elasticsearch Operator must be installed.
Procedure
To configure the log retention time:
- Edit the - ClusterLoggingCR to add or modify the- retentionPolicyparameter:- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- Specify the time that Elasticsearch should retain each log source. Enter an integer and a time designation: weeks(w), hours(h/H), minutes(m) and seconds(s). For example,1dfor one day. Logs older than themaxAgeare deleted. By default, logs are retained for seven days.
 
- You can verify the settings in the - Elasticsearchcustom resource (CR).- For example, the Red Hat OpenShift Logging Operator updated the following - ElasticsearchCR to configure a retention policy that includes settings to roll over active indices for the infrastructure logs every eight hours and the rolled-over indices are deleted seven days after rollover. OpenShift Container Platform checks every 15 minutes to determine if the indices need to be rolled over.- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- For each log source, the retention policy indicates when to delete and roll over logs for that source.
- 2
- When OpenShift Container Platform deletes the rolled-over indices. This setting is themaxAgeyou set in theClusterLoggingCR.
- 3
- The index age for OpenShift Container Platform to consider when rolling over the indices. This value is determined from themaxAgeyou set in theClusterLoggingCR.
- 4
- When OpenShift Container Platform checks if the indices should be rolled over. This setting is the default and cannot be changed.
 Note- Modifying the - ElasticsearchCR is not supported. All changes to the retention policies must be made in the- ClusterLoggingCR.- The OpenShift Elasticsearch Operator deploys a cron job to roll over indices for each mapping using the defined policy, scheduled using the - pollInterval.- oc get cronjob - $ oc get cronjob- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - NAME SCHEDULE SUSPEND ACTIVE LAST SCHEDULE AGE elasticsearch-im-app */15 * * * * False 0 <none> 4s elasticsearch-im-audit */15 * * * * False 0 <none> 4s elasticsearch-im-infra */15 * * * * False 0 <none> 4s - NAME SCHEDULE SUSPEND ACTIVE LAST SCHEDULE AGE elasticsearch-im-app */15 * * * * False 0 <none> 4s elasticsearch-im-audit */15 * * * * False 0 <none> 4s elasticsearch-im-infra */15 * * * * False 0 <none> 4s- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
4.3.3. Configuring CPU and memory requests for the log store
Each component specification allows for adjustments to both the CPU and memory requests. You should not have to manually adjust these values as the OpenShift Elasticsearch Operator sets values sufficient for your environment.
In large-scale clusters, the default memory limit for the Elasticsearch proxy container might not be sufficient, causing the proxy container to be OOMKilled. If you experience this issue, increase the memory requests and limits for the Elasticsearch proxy.
Each Elasticsearch node can operate with a lower memory setting though this is not recommended for production deployments. For production use, you should have no less than the default 16Gi allocated to each pod. Preferably you should allocate as much as possible, up to 64Gi per pod.
Prerequisites
- OpenShift Logging and Elasticsearch must be installed.
Procedure
- Edit the - ClusterLoggingcustom resource (CR) in the- openshift-loggingproject:- oc edit ClusterLogging instance - $ oc edit ClusterLogging instance- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- Specify the CPU and memory requests for Elasticsearch as needed. If you leave these values blank, the OpenShift Elasticsearch Operator sets default values that should be sufficient for most deployments. The default values are16Gifor the memory request and1for the CPU request.
- 2
- The maximum amount of resources a pod can use.
- 3
- The minimum resources required to schedule a pod.
- 4
- Specify the CPU and memory requests for the Elasticsearch proxy as needed. If you leave these values blank, the OpenShift Elasticsearch Operator sets default values that are sufficient for most deployments. The default values are256Mifor the memory request and100mfor the CPU request.
 
					When adjusting the amount of Elasticsearch memory, the same value should be used for both requests and limits.
				
For example:
					Kubernetes generally adheres the node configuration and does not allow Elasticsearch to use the specified limits. Setting the same value for the requests and limits ensures that Elasticsearch can use the memory you want, assuming the node has the memory available.
				
4.3.4. Configuring replication policy for the log store
You can define how Elasticsearch shards are replicated across data nodes in the cluster.
Prerequisites
- OpenShift Logging and Elasticsearch must be installed.
Procedure
- Edit the - ClusterLoggingcustom resource (CR) in the- openshift-loggingproject:- oc edit clusterlogging instance - $ oc edit clusterlogging instance- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- Specify a redundancy policy for the shards. The change is applied upon saving the changes.- FullRedundancy. Elasticsearch fully replicates the primary shards for each index to every data node. This provides the highest safety, but at the cost of the highest amount of disk required and the poorest performance.
- MultipleRedundancy. Elasticsearch fully replicates the primary shards for each index to half of the data nodes. This provides a good tradeoff between safety and performance.
- SingleRedundancy. Elasticsearch makes one copy of the primary shards for each index. Logs are always available and recoverable as long as at least two data nodes exist. Better performance than MultipleRedundancy, when using 5 or more nodes. You cannot apply this policy on deployments of single Elasticsearch node.
- ZeroRedundancy. Elasticsearch does not make copies of the primary shards. Logs might be unavailable or lost in the event a node is down or fails. Use this mode when you are more concerned with performance than safety, or have implemented your own disk/PVC backup/restore strategy.
 
 
The number of primary shards for the index templates is equal to the number of Elasticsearch data nodes.
4.3.5. Scaling down Elasticsearch pods
Reducing the number of Elasticsearch pods in your cluster can result in data loss or Elasticsearch performance degradation.
					If you scale down, you should scale down by one pod at a time and allow the cluster to re-balance the shards and replicas. After the Elasticsearch health status returns to green, you can scale down by another pod.
				
						If your Elasticsearch cluster is set to ZeroRedundancy, you should not scale down your Elasticsearch pods.
					
4.3.6. Configuring persistent storage for the log store
Elasticsearch requires persistent storage. The faster the storage, the faster the Elasticsearch performance.
Using NFS storage as a volume or a persistent volume (or via NAS such as Gluster) is not supported for Elasticsearch storage, as Lucene relies on file system behavior that NFS does not supply. Data corruption and other problems can occur.
Prerequisites
- OpenShift Logging and Elasticsearch must be installed.
Procedure
- Edit the - ClusterLoggingCR to specify that each data node in the cluster is bound to a Persistent Volume Claim.- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
This example specifies each data node in the cluster is bound to a Persistent Volume Claim that requests "200G" of AWS General Purpose SSD (gp2) storage.
						If you use a local volume for persistent storage, do not use a raw block volume, which is described with volumeMode: block in the LocalVolume object. Elasticsearch cannot use raw block volumes.
					
4.3.7. Configuring the log store for emptyDir storage
You can use emptyDir with your log store, which creates an ephemeral deployment in which all of a pod’s data is lost upon restart.
When using emptyDir, if log storage is restarted or redeployed, you will lose data.
Prerequisites
- OpenShift Logging and Elasticsearch must be installed.
Procedure
- Edit the - ClusterLoggingCR to specify emptyDir:- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
4.3.8. Performing an Elasticsearch rolling cluster restart
					Perform a rolling restart when you change the elasticsearch config map or any of the elasticsearch-* deployment configurations.
				
Also, a rolling restart is recommended if the nodes on which an Elasticsearch pod runs requires a reboot.
Prerequisites
- OpenShift Logging and Elasticsearch must be installed.
Procedure
To perform a rolling cluster restart:
- Change to the - openshift-loggingproject:- oc project openshift-logging - $ oc project openshift-logging- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Get the names of the Elasticsearch pods: - oc get pods | grep elasticsearch- - $ oc get pods | grep elasticsearch-- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Scale down the Fluentd pods so they stop sending new logs to Elasticsearch: - oc -n openshift-logging patch daemonset/logging-fluentd -p '{"spec":{"template":{"spec":{"nodeSelector":{"logging-infra-fluentd": "false"}}}}}'- $ oc -n openshift-logging patch daemonset/logging-fluentd -p '{"spec":{"template":{"spec":{"nodeSelector":{"logging-infra-fluentd": "false"}}}}}'- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Perform a shard synced flush using the OpenShift Container Platform es_util tool to ensure there are no pending operations waiting to be written to disk prior to shutting down: - oc exec <any_es_pod_in_the_cluster> -c elasticsearch -- es_util --query="_flush/synced" -XPOST - $ oc exec <any_es_pod_in_the_cluster> -c elasticsearch -- es_util --query="_flush/synced" -XPOST- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - For example: - oc exec -c elasticsearch-cdm-5ceex6ts-1-dcd6c4c7c-jpw6 -c elasticsearch -- es_util --query="_flush/synced" -XPOST - $ oc exec -c elasticsearch-cdm-5ceex6ts-1-dcd6c4c7c-jpw6 -c elasticsearch -- es_util --query="_flush/synced" -XPOST- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - {"_shards":{"total":4,"successful":4,"failed":0},".security":{"total":2,"successful":2,"failed":0},".kibana_1":{"total":2,"successful":2,"failed":0}}- {"_shards":{"total":4,"successful":4,"failed":0},".security":{"total":2,"successful":2,"failed":0},".kibana_1":{"total":2,"successful":2,"failed":0}}- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Prevent shard balancing when purposely bringing down nodes using the OpenShift Container Platform es_util tool: - oc exec <any_es_pod_in_the_cluster> -c elasticsearch -- es_util --query="_cluster/settings" -XPUT -d '{ "persistent": { "cluster.routing.allocation.enable" : "primaries" } }'- $ oc exec <any_es_pod_in_the_cluster> -c elasticsearch -- es_util --query="_cluster/settings" -XPUT -d '{ "persistent": { "cluster.routing.allocation.enable" : "primaries" } }'- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - For example: - oc exec elasticsearch-cdm-5ceex6ts-1-dcd6c4c7c-jpw6 -c elasticsearch -- es_util --query="_cluster/settings" -XPUT -d '{ "persistent": { "cluster.routing.allocation.enable" : "primaries" } }'- $ oc exec elasticsearch-cdm-5ceex6ts-1-dcd6c4c7c-jpw6 -c elasticsearch -- es_util --query="_cluster/settings" -XPUT -d '{ "persistent": { "cluster.routing.allocation.enable" : "primaries" } }'- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - {"acknowledged":true,"persistent":{"cluster":{"routing":{"allocation":{"enable":"primaries"}}}},"transient":- {"acknowledged":true,"persistent":{"cluster":{"routing":{"allocation":{"enable":"primaries"}}}},"transient":- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- After the command is complete, for each deployment you have for an ES cluster: - By default, the OpenShift Container Platform Elasticsearch cluster blocks rollouts to their nodes. Use the following command to allow rollouts and allow the pod to pick up the changes: - oc rollout resume deployment/<deployment-name> - $ oc rollout resume deployment/<deployment-name>- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - For example: - oc rollout resume deployment/elasticsearch-cdm-0-1 - $ oc rollout resume deployment/elasticsearch-cdm-0-1- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - deployment.extensions/elasticsearch-cdm-0-1 resumed - deployment.extensions/elasticsearch-cdm-0-1 resumed- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - A new pod is deployed. After the pod has a ready container, you can move on to the next deployment. - oc get pods | grep elasticsearch- - $ oc get pods | grep elasticsearch-- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - NAME READY STATUS RESTARTS AGE elasticsearch-cdm-5ceex6ts-1-dcd6c4c7c-jpw6k 2/2 Running 0 22h elasticsearch-cdm-5ceex6ts-2-f799564cb-l9mj7 2/2 Running 0 22h elasticsearch-cdm-5ceex6ts-3-585968dc68-k7kjr 2/2 Running 0 22h - NAME READY STATUS RESTARTS AGE elasticsearch-cdm-5ceex6ts-1-dcd6c4c7c-jpw6k 2/2 Running 0 22h elasticsearch-cdm-5ceex6ts-2-f799564cb-l9mj7 2/2 Running 0 22h elasticsearch-cdm-5ceex6ts-3-585968dc68-k7kjr 2/2 Running 0 22h- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- After the deployments are complete, reset the pod to disallow rollouts: - oc rollout pause deployment/<deployment-name> - $ oc rollout pause deployment/<deployment-name>- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - For example: - oc rollout pause deployment/elasticsearch-cdm-0-1 - $ oc rollout pause deployment/elasticsearch-cdm-0-1- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - deployment.extensions/elasticsearch-cdm-0-1 paused - deployment.extensions/elasticsearch-cdm-0-1 paused- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Check that the Elasticsearch cluster is in a - greenor- yellowstate:- oc exec <any_es_pod_in_the_cluster> -c elasticsearch -- es_util --query=_cluster/health?pretty=true - $ oc exec <any_es_pod_in_the_cluster> -c elasticsearch -- es_util --query=_cluster/health?pretty=true- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow Note- If you performed a rollout on the Elasticsearch pod you used in the previous commands, the pod no longer exists and you need a new pod name here. - For example: - oc exec elasticsearch-cdm-5ceex6ts-1-dcd6c4c7c-jpw6 -c elasticsearch -- es_util --query=_cluster/health?pretty=true - $ oc exec elasticsearch-cdm-5ceex6ts-1-dcd6c4c7c-jpw6 -c elasticsearch -- es_util --query=_cluster/health?pretty=true- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- Make sure this parameter value isgreenoryellowbefore proceeding.
 
 
- If you changed the Elasticsearch configuration map, repeat these steps for each Elasticsearch pod.
- After all the deployments for the cluster have been rolled out, re-enable shard balancing: - oc exec <any_es_pod_in_the_cluster> -c elasticsearch -- es_util --query="_cluster/settings" -XPUT -d '{ "persistent": { "cluster.routing.allocation.enable" : "all" } }'- $ oc exec <any_es_pod_in_the_cluster> -c elasticsearch -- es_util --query="_cluster/settings" -XPUT -d '{ "persistent": { "cluster.routing.allocation.enable" : "all" } }'- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - For example: - oc exec elasticsearch-cdm-5ceex6ts-1-dcd6c4c7c-jpw6 -c elasticsearch -- es_util --query="_cluster/settings" -XPUT -d '{ "persistent": { "cluster.routing.allocation.enable" : "all" } }'- $ oc exec elasticsearch-cdm-5ceex6ts-1-dcd6c4c7c-jpw6 -c elasticsearch -- es_util --query="_cluster/settings" -XPUT -d '{ "persistent": { "cluster.routing.allocation.enable" : "all" } }'- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Scale up the Fluentd pods so they send new logs to Elasticsearch. - oc -n openshift-logging patch daemonset/logging-fluentd -p '{"spec":{"template":{"spec":{"nodeSelector":{"logging-infra-fluentd": "true"}}}}}'- $ oc -n openshift-logging patch daemonset/logging-fluentd -p '{"spec":{"template":{"spec":{"nodeSelector":{"logging-infra-fluentd": "true"}}}}}'- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
4.3.9. Exposing the log store service as a route
By default, the log store that is deployed with OpenShift Logging is not accessible from outside the logging cluster. You can enable a route with re-encryption termination for external access to the log store service for those tools that access its data.
Externally, you can access the log store by creating a reencrypt route, your OpenShift Container Platform token and the installed log store CA certificate. Then, access a node that hosts the log store service with a cURL request that contains:
- 
							The Authorization: Bearer ${token}
- The Elasticsearch reencrypt route and an Elasticsearch API request.
Internally, you can access the log store service using the log store cluster IP, which you can get by using either of the following commands:
oc get service elasticsearch -o jsonpath={.spec.clusterIP} -n openshift-logging
$ oc get service elasticsearch -o jsonpath={.spec.clusterIP} -n openshift-loggingExample output
172.30.183.229
172.30.183.229oc get service elasticsearch -n openshift-logging
$ oc get service elasticsearch -n openshift-loggingExample output
NAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE elasticsearch ClusterIP 172.30.183.229 <none> 9200/TCP 22h
NAME            TYPE        CLUSTER-IP       EXTERNAL-IP   PORT(S)    AGE
elasticsearch   ClusterIP   172.30.183.229   <none>        9200/TCP   22hYou can check the cluster IP address with a command similar to the following:
oc exec elasticsearch-cdm-oplnhinv-1-5746475887-fj2f8 -n openshift-logging -- curl -tlsv1.2 --insecure -H "Authorization: Bearer ${token}" "https://172.30.183.229:9200/_cat/health"
$ oc exec elasticsearch-cdm-oplnhinv-1-5746475887-fj2f8 -n openshift-logging -- curl -tlsv1.2 --insecure -H "Authorization: Bearer ${token}" "https://172.30.183.229:9200/_cat/health"Example output
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100    29  100    29    0     0    108      0 --:--:-- --:--:-- --:--:--   108
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100    29  100    29    0     0    108      0 --:--:-- --:--:-- --:--:--   108Prerequisites
- OpenShift Logging and Elasticsearch must be installed.
- You must have access to the project to be able to access to the logs.
Procedure
To expose the log store externally:
- Change to the - openshift-loggingproject:- oc project openshift-logging - $ oc project openshift-logging- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Extract the CA certificate from the log store and write to the admin-ca file: - oc extract secret/elasticsearch --to=. --keys=admin-ca - $ oc extract secret/elasticsearch --to=. --keys=admin-ca- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - admin-ca - admin-ca- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Create the route for the log store service as a YAML file: - Create a YAML file with the following: - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- Add the log store CA certifcate or use the command in the next step. You do not have to set thespec.tls.key,spec.tls.certificate, andspec.tls.caCertificateparameters required by some reencrypt routes.
 
- Run the following command to add the log store CA certificate to the route YAML you created in the previous step: - cat ./admin-ca | sed -e "s/^/ /" >> <file-name>.yaml - $ cat ./admin-ca | sed -e "s/^/ /" >> <file-name>.yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Create the route: - oc create -f <file-name>.yaml - $ oc create -f <file-name>.yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - route.route.openshift.io/elasticsearch created - route.route.openshift.io/elasticsearch created- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
 
- Check that the Elasticsearch service is exposed: - Get the token of this service account to be used in the request: - token=$(oc whoami -t) - $ token=$(oc whoami -t)- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Set the elasticsearch route you created as an environment variable. - routeES=`oc get route elasticsearch -o jsonpath={.spec.host}`- $ routeES=`oc get route elasticsearch -o jsonpath={.spec.host}`- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- To verify the route was successfully created, run the following command that accesses Elasticsearch through the exposed route: - curl -tlsv1.2 --insecure -H "Authorization: Bearer ${token}" "https://${routeES}"- curl -tlsv1.2 --insecure -H "Authorization: Bearer ${token}" "https://${routeES}"- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - The response appears similar to the following: - Example output - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
 
4.4. Configuring the log visualizer
OpenShift Container Platform uses Kibana to display the log data collected by OpenShift Logging.
You can scale Kibana for redundancy and configure the CPU and memory for your Kibana nodes.
4.4.1. Configuring CPU and memory limits
The OpenShift Logging components allow for adjustments to both the CPU and memory limits.
Procedure
- Edit the - ClusterLoggingcustom resource (CR) in the- openshift-loggingproject:- oc -n openshift-logging edit ClusterLogging instance - $ oc -n openshift-logging edit ClusterLogging instance- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- Specify the CPU and memory limits and requests for the log store as needed. For Elasticsearch, you must adjust both the request value and the limit value.
- 2 3
- Specify the CPU and memory limits and requests for the log visualizer as needed.
- 4
- Specify the CPU and memory limits and requests for the log collector as needed.
 
4.4.2. Scaling redundancy for the log visualizer nodes
You can scale the pod that hosts the log visualizer for redundancy.
Procedure
- Edit the - ClusterLoggingcustom resource (CR) in the- openshift-loggingproject:- oc edit ClusterLogging instance - $ oc edit ClusterLogging instance- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- Specify the number of Kibana nodes.
 
4.5. Configuring OpenShift Logging storage
Elasticsearch is a memory-intensive application. The default OpenShift Logging installation deploys 16G of memory for both memory requests and memory limits. The initial set of OpenShift Container Platform nodes might not be large enough to support the Elasticsearch cluster. You must add additional nodes to the OpenShift Container Platform cluster to run with the recommended or higher memory. Each Elasticsearch node can operate with a lower memory setting, though this is not recommended for production environments.
4.5.1. Storage considerations for OpenShift Logging and OpenShift Container Platform
A persistent volume is required for each Elasticsearch deployment configuration. On OpenShift Container Platform this is achieved using persistent volume claims.
						If you use a local volume for persistent storage, do not use a raw block volume, which is described with volumeMode: block in the LocalVolume object. Elasticsearch cannot use raw block volumes.
					
The OpenShift Elasticsearch Operator names the PVCs using the Elasticsearch resource name.
Fluentd ships any logs from systemd journal and /var/log/containers/ to Elasticsearch.
Elasticsearch requires sufficient memory to perform large merge operations. If it does not have enough memory, it becomes unresponsive. To avoid this problem, evaluate how much application log data you need, and allocate approximately double that amount of free storage capacity.
By default, when storage capacity is 85% full, Elasticsearch stops allocating new data to the node. At 90%, Elasticsearch attempts to relocate existing shards from that node to other nodes if possible. But if no nodes have a free capacity below 85%, Elasticsearch effectively rejects creating new indices and becomes RED.
These low and high watermark values are Elasticsearch defaults in the current release. You can modify these default values. Although the alerts use the same default values, you cannot change these values in the alerts.
4.6. Configuring CPU and memory limits for OpenShift Logging components
You can configure both the CPU and memory limits for each of the OpenShift Logging components as needed.
4.6.1. Configuring CPU and memory limits
The OpenShift Logging components allow for adjustments to both the CPU and memory limits.
Procedure
- Edit the - ClusterLoggingcustom resource (CR) in the- openshift-loggingproject:- oc -n openshift-logging edit ClusterLogging instance - $ oc -n openshift-logging edit ClusterLogging instance- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- Specify the CPU and memory limits and requests for the log store as needed. For Elasticsearch, you must adjust both the request value and the limit value.
- 2 3
- Specify the CPU and memory limits and requests for the log visualizer as needed.
- 4
- Specify the CPU and memory limits and requests for the log collector as needed.
 
4.7. Using tolerations to control OpenShift Logging pod placement
You can use taints and tolerations to ensure that OpenShift Logging pods run on specific nodes and that no other workload can run on those nodes.
				Taints and tolerations are simple key:value pair. A taint on a node instructs the node to repel all pods that do not tolerate the taint.
			
				The key is any string, up to 253 characters and the value is any string up to 63 characters. The string must begin with a letter or number, and may contain letters, numbers, hyphens, dots, and underscores.
			
Sample OpenShift Logging CR with tolerations
4.7.1. Using tolerations to control the log store pod placement
You can control which nodes the log store pods runs on and prevent other workloads from using those nodes by using tolerations on the pods.
					You apply tolerations to the log store pods through the ClusterLogging custom resource (CR) and apply taints to a node through the node specification. A taint on a node is a key:value pair that instructs the node to repel all pods that do not tolerate the taint. Using a specific key:value pair that is not on other pods ensures only the log store pods can run on that node.
				
By default, the log store pods have the following toleration:
tolerations: - effect: "NoExecute" key: "node.kubernetes.io/disk-pressure" operator: "Exists"
tolerations:
- effect: "NoExecute"
  key: "node.kubernetes.io/disk-pressure"
  operator: "Exists"Prerequisites
- OpenShift Logging and Elasticsearch must be installed.
Procedure
- Use the following command to add a taint to a node where you want to schedule the OpenShift Logging pods: - oc adm taint nodes <node-name> <key>=<value>:<effect> - $ oc adm taint nodes <node-name> <key>=<value>:<effect>- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - For example: - oc adm taint nodes node1 elasticsearch=node:NoExecute - $ oc adm taint nodes node1 elasticsearch=node:NoExecute- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - This example places a taint on - node1that has key- elasticsearch, value- node, and taint effect- NoExecute. Nodes with the- NoExecuteeffect schedule only pods that match the taint and remove existing pods that do not match.
- Edit the - logstoresection of the- ClusterLoggingCR to configure a toleration for the Elasticsearch pods:- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- Specify the key that you added to the node.
- 2
- Specify theExistsoperator to require a taint with the keyelasticsearchto be present on the Node.
- 3
- Specify theNoExecuteeffect.
- 4
- Optionally, specify thetolerationSecondsparameter to set how long a pod can remain bound to a node before being evicted.
 
					This toleration matches the taint created by the oc adm taint command. A pod with this toleration could be scheduled onto node1.
				
4.7.2. Using tolerations to control the log visualizer pod placement
You can control the node where the log visualizer pod runs and prevent other workloads from using those nodes by using tolerations on the pods.
					You apply tolerations to the log visualizer pod through the ClusterLogging custom resource (CR) and apply taints to a node through the node specification. A taint on a node is a key:value pair that instructs the node to repel all pods that do not tolerate the taint. Using a specific key:value pair that is not on other pods ensures only the Kibana pod can run on that node.
				
Prerequisites
- OpenShift Logging and Elasticsearch must be installed.
Procedure
- Use the following command to add a taint to a node where you want to schedule the log visualizer pod: - oc adm taint nodes <node-name> <key>=<value>:<effect> - $ oc adm taint nodes <node-name> <key>=<value>:<effect>- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - For example: - oc adm taint nodes node1 kibana=node:NoExecute - $ oc adm taint nodes node1 kibana=node:NoExecute- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - This example places a taint on - node1that has key- kibana, value- node, and taint effect- NoExecute. You must use the- NoExecutetaint effect.- NoExecuteschedules only pods that match the taint and remove existing pods that do not match.
- Edit the - visualizationsection of the- ClusterLoggingCR to configure a toleration for the Kibana pod:- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
					This toleration matches the taint created by the oc adm taint command. A pod with this toleration would be able to schedule onto node1.
				
4.7.3. Using tolerations to control the log collector pod placement
You can ensure which nodes the logging collector pods run on and prevent other workloads from using those nodes by using tolerations on the pods.
					You apply tolerations to logging collector pods through the ClusterLogging custom resource (CR) and apply taints to a node through the node specification. You can use taints and tolerations to ensure the pod does not get evicted for things like memory and CPU issues.
				
By default, the logging collector pods have the following toleration:
tolerations: - key: "node-role.kubernetes.io/master" operator: "Exists" effect: "NoExecute"
tolerations:
- key: "node-role.kubernetes.io/master"
  operator: "Exists"
  effect: "NoExecute"Prerequisites
- OpenShift Logging and Elasticsearch must be installed.
Procedure
- Use the following command to add a taint to a node where you want logging collector pods to schedule logging collector pods: - oc adm taint nodes <node-name> <key>=<value>:<effect> - $ oc adm taint nodes <node-name> <key>=<value>:<effect>- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - For example: - oc adm taint nodes node1 collector=node:NoExecute - $ oc adm taint nodes node1 collector=node:NoExecute- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - This example places a taint on - node1that has key- collector, value- node, and taint effect- NoExecute. You must use the- NoExecutetaint effect.- NoExecuteschedules only pods that match the taint and removes existing pods that do not match.
- Edit the - collectionstanza of the- ClusterLoggingcustom resource (CR) to configure a toleration for the logging collector pods:- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
					This toleration matches the taint created by the oc adm taint command. A pod with this toleration would be able to schedule onto node1.
				
4.8. Moving OpenShift Logging resources with node selectors
You can use node selectors to deploy the Elasticsearch and Kibana pods to different nodes.
4.8.1. Moving OpenShift Logging resources
You can configure the Cluster Logging Operator to deploy the pods for OpenShift Logging components, such as Elasticsearch and Kibana, to different nodes. You cannot move the Cluster Logging Operator pod from its installed location.
For example, you can move the Elasticsearch pods to a separate node because of high CPU, memory, and disk requirements.
Prerequisites
- OpenShift Logging and Elasticsearch must be installed. These features are not installed by default.
Procedure
- Edit the - ClusterLoggingcustom resource (CR) in the- openshift-loggingproject:- oc edit ClusterLogging instance - $ oc edit ClusterLogging instance- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
Verification
						To verify that a component has moved, you can use the oc get pod -o wide command.
					
For example:
- You want to move the Kibana pod from the - ip-10-0-147-79.us-east-2.compute.internalnode:- oc get pod kibana-5b8bdf44f9-ccpq9 -o wide - $ oc get pod kibana-5b8bdf44f9-ccpq9 -o wide- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - NAME READY STATUS RESTARTS AGE IP NODE NOMINATED NODE READINESS GATES kibana-5b8bdf44f9-ccpq9 2/2 Running 0 27s 10.129.2.18 ip-10-0-147-79.us-east-2.compute.internal <none> <none> - NAME READY STATUS RESTARTS AGE IP NODE NOMINATED NODE READINESS GATES kibana-5b8bdf44f9-ccpq9 2/2 Running 0 27s 10.129.2.18 ip-10-0-147-79.us-east-2.compute.internal <none> <none>- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- You want to move the Kibana Pod to the - ip-10-0-139-48.us-east-2.compute.internalnode, a dedicated infrastructure node:- oc get nodes - $ oc get nodes- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Note that the node has a - node-role.kubernetes.io/infra: ''label:- oc get node ip-10-0-139-48.us-east-2.compute.internal -o yaml - $ oc get node ip-10-0-139-48.us-east-2.compute.internal -o yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- To move the Kibana pod, edit the - ClusterLoggingCR to add a node selector:- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- Add a node selector to match the label in the node specification.
 
- After you save the CR, the current Kibana pod is terminated and new pod is deployed: - oc get pods - $ oc get pods- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- The new pod is on the - ip-10-0-139-48.us-east-2.compute.internalnode:- oc get pod kibana-7d85dcffc8-bfpfp -o wide - $ oc get pod kibana-7d85dcffc8-bfpfp -o wide- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - NAME READY STATUS RESTARTS AGE IP NODE NOMINATED NODE READINESS GATES kibana-7d85dcffc8-bfpfp 2/2 Running 0 43s 10.131.0.22 ip-10-0-139-48.us-east-2.compute.internal <none> <none> - NAME READY STATUS RESTARTS AGE IP NODE NOMINATED NODE READINESS GATES kibana-7d85dcffc8-bfpfp 2/2 Running 0 43s 10.131.0.22 ip-10-0-139-48.us-east-2.compute.internal <none> <none>- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- After a few moments, the original Kibana pod is removed. - oc get pods - $ oc get pods- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
4.9. Configuring systemd-journald and Fluentd
Because Fluentd reads from the journal, and the journal default settings are very low, journal entries can be lost because the journal cannot keep up with the logging rate from system services.
				We recommend setting RateLimitIntervalSec=30s and RateLimitBurst=10000 (or even higher if necessary) to prevent the journal from losing entries.
			
4.9.1. Configuring systemd-journald for OpenShift Logging
As you scale up your project, the default logging environment might need some adjustments.
For example, if you are missing logs, you might have to increase the rate limits for journald. You can adjust the number of messages to retain for a specified period of time to ensure that OpenShift Logging does not use excessive resources without dropping logs.
You can also determine if you want the logs compressed, how long to retain logs, how or if the logs are stored, and other settings.
Procedure
- Create a - journald.conffile with the required settings:- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- Specify whether you want logs compressed before they are written to the file system. Specifyyesto compress the message ornoto not compress. The default isyes.
- 2
- Configure whether to forward log messages. Defaults tonofor each. Specify:- 
											ForwardToConsoleto forward logs to the system console.
- 
											ForwardToKsmgto forward logs to the kernel log buffer.
- 
											ForwardToSyslogto forward to a syslog daemon.
- 
											ForwardToWallto forward messages as wall messages to all logged-in users.
 
- 
											
- 3
- Specify the maximum time to store journal entries. Enter a number to specify seconds. Or include a unit: "year", "month", "week", "day", "h" or "m". Enter0to disable. The default is1month.
- 4
- Configure rate limiting. If, during the time interval defined byRateLimitIntervalSec, more logs than specified inRateLimitBurstare received, all further messages within the interval are dropped until the interval is over. It is recommended to setRateLimitIntervalSec=30sandRateLimitBurst=10000, which are the defaults.
- 5
- Specify how logs are stored. The default ispersistent:- 
											volatileto store logs in memory in/var/log/journal/.
- 
											persistentto store logs to disk in/var/log/journal/. systemd creates the directory if it does not exist.
- 
											autoto store logs in in/var/log/journal/if the directory exists. If it does not exist, systemd temporarily stores logs in/run/systemd/journal.
- 
											noneto not store logs. systemd drops all logs.
 
- 
											
- 6
- Specify the timeout before synchronizing journal files to disk for ERR, WARNING, NOTICE, INFO, and DEBUG logs. systemd immediately syncs after receiving a CRIT, ALERT, or EMERG log. The default is1s.
- 7
- Specify the maximum size the journal can use. The default is8G.
- 8
- Specify how much disk space systemd must leave free. The default is20%.
- 9
- Specify the maximum size for individual journal files stored persistently in/var/log/journal. The default is10M.NoteIf you are removing the rate limit, you might see increased CPU utilization on the system logging daemons as it processes any messages that would have previously been throttled. For more information on systemd settings, see https://www.freedesktop.org/software/systemd/man/journald.conf.html. The default settings listed on that page might not apply to OpenShift Container Platform. 
 
- Convert the - journal.conffile to base64 and store it in a variable that is named- jrnl_cnfby running the following command:- export jrnl_cnf=$( cat journald.conf | base64 -w0 ) - $ export jrnl_cnf=$( cat journald.conf | base64 -w0 )- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Create a - MachineConfigobject that includes the- jrnl_cnfvariable, which was created in the previous step. The following sample command creates a- MachineConfigobject for the worker:- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- Optional: For control plane (also known as master) node, you can provide the file name as40-master-custom-journald.yaml.
- 2
- Optional: For control plane (also known as master) node, provide the role asmaster.
- 3
- Optional: For control plane (also known as master) node, you can provide the name as40-master-custom-journald.
- 4
- Optional: To include a static copy of the parameters in thejournald.conffile, replace${jrnl_cnf}with the output of theecho $jrnl_cnfcommand.
- 5
- Set the permissions for thejournal.conffile. It is recommended to set0644permissions.
 
- Create the machine config by running the following command: - oc apply -f <file_name>.yaml - $ oc apply -f <file_name>.yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - The controller detects the new - MachineConfigobject and generates a new- rendered-worker-<hash>version.
- Monitor the status of the rollout of the new rendered configuration to each node by running the following command: - oc describe machineconfigpool/<node> - $ oc describe machineconfigpool/<node>- 1 - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- Specify the node asmasterorworker.
 - Example output for worker - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
4.10. Maintenance and support
4.10.1. About unsupported configurations
The supported way of configuring OpenShift Logging is by configuring it using the options described in this documentation. Do not use other configurations, as they are unsupported. Configuration paradigms might change across OpenShift Container Platform releases, and such cases can only be handled gracefully if all configuration possibilities are controlled. If you use configurations other than those described in this documentation, your changes will disappear because the OpenShift Elasticsearch Operator and Red Hat OpenShift Logging Operator reconcile any differences. The Operators reverse everything to the defined state by default and by design.
If you must perform configurations not described in the OpenShift Container Platform documentation, you must set your Red Hat OpenShift Logging Operator or OpenShift Elasticsearch Operator to Unmanaged. An unmanaged OpenShift Logging environment is not supported and does not receive updates until you return OpenShift Logging to Managed.
4.10.2. Unsupported configurations
You must set the Red Hat OpenShift Logging Operator to the unmanaged state to modify the following components:
- 
							The ElasticsearchCR
- The Kibana deployment
- 
							The fluent.conffile
- The Fluentd daemon set
You must set the OpenShift Elasticsearch Operator to the unmanaged state to modify the following component:
- the Elasticsearch deployment files.
Explicitly unsupported cases include:
- Configuring default log rotation. You cannot modify the default log rotation configuration.
- 
							Configuring the collected log location. You cannot change the location of the log collector output file, which by default is /var/log/fluentd/fluentd.log.
- Throttling log collection. You cannot throttle down the rate at which the logs are read in by the log collector.
- Configuring the logging collector using environment variables. You cannot use environment variables to modify the log collector.
- Configuring how the log collector normalizes logs. You cannot modify default log normalization.
4.10.3. Support policy for unmanaged Operators
The management state of an Operator determines whether an Operator is actively managing the resources for its related component in the cluster as designed. If an Operator is set to an unmanaged state, it does not respond to changes in configuration nor does it receive updates.
While this can be helpful in non-production clusters or during debugging, Operators in an unmanaged state are unsupported and the cluster administrator assumes full control of the individual component configurations and upgrades.
An Operator can be set to an unmanaged state using the following methods:
- Individual Operator configuration - Individual Operators have a - managementStateparameter in their configuration. This can be accessed in different ways, depending on the Operator. For example, the Red Hat OpenShift Logging Operator accomplishes this by modifying a custom resource (CR) that it manages, while the Cluster Samples Operator uses a cluster-wide configuration resource.- Changing the - managementStateparameter to- Unmanagedmeans that the Operator is not actively managing its resources and will take no action related to the related component. Some Operators might not support this management state as it might damage the cluster and require manual recovery.Warning- Changing individual Operators to the - Unmanagedstate renders that particular component and functionality unsupported. Reported issues must be reproduced in- Managedstate for support to proceed.
- Cluster Version Operator (CVO) overrides - The - spec.overridesparameter can be added to the CVO’s configuration to allow administrators to provide a list of overrides to the CVO’s behavior for a component. Setting the- spec.overrides[].unmanagedparameter to- truefor a component blocks cluster upgrades and alerts the administrator after a CVO override has been set:- Disabling ownership via cluster version overrides prevents upgrades. Please remove overrides before continuing. - Disabling ownership via cluster version overrides prevents upgrades. Please remove overrides before continuing.- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow Warning- Setting a CVO override puts the entire cluster in an unsupported state. Reported issues must be reproduced after removing any overrides for support to proceed. 
Chapter 5. Viewing logs for a resource
You can view the logs for various resources, such as builds, deployments, and pods by using the OpenShift CLI (oc) and the web console.
Resource logs are a default feature that provides limited log viewing capability. To enhance your log retrieving and viewing experience, it is recommended that you install OpenShift Logging. OpenShift Logging aggregates all the logs from your OpenShift Container Platform cluster, such as node system audit logs, application container logs, and infrastructure logs, into a dedicated log store. You can then query, discover, and visualize your log data through the Kibana interface. Resource logs do not access the OpenShift Logging log store.
5.1. Viewing resource logs
You can view the log for various resources in the OpenShift CLI (oc) and web console. Logs read from the tail, or end, of the log.
Prerequisites
- Access to the OpenShift CLI (oc).
Procedure (UI)
- In the OpenShift Container Platform console, navigate to Workloads → Pods or navigate to the pod through the resource you want to investigate. Note- Some resources, such as builds, do not have pods to query directly. In such instances, you can locate the Logs link on the Details page for the resource. 
- Select a project from the drop-down menu.
- Click the name of the pod you want to investigate.
- Click Logs.
Procedure (CLI)
- View the log for a specific pod: - oc logs -f <pod_name> -c <container_name> - $ oc logs -f <pod_name> -c <container_name>- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - where: - -f
- Optional: Specifies that the output follows what is being written into the logs.
- <pod_name>
- Specifies the name of the pod.
- <container_name>
- Optional: Specifies the name of a container. When a pod has more than one container, you must specify the container name.
 - For example: - oc logs ruby-58cd97df55-mww7r - $ oc logs ruby-58cd97df55-mww7r- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - oc logs -f ruby-57f7f4855b-znl92 -c ruby - $ oc logs -f ruby-57f7f4855b-znl92 -c ruby- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - The contents of log files are printed out. 
- View the log for a specific resource: - oc logs <object_type>/<resource_name> - $ oc logs <object_type>/<resource_name>- 1 - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- Specifies the resource type and name.
 - For example: - oc logs deployment/ruby - $ oc logs deployment/ruby- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - The contents of log files are printed out. 
Chapter 6. Viewing cluster logs by using Kibana
OpenShift Logging includes a web console for visualizing collected log data. Currently, OpenShift Container Platform deploys the Kibana console for visualization.
Using the log visualizer, you can do the following with your data:
- search and browse the data using the Discover tab.
- chart and map the data using the Visualize tab.
- create and view custom dashboards using the Dashboard tab.
Use and configuration of the Kibana interface is beyond the scope of this documentation. For more information, on using the interface, see the Kibana documentation.
				The audit logs are not stored in the internal OpenShift Container Platform Elasticsearch instance by default. To view the audit logs in Kibana, you must use the Log Forwarding API to configure a pipeline that uses the default output for audit logs.
			
6.1. Defining Kibana index patterns
An index pattern defines the Elasticsearch indices that you want to visualize. To explore and visualize data in Kibana, you must create an index pattern.
Prerequisites
- A user must have the - cluster-adminrole, the- cluster-readerrole, or both roles to view the infra and audit indices in Kibana. The default- kubeadminuser has proper permissions to view these indices.- If you can view the pods and logs in the - default,- kube-and- openshift-projects, you should be able to access these indices. You can use the following command to check if the current user has appropriate permissions:- oc auth can-i get pods/log -n <project> - $ oc auth can-i get pods/log -n <project>- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - yes - yes- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow Note- The audit logs are not stored in the internal OpenShift Container Platform Elasticsearch instance by default. To view the audit logs in Kibana, you must use the Log Forwarding API to configure a pipeline that uses the - defaultoutput for audit logs.
- Elasticsearch documents must be indexed before you can create index patterns. This is done automatically, but it might take a few minutes in a new or updated cluster.
Procedure
To define index patterns and create visualizations in Kibana:
- 
						In the OpenShift Container Platform console, click the Application Launcher 
						 and select Logging. and select Logging.
- Create your Kibana index patterns by clicking Management → Index Patterns → Create index pattern: - 
								Each user must manually create index patterns when logging into Kibana the first time to see logs for their projects. Users must create an index pattern named appand use the@timestamptime field to view their container logs.
- 
								Each admin user must create index patterns when logged into Kibana the first time for the app,infra, andauditindices using the@timestamptime field.
 
- 
								Each user must manually create index patterns when logging into Kibana the first time to see logs for their projects. Users must create an index pattern named 
- Create Kibana Visualizations from the new index patterns.
6.2. Viewing cluster logs in Kibana
You view cluster logs in the Kibana web console. The methods for viewing and visualizing your data in Kibana that are beyond the scope of this documentation. For more information, refer to the Kibana documentation.
Prerequisites
- OpenShift Logging and Elasticsearch must be installed.
- Kibana index patterns must exist.
- A user must have the - cluster-adminrole, the- cluster-readerrole, or both roles to view the infra and audit indices in Kibana. The default- kubeadminuser has proper permissions to view these indices.- If you can view the pods and logs in the - default,- kube-and- openshift-projects, you should be able to access these indices. You can use the following command to check if the current user has appropriate permissions:- oc auth can-i get pods/log -n <project> - $ oc auth can-i get pods/log -n <project>- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - yes - yes- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow Note- The audit logs are not stored in the internal OpenShift Container Platform Elasticsearch instance by default. To view the audit logs in Kibana, you must use the Log Forwarding API to configure a pipeline that uses the - defaultoutput for audit logs.
Procedure
To view logs in Kibana:
- 
						In the OpenShift Container Platform console, click the Application Launcher 
						 and select Logging. and select Logging.
- Log in using the same credentials you use to log in to the OpenShift Container Platform console. - The Kibana interface launches. 
- In Kibana, click Discover.
- Select the index pattern you created from the drop-down menu in the top-left corner: app, audit, or infra. - The log data displays as time-stamped documents. 
- Expand one of the time-stamped documents.
- Click the JSON tab to display the log entry for that document. - Example 6.1. Sample infrastructure log entry in Kibana - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
Chapter 7. Forwarding logs to third-party systems
			By default, OpenShift Logging sends container and infrastructure logs to the default internal Elasticsearch log store defined in the ClusterLogging custom resource. However, it does not send audit logs to the internal store because it does not provide secure storage. If this default configuration meets your needs, you do not need to configure the Cluster Log Forwarder.
		
To send logs to other log aggregators, you use the OpenShift Container Platform Cluster Log Forwarder. This API enables you to send container, infrastructure, and audit logs to specific endpoints within or outside your cluster. In addition, you can send different types of logs to various systems so that various individuals can access each type. You can also enable Transport Layer Security (TLS) support to send logs securely, as required by your organization.
To send audit logs to the internal log store, use the Cluster Log Forwarder as described in Forward audit logs to the log store.
When you forward logs externally, the Red Hat OpenShift Logging Operator creates or modifies a Fluentd config map to send logs using your desired protocols. You are responsible for configuring the protocol on the external log aggregator.
Alternatively, you can create a config map to use the Fluentd forward protocol or the syslog protocol to send logs to external systems. However, these methods for forwarding logs are deprecated in OpenShift Container Platform and will be removed in a future release.
You cannot use the config map methods and the Cluster Log Forwarder in the same cluster.
7.1. About forwarding logs to third-party systems
				Forwarding cluster logs to external third-party systems requires a combination of outputs and pipelines specified in a ClusterLogForwarder custom resource (CR) to send logs to specific endpoints inside and outside of your OpenShift Container Platform cluster. You can also use inputs to forward the application logs associated with a specific project to an endpoint.
			
- An output is the destination for log data that you define, or where you want the logs sent. An output can be one of the following types: - 
								elasticsearch. An external Elasticsearch 6 (all releases) instance. Theelasticsearchoutput can use a TLS connection.
- 
								fluentdForward. An external log aggregation solution that supports Fluentd. This option uses the Fluentd forward protocols. ThefluentForwardoutput can use a TCP or TLS connection and supports shared-key authentication by providing a shared_key field in a secret. Shared-key authentication can be used with or without TLS.
- 
								syslog. An external log aggregation solution that supports the syslog RFC3164 or RFC5424 protocols. Thesyslogoutput can use a UDP, TCP, or TLS connection.
- 
								kafka. A Kafka broker. Thekafkaoutput can use a TCP or TLS connection.
- 
								default. The internal OpenShift Container Platform Elasticsearch instance. You are not required to configure the default output. If you do configure adefaultoutput, you receive an error message because thedefaultoutput is reserved for the Red Hat OpenShift Logging Operator.
 - If the output URL scheme requires TLS (HTTPS, TLS, or UDPS), then TLS server-side authentication is enabled. To also enable client authentication, the output must name a secret in the - openshift-loggingproject. The secret must have keys of: tls.crt, tls.key, and ca-bundle.crt that point to the respective certificates that they represent.
- 
								
- A pipeline defines simple routing from one log type to one or more outputs, or which logs you want to send. The log types are one of the following: - 
								application. Container logs generated by user applications running in the cluster, except infrastructure container applications.
- 
								infrastructure. Container logs from pods that run in theopenshift*,kube*, ordefaultprojects and journal logs sourced from node file system.
- 
								audit. Logs generated by auditd, the node audit system, and the audit logs from the Kubernetes API server and the OpenShift API server.
 - You can add labels to outbound log messages by using - key:valuepairs in the pipeline. For example, you might add a label to messages that are forwarded to others data centers or label the logs by type. Labels that are added to objects are also forwarded with the log message.
- 
								
- An input forwards the application logs associated with a specific project to a pipeline.
				In the pipeline, you define which log types to forward using an inputRef parameter and where to forward the logs to using an outputRef parameter.
			
Note the following:
- 
						If a ClusterLogForwarderCR object exists, logs are not forwarded to the default Elasticsearch instance, unless there is a pipeline with thedefaultoutput.
- 
						By default, OpenShift Logging sends container and infrastructure logs to the default internal Elasticsearch log store defined in the ClusterLoggingcustom resource. However, it does not send audit logs to the internal store because it does not provide secure storage. If this default configuration meets your needs, do not configure the Log Forwarding API.
- 
						If you do not define a pipeline for a log type, the logs of the undefined types are dropped. For example, if you specify a pipeline for the applicationandaudittypes, but do not specify a pipeline for theinfrastructuretype,infrastructurelogs are dropped.
- 
						You can use multiple types of outputs in the ClusterLogForwardercustom resource (CR) to send logs to servers that support different protocols.
- The internal OpenShift Container Platform Elasticsearch instance does not provide secure storage for audit logs. We recommend you ensure that the system to which you forward audit logs is compliant with your organizational and governmental regulations and is properly secured. OpenShift Logging does not comply with those regulations.
- You are responsible for creating and maintaining any additional configurations that external destinations might require, such as keys and secrets, service accounts, port openings, or global proxy configuration.
				The following example forwards the audit logs to a secure external Elasticsearch instance, the infrastructure logs to an insecure external Elasticsearch instance, the application logs to a Kafka broker, and the application logs from the my-apps-logs project to the internal Elasticsearch instance.
			
Sample log forwarding outputs and pipelines
- 1
- The name of theClusterLogForwarderCR must beinstance.
- 2
- The namespace for theClusterLogForwarderCR must beopenshift-logging.
- 3
- Configuration for an secure Elasticsearch output using a secret with a secure URL.- A name to describe the output.
- 
								The type of output: elasticsearch.
- The secure URL and port of the Elasticsearch instance as a valid absolute URL, including the prefix.
- 
								The secret required by the endpoint for TLS communication. The secret must exist in the openshift-loggingproject.
 
- 4
- Configuration for an insecure Elasticsearch output:- A name to describe the output.
- 
								The type of output: elasticsearch.
- The insecure URL and port of the Elasticsearch instance as a valid absolute URL, including the prefix.
 
- 5
- Configuration for a Kafka output using a client-authenticated TLS communication over a secure URL- A name to describe the output.
- 
								The type of output: kafka.
- Specify the URL and port of the Kafka broker as a valid absolute URL, including the prefix.
 
- 6
- Configuration for an input to filter application logs from themy-projectnamespace.
- 7
- Configuration for a pipeline to send audit logs to the secure external Elasticsearch instance:- Optional. A name to describe the pipeline.
- 
								The inputRefsis the log type, in this exampleaudit.
- 
								The outputRefsis the name of the output to use, in this exampleelasticsearch-secureto forward to the secure Elasticsearch instance anddefaultto forward to the internal Elasticsearch instance.
- Optional: Labels to add to the logs.
 
- 8
- Optional: Forward structured JSON log entries as JSON objects in thestructuredfield. The log entry must contain valid structured JSON; otherwise, OpenShift Logging removes thestructuredfield and instead sends the log entry to the default index,app-00000x.
- 9
- Optional: String. One or more labels to add to the logs. Quote values like "true" so they are recognized as string values, not as a boolean.
- 10
- Configuration for a pipeline to send infrastructure logs to the insecure external Elasticsearch instance.
- 11
- Configuration for a pipeline to send logs from themy-projectproject to the internal Elasticsearch instance.- Optional. A name to describe the pipeline.
- 
								The inputRefsis a specific input:my-app-logs.
- 
								The outputRefsisdefault.
- Optional: String. One or more labels to add to the logs.
 
- 12
- Configuration for a pipeline to send logs to the Kafka broker, with no pipeline name:- 
								The inputRefsis the log type, in this exampleapplication.
- 
								The outputRefsis the name of the output to use.
- Optional: String. One or more labels to add to the logs.
 
- 
								The 
Fluentd log handling when the external log aggregator is unavailable
If your external logging aggregator becomes unavailable and cannot receive logs, Fluentd continues to collect logs and stores them in a buffer. When the log aggregator becomes available, log forwarding resumes, including the buffered logs. If the buffer fills completely, Fluentd stops collecting logs. OpenShift Container Platform rotates the logs and deletes them. You cannot adjust the buffer size or add a persistent volume claim (PVC) to the Fluentd daemon set or pods.
7.2. Supported log data output types
Red Hat OpenShift Logging 5.0 provides the following output types and protocols for sending log data to target log collectors.
Red Hat tests each of the combinations shown in the following table. However, you should be able to send log data to a wider range target log collectors that ingest these protocols.
| Output types | Protocols | Tested with | 
|---|---|---|
| fluentdForward | fluentd forward v1 | fluentd 1.7.4 logstash 7.10.1 | 
| elasticsearch | elasticsearch | Elasticsearch 6.8.1 Elasticsearch 7.10.1 | 
| syslog | RFC-3164, RFC-5424 | rsyslog 8.37.0-9.el7 | 
| kafka | kafka 0.11 | kafka 2.4.1 | 
Previously, the syslog output supported only RFC-3164. The current syslog output adds support for RFC-5424.
7.3. Forwarding logs to an external Elasticsearch instance
You can optionally forward logs to an external Elasticsearch instance in addition to, or instead of, the internal OpenShift Container Platform Elasticsearch instance. You are responsible for configuring the external log aggregator to receive log data from OpenShift Container Platform.
				To configure log forwarding to an external Elasticsearch instance, create a ClusterLogForwarder custom resource (CR) with an output to that instance and a pipeline that uses the output. The external Elasticsearch output can use the HTTP (insecure) or HTTPS (secure HTTP) connection.
			
				To forward logs to both an external and the internal Elasticsearch instance, create outputs and pipelines to the external instance and a pipeline that uses the default output to forward logs to the internal instance. You do not need to create a default output. If you do configure a default output, you receive an error message because the default output is reserved for the Red Hat OpenShift Logging Operator.
			
					If you want to forward logs to only the internal OpenShift Container Platform Elasticsearch instance, you do not need to create a ClusterLogForwarder CR.
				
Prerequisites
- You must have a logging server that is configured to receive the logging data using the specified protocol or format.
Procedure
- Create a - ClusterLogForwarderCR YAML file similar to the following:- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- The name of theClusterLogForwarderCR must beinstance.
- 2
- The namespace for theClusterLogForwarderCR must beopenshift-logging.
- 3
- Specify a name for the output.
- 4
- Specify theelasticsearchtype.
- 5
- Specify the URL and port of the external Elasticsearch instance as a valid absolute URL. You can use thehttp(insecure) orhttps(secure HTTP) protocol. If the cluster-wide proxy using the CIDR annotation is enabled, the output must be a server name or FQDN, not an IP Address.
- 6
- If using anhttpsprefix, you must specify the name of the secret required by the endpoint for TLS communication. The secret must exist in theopenshift-loggingproject and must have keys of: tls.crt, tls.key, and ca-bundle.crt that point to the respective certificates that they represent.
- 7
- Optional: Specify a name for the pipeline.
- 8
- Specify which log types should be forwarded using that pipeline:application,infrastructure, oraudit.
- 9
- Specify the output to use with that pipeline for forwarding the logs.
- 10
- Optional: Specify thedefaultoutput to send the logs to the internal Elasticsearch instance.
- 11
- Optional: Forward structured JSON log entries as JSON objects in thestructuredfield. The log entry must contain valid structured JSON; otherwise, OpenShift Logging removes thestructuredfield and instead sends the log entry to the default index,app-00000x.
- 12
- Optional: String. One or more labels to add to the logs.
- 13
- Optional: Configure multiple outputs to forward logs to other external log aggregators of any supported type:- Optional. A name to describe the pipeline.
- 
										The inputRefsis the log type to forward using that pipeline:application,infrastructure, oraudit.
- 
										The outputRefsis the name of the output to use.
- Optional: String. One or more labels to add to the logs.
 
 
- Create the CR object: - oc create -f <file-name>.yaml - $ oc create -f <file-name>.yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
The Red Hat OpenShift Logging Operator redeploys the Fluentd pods. If the pods do not redeploy, you can delete the Fluentd pods to force them to redeploy.
oc delete pod --selector logging-infra=fluentd
$ oc delete pod --selector logging-infra=fluentd7.4. Forwarding logs using the Fluentd forward protocol
You can use the Fluentd forward protocol to send a copy of your logs to an external log aggregator configured to accept the protocol instead of, or in addition to, the default Elasticsearch log store. You are responsible for configuring the external log aggregator to receive the logs from OpenShift Container Platform.
				To configure log forwarding using the forward protocol, create a ClusterLogForwarder custom resource (CR) with one or more outputs to the Fluentd servers and pipelines that use those outputs. The Fluentd output can use a TCP (insecure) or TLS (secure TCP) connection.
			
Alternately, you can use a config map to forward logs using the forward protocols. However, this method is deprecated in OpenShift Container Platform and will be removed in a future release.
Prerequisites
- You must have a logging server that is configured to receive the logging data using the specified protocol or format.
Procedure
- Create a - ClusterLogForwarderCR YAML file similar to the following:- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- The name of theClusterLogForwarderCR must beinstance.
- 2
- The namespace for theClusterLogForwarderCR must beopenshift-logging.
- 3
- Specify a name for the output.
- 4
- Specify thefluentdForwardtype.
- 5
- Specify the URL and port of the external Fluentd instance as a valid absolute URL. You can use thetcp(insecure) ortls(secure TCP) protocol. If the cluster-wide proxy using the CIDR annotation is enabled, the output must be a server name or FQDN, not an IP address.
- 6
- If using atlsprefix, you must specify the name of the secret required by the endpoint for TLS communication. The secret must exist in theopenshift-loggingproject and must have keys of: tls.crt, tls.key, and ca-bundle.crt that point to the respective certificates that they represent.
- 7
- Optional. Specify a name for the pipeline.
- 8
- Specify which log types should be forwarded using that pipeline:application,infrastructure, oraudit.
- 9
- Specify the output to use with that pipeline for forwarding the logs.
- 10
- Optional. Specify thedefaultoutput to forward logs to the internal Elasticsearch instance.
- 11
- Optional: Forward structured JSON log entries as JSON objects in thestructuredfield. The log entry must contain valid structured JSON; otherwise, OpenShift Logging removes thestructuredfield and instead sends the log entry to the default index,app-00000x.
- 12
- Optional: String. One or more labels to add to the logs.
- 13
- Optional: Configure multiple outputs to forward logs to other external log aggregators of any supported type:- Optional. A name to describe the pipeline.
- 
										The inputRefsis the log type to forward using that pipeline:application,infrastructure, oraudit.
- 
										The outputRefsis the name of the output to use.
- Optional: String. One or more labels to add to the logs.
 
 
- Create the CR object: - oc create -f <file-name>.yaml - $ oc create -f <file-name>.yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
The Red Hat OpenShift Logging Operator redeploys the Fluentd pods. If the pods do not redeploy, you can delete the Fluentd pods to force them to redeploy.
oc delete pod --selector logging-infra=fluentd
$ oc delete pod --selector logging-infra=fluentd7.5. Forwarding logs using the syslog protocol
You can use the syslog RFC3164 or RFC5424 protocol to send a copy of your logs to an external log aggregator configured to accept the protocol instead of, or in addition to, the default Elasticsearch log store. You are responsible for configuring the external log aggregator, such as a syslog server, to receive the logs from OpenShift Container Platform.
				To configure log forwarding using the syslog protocol, create a ClusterLogForwarder custom resource (CR) with one or more outputs to the syslog servers and pipelines that use those outputs. The syslog output can use a UDP, TCP, or TLS connection.
			
Alternately, you can use a config map to forward logs using the syslog RFC3164 protocols. However, this method is deprecated in OpenShift Container Platform and will be removed in a future release.
Prerequisites
- You must have a logging server that is configured to receive the logging data using the specified protocol or format.
Procedure
- Create a - ClusterLogForwarderCR YAML file similar to the following:- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- The name of theClusterLogForwarderCR must beinstance.
- 2
- The namespace for theClusterLogForwarderCR must beopenshift-logging.
- 3
- Specify a name for the output.
- 4
- Specify thesyslogtype.
- 5
- Optional. Specify the syslog parameters, listed below.
- 6
- Specify the URL and port of the external syslog instance. You can use theudp(insecure),tcp(insecure) ortls(secure TCP) protocol. If the cluster-wide proxy using the CIDR annotation is enabled, the output must be a server name or FQDN, not an IP address.
- 7
- If using atlsprefix, you must specify the name of the secret required by the endpoint for TLS communication. The secret must exist in theopenshift-loggingproject and must have keys of: tls.crt, tls.key, and ca-bundle.crt that point to the respective certificates that they represent.
- 8
- Optional: Specify a name for the pipeline.
- 9
- Specify which log types should be forwarded using that pipeline:application,infrastructure, oraudit.
- 10
- Specify the output to use with that pipeline for forwarding the logs.
- 11
- Optional: Specify thedefaultoutput to forward logs to the internal Elasticsearch instance.
- 12
- Optional: Forward structured JSON log entries as JSON objects in thestructuredfield. The log entry must contain valid structured JSON; otherwise, OpenShift Logging removes thestructuredfield and instead sends the log entry to the default index,app-00000x.
- 13
- Optional: String. One or more labels to add to the logs. Quote values like "true" so they are recognized as string values, not as a boolean.
- 14
- Optional: Configure multiple outputs to forward logs to other external log aggregators of any supported type:- Optional. A name to describe the pipeline.
- 
										The inputRefsis the log type to forward using that pipeline:application,infrastructure, oraudit.
- 
										The outputRefsis the name of the output to use.
- Optional: String. One or more labels to add to the logs.
 
 
- Create the CR object: - oc create -f <file-name>.yaml - $ oc create -f <file-name>.yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
The Red Hat OpenShift Logging Operator redeploys the Fluentd pods. If the pods do not redeploy, you can delete the Fluentd pods to force them to redeploy.
oc delete pod --selector logging-infra=fluentd
$ oc delete pod --selector logging-infra=fluentd7.5.1. Syslog parameters
					You can configure the following for the syslog outputs. For more information, see the syslog RFC3164 or RFC5424 RFC.
				
- facility: The syslog facility. The value can be a decimal integer or a case-insensitive keyword: - 
									0orkernfor kernel messages
- 
									1oruserfor user-level messages, the default.
- 
									2ormailfor the mail system
- 
									3ordaemonfor system daemons
- 
									4orauthfor security/authentication messages
- 
									5orsyslogfor messages generated internally by syslogd
- 
									6orlprfor the line printer subsystem
- 
									7ornewsfor the network news subsystem
- 
									8oruucpfor the UUCP subsystem
- 
									9orcronfor the clock daemon
- 
									10orauthprivfor security authentication messages
- 
									11orftpfor the FTP daemon
- 
									12orntpfor the NTP subsystem
- 
									13orsecurityfor the syslog audit log
- 
									14orconsolefor the syslog alert log
- 
									15orsolaris-cronfor the scheduling daemon
- 
									16–23orlocal0–local7for locally used facilities
 
- 
									
- Optional. - payloadKey: The record field to use as payload for the syslog message.Note- Configuring the - payloadKeyparameter prevents other parameters from being forwarded to the syslog.
- rfc: The RFC to be used for sending logs using syslog. The default is RFC5424.
- severity: The syslog severity to set on outgoing syslog records. The value can be a decimal integer or a case-insensitive keyword: - 
									0orEmergencyfor messages indicating the system is unusable
- 
									1orAlertfor messages indicating action must be taken immediately
- 
									2orCriticalfor messages indicating critical conditions
- 
									3orErrorfor messages indicating error conditions
- 
									4orWarningfor messages indicating warning conditions
- 
									5orNoticefor messages indicating normal but significant conditions
- 
									6orInformationalfor messages indicating informational messages
- 
									7orDebugfor messages indicating debug-level messages, the default
 
- 
									
- tag: Tag specifies a record field to use as a tag on the syslog message.
- trimPrefix: Remove the specified prefix from the tag.
7.5.2. Additional RFC5424 syslog parameters
The following parameters apply to RFC5424:
- 
							appName: The APP-NAME is a free-text string that identifies the application that sent the log. Must be specified for RFC5424.
- 
							msgID: The MSGID is a free-text string that identifies the type of message. Must be specified for RFC5424.
- 
							procID: The PROCID is a free-text string. A change in the value indicates a discontinuity in syslog reporting. Must be specified for RFC5424.
7.6. Forwarding logs to a Kafka broker
You can forward logs to an external Kafka broker in addition to, or instead of, the default Elasticsearch log store.
				To configure log forwarding to an external Kafka instance, create a ClusterLogForwarder custom resource (CR) with an output to that instance and a pipeline that uses the output. You can include a specific Kafka topic in the output or use the default. The Kafka output can use a TCP (insecure) or TLS (secure TCP) connection.
			
Procedure
- Create a - ClusterLogForwarderCR YAML file similar to the following:- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- The name of theClusterLogForwarderCR must beinstance.
- 2
- The namespace for theClusterLogForwarderCR must beopenshift-logging.
- 3
- Specify a name for the output.
- 4
- Specify thekafkatype.
- 5
- Specify the URL and port of the Kafka broker as a valid absolute URL, optionally with a specific topic. You can use thetcp(insecure) ortls(secure TCP) protocol. If the cluster-wide proxy using the CIDR annotation is enabled, the output must be a server name or FQDN, not an IP address.
- 6
- If using atlsprefix, you must specify the name of the secret required by the endpoint for TLS communication. The secret must exist in theopenshift-loggingproject and must have keys of: tls.crt, tls.key, and ca-bundle.crt that point to the respective certificates that they represent.
- 7
- Optional: To send an insecure output, use atcpprefix in front of the URL. Also omit thesecretkey and itsnamefrom this output.
- 8
- Optional: Specify a name for the pipeline.
- 9
- Specify which log types should be forwarded using that pipeline:application,infrastructure, oraudit.
- 10
- Specify the output to use with that pipeline for forwarding the logs.
- 11
- Optional: Forward structured JSON log entries as JSON objects in thestructuredfield. The log entry must contain valid structured JSON; otherwise, OpenShift Logging removes thestructuredfield and instead sends the log entry to the default index,app-00000x.
- 12
- Optional: String. One or more labels to add to the logs.
- 13
- Optional: Configure multiple outputs to forward logs to other external log aggregators of any supported type:- Optional. A name to describe the pipeline.
- 
										The inputRefsis the log type to forward using that pipeline:application,infrastructure, oraudit.
- 
										The outputRefsis the name of the output to use.
- Optional: String. One or more labels to add to the logs.
 
- 14
- Optional: Specifydefaultto forward logs to the internal Elasticsearch instance.
 
- Optional: To forward a single output to multiple Kafka brokers, specify an array of Kafka brokers as shown in this example: - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Create the CR object: - oc create -f <file-name>.yaml - $ oc create -f <file-name>.yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
The Red Hat OpenShift Logging Operator redeploys the Fluentd pods. If the pods do not redeploy, you can delete the Fluentd pods to force them to redeploy.
oc delete pod --selector logging-infra=fluentd
$ oc delete pod --selector logging-infra=fluentd7.7. Forwarding application logs from specific projects
You can use the Cluster Log Forwarder to send a copy of the application logs from specific projects to an external log aggregator. You can do this in addition to, or instead of, using the default Elasticsearch log store. You must also configure the external log aggregator to receive log data from OpenShift Container Platform.
				To configure forwarding application logs from a project, create a ClusterLogForwarder custom resource (CR) with at least one input from a project, optional outputs for other log aggregators, and pipelines that use those inputs and outputs.
			
Prerequisites
- You must have a logging server that is configured to receive the logging data using the specified protocol or format.
Procedure
- Create a - ClusterLogForwarderCR YAML file similar to the following:- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- The name of theClusterLogForwarderCR must beinstance.
- 2
- The namespace for theClusterLogForwarderCR must beopenshift-logging.
- 3
- Specify a name for the output.
- 4
- Specify the output type:elasticsearch,fluentdForward,syslog, orkafka.
- 5
- Specify the URL and port of the external log aggregator as a valid absolute URL. If the cluster-wide proxy using the CIDR annotation is enabled, the output must be a server name or FQDN, not an IP address.
- 6
- If using atlsprefix, you must specify the name of the secret required by the endpoint for TLS communication. The secret must exist in theopenshift-loggingproject and have tls.crt, tls.key, and ca-bundle.crt keys that each point to the certificates they represent.
- 7
- Configuration for an input to filter application logs from the specified projects.
- 8
- Configuration for a pipeline to use the input to send project application logs to an external Fluentd instance.
- 9
- Themy-app-logsinput.
- 10
- The name of the output to use.
- 11
- Optional: Forward structured JSON log entries as JSON objects in thestructuredfield. The log entry must contain valid structured JSON; otherwise, OpenShift Logging removes thestructuredfield and instead sends the log entry to the default index,app-00000x.
- 12
- Optional: String. One or more labels to add to the logs.
- 13
- Configuration for a pipeline to send logs to other log aggregators.- Optional: Specify a name for the pipeline.
- 
										Specify which log types should be forwarded using that pipeline: application,infrastructure, oraudit.
- Specify the output to use with that pipeline for forwarding the logs.
- 
										Optional: Specify the defaultoutput to forward logs to the internal Elasticsearch instance.
- Optional: String. One or more labels to add to the logs.
 
 
- Create the CR object: - oc create -f <file-name>.yaml - $ oc create -f <file-name>.yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
7.8. Forwarding application logs from specific pods
As a cluster administrator, you can use Kubernetes pod labels to gather log data from specific pods and forward it to a log collector.
Suppose that you have an application composed of pods running alongside other pods in various namespaces. If those pods have labels that identify the application, you can gather and output their log data to a specific log collector.
				To specify the pod labels, you use one or more matchLabels key-value pairs. If you specify multiple key-value pairs, the pods must match all of them to be selected.
			
Procedure
- 
						Create a ClusterLogForwardercustom resource (CR) YAML file.
- In the YAML file, specify the pod labels using simple equality-based selectors under - inputs[].name.application.selector.matchLabels, as shown in the following example.- Example - ClusterLogForwarderCR YAML file- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- The name of theClusterLogForwarderCR must beinstance.
- 2
- The namespace for theClusterLogForwarderCR must beopenshift-logging.
- 3
- Specify one or more comma-separated values frominputs[].name.
- 4
- Specify one or more comma-separated values fromoutputs[].
- 5
- Optional: Forward structured JSON log entries as JSON objects in thestructuredfield. The log entry must contain valid structured JSON; otherwise, OpenShift Logging removes thestructuredfield and instead sends the log entry to the default index,app-00000x.
- 6
- Define a uniqueinputs[].namefor each application that has a unique set of pod labels.
- 7
- Specify the key-value pairs of pod labels whose log data you want to gather. You must specify both a key and value, not just a key. To be selected, the pods must match all the key-value pairs.
- 8
- Optional: Specify one or more namespaces.
- 9
- Specify one or more outputs to forward your log data to. The optionaldefaultoutput shown here sends log data to the internal Elasticsearch instance.
 
- 
						Optional: To restrict the gathering of log data to specific namespaces, use inputs[].name.application.namespaces, as shown in the preceding example.
- Optional: You can send log data from additional applications that have different pod labels to the same pipeline. - 
								For each unique combination of pod labels, create an additional inputs[].namesection similar to the one shown.
- 
								Update the selectorsto match the pod labels of this application.
- Add the new - inputs[].namevalue to- inputRefs. For example:- - inputRefs: [ myAppLogData, myOtherAppLogData ] - - inputRefs: [ myAppLogData, myOtherAppLogData ]- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
 
- 
								For each unique combination of pod labels, create an additional 
- Create the CR object: - oc create -f <file-name>.yaml - $ oc create -f <file-name>.yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
7.9. Forwarding logs using the legacy Fluentd method
You can use the Fluentd forward protocol to send logs to destinations outside of your OpenShift Container Platform cluster by creating a configuration file and config map. You are responsible for configuring the external log aggregator to receive log data from OpenShift Container Platform.
This method for forwarding logs is deprecated in OpenShift Container Platform and will be removed in a future release.
				To send logs using the Fluentd forward protocol, create a configuration file called secure-forward.conf, that points to an external log aggregator. Then, use that file to create a config map called called secure-forward in the openshift-logging project, which OpenShift Container Platform uses when forwarding the logs.
			
Prerequisites
- You must have a logging server that is configured to receive the logging data using the specified protocol or format.
Sample Fluentd configuration file
Procedure
To configure OpenShift Container Platform to forward logs using the legacy Fluentd method:
- Create a configuration file named - secure-forwardand specify parameters similar to the following within the- <store>stanza:- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- Enter the shared key between nodes.
- 2
- Specifytlsto enable TLS validation.
- 3
- Set totrueto verify the server cert hostname. Set tofalseto ignore server cert hostname.
- 4
- Specify the path to the private CA certificate file as/etc/ocp-forward/ca_cert.pem.
- 5
- Specify the Fluentd buffer parameters as needed.
- 6
- Optionally, enter a name for this server.
- 7
- Specify the hostname or IP of the server.
- 8
- Specify the host label of the server.
- 9
- Specify the port of the server.
- 10
- Optionally, add additional servers. If you specify two or more servers, forward uses these server nodes in a round-robin order.
 - To use Mutual TLS (mTLS) authentication, see the Fluentd documentation for information about client certificate, key parameters, and other settings. 
- Create a config map named - secure-forwardin the- openshift-loggingproject from the configuration file:- oc create configmap secure-forward --from-file=secure-forward.conf -n openshift-logging - $ oc create configmap secure-forward --from-file=secure-forward.conf -n openshift-logging- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
The Red Hat OpenShift Logging Operator redeploys the Fluentd pods. If the pods do not redeploy, you can delete the Fluentd pods to force them to redeploy.
oc delete pod --selector logging-infra=fluentd
$ oc delete pod --selector logging-infra=fluentd7.10. Forwarding logs using the legacy syslog method
You can use the syslog RFC3164 protocol to send logs to destinations outside of your OpenShift Container Platform cluster by creating a configuration file and config map. You are responsible for configuring the external log aggregator, such as a syslog server, to receive the logs from OpenShift Container Platform.
This method for forwarding logs is deprecated in OpenShift Container Platform and will be removed in a future release.
There are two versions of the syslog protocol:
- out_syslog: The non-buffered implementation, which communicates through UDP, does not buffer data and writes out results immediately.
- out_syslog_buffered: The buffered implementation, which communicates through TCP and buffers data into chunks.
				To send logs using the syslog protocol, create a configuration file called syslog.conf, with the information needed to forward the logs. Then, use that file to create a config map called syslog in the openshift-logging project, which OpenShift Container Platform uses when forwarding the logs.
			
Prerequisites
- You must have a logging server that is configured to receive the logging data using the specified protocol or format.
Sample syslog configuration file
				You can configure the following syslog parameters. For more information, see the syslog RFC3164.
			
- facility: The syslog facility. The value can be a decimal integer or a case-insensitive keyword: - 
								0orkernfor kernel messages
- 
								1oruserfor user-level messages, the default.
- 
								2ormailfor the mail system
- 
								3ordaemonfor the system daemons
- 
								4orauthfor the security/authentication messages
- 
								5orsyslogfor messages generated internally by syslogd
- 
								6orlprfor the line printer subsystem
- 
								7ornewsfor the network news subsystem
- 
								8oruucpfor the UUCP subsystem
- 
								9orcronfor the clock daemon
- 
								10orauthprivfor security authentication messages
- 
								11orftpfor the FTP daemon
- 
								12orntpfor the NTP subsystem
- 
								13orsecurityfor the syslog audit logs
- 
								14orconsolefor the syslog alert logs
- 
								15orsolaris-cronfor the scheduling daemon
- 
								16–23orlocal0–local7for locally used facilities
 
- 
								
- payloadKey: The record field to use as payload for the syslog message.
- rfc: The RFC to be used for sending logs using syslog.
- severity: The syslog severity to set on outgoing syslog records. The value can be a decimal integer or a case-insensitive keyword: - 
								0orEmergencyfor messages indicating the system is unusable
- 
								1orAlertfor messages indicating action must be taken immediately
- 
								2orCriticalfor messages indicating critical conditions
- 
								3orErrorfor messages indicating error conditions
- 
								4orWarningfor messages indicating warning conditions
- 
								5orNoticefor messages indicating normal but significant conditions
- 
								6orInformationalfor messages indicating informational messages
- 
								7orDebugfor messages indicating debug-level messages, the default
 
- 
								
- tag: The record field to use as a tag on the syslog message.
- trimPrefix: The prefix to remove from the tag.
Procedure
To configure OpenShift Container Platform to forward logs using the legacy configuration methods:
- Create a configuration file named - syslog.confand specify parameters similar to the following within the- <store>stanza:- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- Specify the protocol to use, either:syslogorsyslog_buffered.
- 2
- Specify the FQDN or IP address of the syslog server.
- 3
- Specify the port of the syslog server.
- 4
- Optional: Specify the appropriate syslog parameters, for example:- 
										Parameter to remove the specified tagfield from the syslog prefix.
- Parameter to set the specified field as the syslog key.
- Parameter to specify the syslog log facility or source.
- Parameter to specify the syslog log severity.
- 
										Parameter to use the severity and facility from the record if available. If true, thecontainer_name,namespace_name, andpod_nameare included in the output content.
- 
										Parameter to specify the key to set the payload of the syslog message. Defaults to message.
 
- 
										Parameter to remove the specified 
- 5
- With the legacy syslog method, you must specify3164for therfcvalue.
 
- Create a config map named - syslogin the- openshift-loggingproject from the configuration file:- oc create configmap syslog --from-file=syslog.conf -n openshift-logging - $ oc create configmap syslog --from-file=syslog.conf -n openshift-logging- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
The Red Hat OpenShift Logging Operator redeploys the Fluentd pods. If the pods do not redeploy, you can delete the Fluentd pods to force them to redeploy.
oc delete pod --selector logging-infra=fluentd
$ oc delete pod --selector logging-infra=fluentdChapter 8. Enabling JSON logging
You can configure the Log Forwarding API to parse JSON strings into a structured object.
8.1. Parsing JSON logs
				Logs including JSON logs are usually represented as a string inside the message field. That makes it hard for users to query specific fields inside a JSON document. OpenShift Logging’s Log Forwarding API enables you to parse JSON logs into a structured object and forward them to either OpenShift Logging-managed Elasticsearch or any other third-party system supported by the Log Forwarding API.
			
To illustrate how this works, suppose that you have the following structured JSON log entry.
Example structured JSON log entry
{"level":"info","name":"fred","home":"bedrock"}
{"level":"info","name":"fred","home":"bedrock"}
				Normally, the ClusterLogForwarder custom resource (CR) forwards that log entry in the message field. The message field contains the JSON-quoted string equivalent of the JSON log entry, as shown in the following example.
			
Example message field
{"message":"{\"level\":\"info\",\"name\":\"fred\",\"home\":\"bedrock\"",
 "more fields..."}
{"message":"{\"level\":\"info\",\"name\":\"fred\",\"home\":\"bedrock\"",
 "more fields..."}
				To enable parsing JSON log, you add parse: json to a pipeline in the ClusterLogForwarder CR, as shown in the following example.
			
Example snippet showing parse: json
pipelines: - inputRefs: [ application ] outputRefs: myFluentd parse: json
pipelines:
- inputRefs: [ application ]
  outputRefs: myFluentd
  parse: json
				When you enable parsing JSON logs by using parse: json, the CR copies the JSON-structured log entry in a structured field, as shown in the following example. This does not modify the original message field.
			
Example structured output containing the structured JSON log entry
{"structured": { "level": "info", "name": "fred", "home": "bedrock" },
 "more fields..."}
{"structured": { "level": "info", "name": "fred", "home": "bedrock" },
 "more fields..."}
					If the log entry does not contain valid structured JSON, the structured field will be absent.
				
To enable parsing JSON logs for specific logging platforms, see Forwarding logs to third-party systems.
8.2. Configuring JSON log data for Elasticsearch
				If your JSON logs follow more than one schema, storing them in a single index might cause type conflicts and cardinality problems. To avoid that, you must configure the ClusterLogForwarder custom resource (CR) to group each schema into a single output definition. This way, each schema is forwarded to a separate index.
			
If you forward JSON logs to the default Elasticsearch instance managed by OpenShift Logging, it generates new indices based on your configuration. To avoid performance issues associated with having too many indices, consider keeping the number of possible schemas low by standardizing to common schemas.
Structure types
					You can use the following structure types in the ClusterLogForwarder CR to construct index names for the Elasticsearch log store:
				
- structuredTypeKey(string, optional) is the name of a message field. The value of that field, if present, is used to construct the index name.- 
								kubernetes.labels.<key>is the Kubernetes pod label whose value is used to construct the index name.
- 
								openshift.labels.<key>is thepipeline.label.<key>element in theClusterLogForwarderCR whose value is used to construct the index name.
- 
								kubernetes.container_nameuses the container name to construct the index name.
 
- 
								
- 
						structuredTypeName: (string, optional) IfstructuredTypeKeyis not set or its key is not present, OpenShift Logging uses the value ofstructuredTypeNameas the structured type. When you use bothstructuredTypeKeyandstructuredTypeNametogether,structuredTypeNameprovides a fallback index name if the key instructuredTypeKeyis missing from the JSON log data.
					Although you can set the value of structuredTypeKey to any field shown in the "Log Record Fields" topic, the most useful fields are shown in the preceding list of structure types.
				
A structuredTypeKey: kubernetes.labels.<key> example
Suppose the following:
- Your cluster is running application pods that produce JSON logs in two different formats, "apache" and "google".
- 
						The user labels these application pods with logFormat=apacheandlogFormat=google.
- 
						You use the following snippet in your ClusterLogForwarderCR YAML file.
				In that case, the following structured log record goes to the app-apache-write index:
			
{
  "structured":{"name":"fred","home":"bedrock"},
  "kubernetes":{"labels":{"logFormat": "apache", ...}}
}
{
  "structured":{"name":"fred","home":"bedrock"},
  "kubernetes":{"labels":{"logFormat": "apache", ...}}
}
				And the following structured log record goes to the app-google-write index:
			
{
  "structured":{"name":"wilma","home":"bedrock"},
  "kubernetes":{"labels":{"logFormat": "google", ...}}
}
{
  "structured":{"name":"wilma","home":"bedrock"},
  "kubernetes":{"labels":{"logFormat": "google", ...}}
}A structuredTypeKey: openshift.labels.<key> example
					Suppose that you use the following snippet in your ClusterLogForwarder CR YAML file.
				
				In that case, the following structured log record goes to the app-myValue-write index:
			
{
  "structured":{"name":"fred","home":"bedrock"},
  "openshift":{"labels":{"myLabel": "myValue", ...}}
}
{
  "structured":{"name":"fred","home":"bedrock"},
  "openshift":{"labels":{"myLabel": "myValue", ...}}
}Additional considerations
- The Elasticsearch index for structured records is formed by prepending "app-" to the structured type and appending "-write".
- Unstructured records are not sent to the structured index. They are indexed as usual in the application, infrastructure, or audit indices.
- 
						If there is no non-empty structured type, forward an unstructured record with no structuredfield.
				It is important not to overload Elasticsearch with too many indices. Only use distinct structured types for distinct log formats, not for each application or namespace. For example, most Apache applications use the same JSON log format and structured type, such as LogApache.
			
8.3. Forwarding JSON logs to the Elasticsearch log store
				For an Elasticsearch log store, if your JSON log entries follow different schemas, configure the ClusterLogForwarder custom resource (CR) to group each JSON schema into a single output definition. This way, Elasticsearch uses a separate index for each schema.
			
Because forwarding different schemas to the same index can cause type conflicts and cardinality problems, you must perform this configuration before you forward data to the Elasticsearch store.
To avoid performance issues associated with having too many indices, consider keeping the number of possible schemas low by standardizing to common schemas.
Procedure
- Add the following snippet to your - ClusterLogForwarderCR YAML file.- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- 
						Optional: Use structuredTypeKeyto specify one of the log record fields, as described in the preceding topic, Configuring JSON log data for Elasticsearch. Otherwise, remove this line.
- Optional: Use - structuredTypeNameto specify a- <name>, as described in the preceding topic, Configuring JSON log data for Elasticsearch. Otherwise, remove this line.Important- To parse JSON logs, you must set either - structuredTypeKeyor- structuredTypeName, or both- structuredTypeKeyand- structuredTypeName.
- 
						For inputRefs, specify which log types should be forwarded using that pipeline, such asapplication,infrastructure, oraudit.
- 
						Add the parse: jsonelement to pipelines.
- Create the CR object: - oc create -f <file-name>.yaml - $ oc create -f <file-name>.yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - The Red Hat OpenShift Logging Operator redeploys the Fluentd pods. However, if they do not redeploy, delete the Fluentd pods to force them to redeploy. - oc delete pod --selector logging-infra=fluentd - $ oc delete pod --selector logging-infra=fluentd- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
Chapter 9. Collecting and storing Kubernetes events
The OpenShift Container Platform Event Router is a pod that watches Kubernetes events and logs them for collection by OpenShift Logging. You must manually deploy the Event Router.
			The Event Router collects events from all projects and writes them to STDOUT. Fluentd collects those events and forwards them into the OpenShift Container Platform Elasticsearch instance. Elasticsearch indexes the events to the infra index.
		
The Event Router adds additional load to Fluentd and can impact the number of other log messages that can be processed.
9.1. Deploying and configuring the Event Router
				Use the following steps to deploy the Event Router into your cluster. You should always deploy the Event Router to the openshift-logging project to ensure it collects events from across the cluster.
			
The following Template object creates the service account, cluster role, and cluster role binding required for the Event Router. The template also configures and deploys the Event Router pod. You can use this template without making changes, or change the deployment object CPU and memory requests.
Prerequisites
- You need proper permissions to create service accounts and update cluster role bindings. For example, you can run the following template with a user that has the cluster-admin role.
- OpenShift Logging must be installed.
Procedure
- Create a template for the Event Router: - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- Creates a Service Account in theopenshift-loggingproject for the Event Router.
- 2
- Creates a ClusterRole to monitor for events in the cluster.
- 3
- Creates a ClusterRoleBinding to bind the ClusterRole to the service account.
- 4
- Creates a config map in theopenshift-loggingproject to generate the requiredconfig.jsonfile.
- 5
- Creates a deployment in theopenshift-loggingproject to generate and configure the Event Router pod.
- 6
- Specifies the image, identified by a tag such asv0.3.
- 7
- Specifies the minimum amount of memory to allocate to the Event Router pod. Defaults to128Mi.
- 8
- Specifies the minimum amount of CPU to allocate to the Event Router pod. Defaults to100m.
- 9
- Specifies theopenshift-loggingproject to install objects in.
 
- Use the following command to process and apply the template: - oc process -f <templatefile> | oc apply -n openshift-logging -f - - $ oc process -f <templatefile> | oc apply -n openshift-logging -f -- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - For example: - oc process -f eventrouter.yaml | oc apply -n openshift-logging -f - - $ oc process -f eventrouter.yaml | oc apply -n openshift-logging -f -- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - serviceaccount/eventrouter created clusterrole.authorization.openshift.io/event-reader created clusterrolebinding.authorization.openshift.io/event-reader-binding created configmap/eventrouter created deployment.apps/eventrouter created - serviceaccount/eventrouter created clusterrole.authorization.openshift.io/event-reader created clusterrolebinding.authorization.openshift.io/event-reader-binding created configmap/eventrouter created deployment.apps/eventrouter created- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Validate that the Event Router installed in the - openshift-loggingproject:- View the new Event Router pod: - oc get pods --selector component=eventrouter -o name -n openshift-logging - $ oc get pods --selector component=eventrouter -o name -n openshift-logging- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - pod/cluster-logging-eventrouter-d649f97c8-qvv8r - pod/cluster-logging-eventrouter-d649f97c8-qvv8r- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- View the events collected by the Event Router: - oc logs <cluster_logging_eventrouter_pod> -n openshift-logging - $ oc logs <cluster_logging_eventrouter_pod> -n openshift-logging- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - For example: - oc logs cluster-logging-eventrouter-d649f97c8-qvv8r -n openshift-logging - $ oc logs cluster-logging-eventrouter-d649f97c8-qvv8r -n openshift-logging- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - {"verb":"ADDED","event":{"metadata":{"name":"openshift-service-catalog-controller-manager-remover.1632d931e88fcd8f","namespace":"openshift-service-catalog-removed","selfLink":"/api/v1/namespaces/openshift-service-catalog-removed/events/openshift-service-catalog-controller-manager-remover.1632d931e88fcd8f","uid":"787d7b26-3d2f-4017-b0b0-420db4ae62c0","resourceVersion":"21399","creationTimestamp":"2020-09-08T15:40:26Z"},"involvedObject":{"kind":"Job","namespace":"openshift-service-catalog-removed","name":"openshift-service-catalog-controller-manager-remover","uid":"fac9f479-4ad5-4a57-8adc-cb25d3d9cf8f","apiVersion":"batch/v1","resourceVersion":"21280"},"reason":"Completed","message":"Job completed","source":{"component":"job-controller"},"firstTimestamp":"2020-09-08T15:40:26Z","lastTimestamp":"2020-09-08T15:40:26Z","count":1,"type":"Normal"}}- {"verb":"ADDED","event":{"metadata":{"name":"openshift-service-catalog-controller-manager-remover.1632d931e88fcd8f","namespace":"openshift-service-catalog-removed","selfLink":"/api/v1/namespaces/openshift-service-catalog-removed/events/openshift-service-catalog-controller-manager-remover.1632d931e88fcd8f","uid":"787d7b26-3d2f-4017-b0b0-420db4ae62c0","resourceVersion":"21399","creationTimestamp":"2020-09-08T15:40:26Z"},"involvedObject":{"kind":"Job","namespace":"openshift-service-catalog-removed","name":"openshift-service-catalog-controller-manager-remover","uid":"fac9f479-4ad5-4a57-8adc-cb25d3d9cf8f","apiVersion":"batch/v1","resourceVersion":"21280"},"reason":"Completed","message":"Job completed","source":{"component":"job-controller"},"firstTimestamp":"2020-09-08T15:40:26Z","lastTimestamp":"2020-09-08T15:40:26Z","count":1,"type":"Normal"}}- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - You can also use Kibana to view events by creating an index pattern using the Elasticsearch - infraindex.
 
Chapter 10. Updating OpenShift Logging
| 4.7 | 4.8 | 4.9 | |
|---|---|---|---|
| RHOL 5.1 | X | X | |
| RHOL 5.2 | X | X | X | 
| RHOL 5.3 | X | X | 
To upgrade from cluster logging in OpenShift Container Platform 4.6 and earlier to OpenShift Logging 5.x, you update the OpenShift Container Platform cluster to 4.7 or 4.8. Then, you update the following operators:
- From Elasticsearch Operator 4.x to OpenShift Elasticsearch Operator 5.x
- From Cluster Logging Operator 4.x to Red Hat OpenShift Logging Operator 5.x
To upgrade from a previous version of OpenShift Logging to the current version, you update OpenShift Elasticsearch Operator and Red Hat OpenShift Logging Operator to their current versions.
10.1. Updating from cluster logging in OpenShift Container Platform 4.6 or earlier to OpenShift Logging 5.x
OpenShift Container Platform 4.7 made the following name changes:
- The cluster logging feature became the Red Hat OpenShift Logging 5.x product.
- The Cluster Logging Operator became the Red Hat OpenShift Logging Operator.
- The Elasticsearch Operator became OpenShift Elasticsearch Operator.
To upgrade from cluster logging in OpenShift Container Platform 4.6 and earlier to OpenShift Logging 5.x, you update the OpenShift Container Platform cluster to 4.7 or 4.8. Then, you update the following operators:
- From Elasticsearch Operator 4.x to OpenShift Elasticsearch Operator 5.x
- From Cluster Logging Operator 4.x to Red Hat OpenShift Logging Operator 5.x
You must update the OpenShift Elasticsearch Operator before you update the Red Hat OpenShift Logging Operator. You must also update both Operators to the same version.
If you update the operators in the wrong order, Kibana does not update and the Kibana custom resource (CR) is not created. To work around this problem, you delete the Red Hat OpenShift Logging Operator pod. When the Red Hat OpenShift Logging Operator pod redeploys, it creates the Kibana CR and Kibana becomes available again.
Prerequisites
- The OpenShift Container Platform version is 4.7 or later.
- The OpenShift Logging status is healthy: - 
								All pods are ready.
- The Elasticsearch cluster is healthy.
 
- 
								All pods are 
- Your Elasticsearch and Kibana data is backed up.
Procedure
- Update the OpenShift Elasticsearch Operator: - From the web console, click Operators → Installed Operators.
- 
								Select the openshift-operators-redhatproject.
- Click the OpenShift Elasticsearch Operator.
- Click Subscription → Channel.
- In the Change Subscription Update Channel window, select 5.0 or stable-5.1 and click Save.
- Wait for a few seconds, then click Operators → Installed Operators. - Verify that the OpenShift Elasticsearch Operator version is 5.x.x. - Wait for the Status field to report Succeeded. 
 
- Update the Cluster Logging Operator: - From the web console, click Operators → Installed Operators.
- 
								Select the openshift-loggingproject.
- Click the Cluster Logging Operator.
- Click Subscription → Channel.
- In the Change Subscription Update Channel window, select 5.0 or stable-5.1 and click Save.
- Wait for a few seconds, then click Operators → Installed Operators. - Verify that the Red Hat OpenShift Logging Operator version is 5.0.x or 5.1.x. - Wait for the Status field to report Succeeded. 
 
- Check the logging components: - Ensure that all Elasticsearch pods are in the Ready status: - oc get pod -n openshift-logging --selector component=elasticsearch - $ oc get pod -n openshift-logging --selector component=elasticsearch- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - NAME READY STATUS RESTARTS AGE elasticsearch-cdm-1pbrl44l-1-55b7546f4c-mshhk 2/2 Running 0 31m elasticsearch-cdm-1pbrl44l-2-5c6d87589f-gx5hk 2/2 Running 0 30m elasticsearch-cdm-1pbrl44l-3-88df5d47-m45jc 2/2 Running 0 29m - NAME READY STATUS RESTARTS AGE elasticsearch-cdm-1pbrl44l-1-55b7546f4c-mshhk 2/2 Running 0 31m elasticsearch-cdm-1pbrl44l-2-5c6d87589f-gx5hk 2/2 Running 0 30m elasticsearch-cdm-1pbrl44l-3-88df5d47-m45jc 2/2 Running 0 29m- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Ensure that the Elasticsearch cluster is healthy: - oc exec -n openshift-logging -c elasticsearch elasticsearch-cdm-1pbrl44l-1-55b7546f4c-mshhk -- health - $ oc exec -n openshift-logging -c elasticsearch elasticsearch-cdm-1pbrl44l-1-55b7546f4c-mshhk -- health- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - { "cluster_name" : "elasticsearch", "status" : "green", }- { "cluster_name" : "elasticsearch", "status" : "green", }- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Ensure that the Elasticsearch cron jobs are created: - oc project openshift-logging - $ oc project openshift-logging- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - oc get cronjob - $ oc get cronjob- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - NAME SCHEDULE SUSPEND ACTIVE LAST SCHEDULE AGE elasticsearch-im-app */15 * * * * False 0 <none> 56s elasticsearch-im-audit */15 * * * * False 0 <none> 56s elasticsearch-im-infra */15 * * * * False 0 <none> 56s - NAME SCHEDULE SUSPEND ACTIVE LAST SCHEDULE AGE elasticsearch-im-app */15 * * * * False 0 <none> 56s elasticsearch-im-audit */15 * * * * False 0 <none> 56s elasticsearch-im-infra */15 * * * * False 0 <none> 56s- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Verify that the log store is updated to 5.0 or 5.1 and the indices are - green:- oc exec -c elasticsearch <any_es_pod_in_the_cluster> -- indices - $ oc exec -c elasticsearch <any_es_pod_in_the_cluster> -- indices- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Verify that the output includes the - app-00000x,- infra-00000x,- audit-00000x,- .securityindices.- Example 10.1. Sample output with indices in a green status - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Verify that the log collector is updated to 5.0 or 5.1: - oc get ds fluentd -o json | grep fluentd-init - $ oc get ds fluentd -o json | grep fluentd-init- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Verify that the output includes a - fluentd-initcontainer:- "containerName": "fluentd-init" - "containerName": "fluentd-init"- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Verify that the log visualizer is updated to 5.0 or 5.1 using the Kibana CRD: - oc get kibana kibana -o json - $ oc get kibana kibana -o json- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Verify that the output includes a Kibana pod with the - readystatus:- Example 10.2. Sample output with a ready Kibana pod - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
 
10.2. Updating OpenShift Logging to the current version
To update OpenShift Logging from 5.x to the current version, you change the subscriptions for the OpenShift Elasticsearch Operator and Red Hat OpenShift Logging Operator.
You must update the OpenShift Elasticsearch Operator before you update the Red Hat OpenShift Logging Operator. You must also update both Operators to the same version.
If you update the operators in the wrong order, Kibana does not update and the Kibana custom resource (CR) is not created. To work around this problem, you delete the Red Hat OpenShift Logging Operator pod. When the Red Hat OpenShift Logging Operator pod redeploys, it creates the Kibana CR and Kibana becomes available again.
Prerequisites
- The OpenShift Container Platform version is 4.7 or later.
- The OpenShift Logging status is healthy: - 
								All pods are ready.
- The Elasticsearch cluster is healthy.
 
- 
								All pods are 
- Your Elasticsearch and Kibana data is backed up.
Procedure
- Update the OpenShift Elasticsearch Operator: - From the web console, click Operators → Installed Operators.
- 
								Select the openshift-operators-redhatproject.
- Click the OpenShift Elasticsearch Operator.
- Click Subscription → Channel.
- In the Change Subscription Update Channel window, select stable-5.x and click Save.
- Wait for a few seconds, then click Operators → Installed Operators. - Verify that the OpenShift Elasticsearch Operator version is 5.x.x. - Wait for the Status field to report Succeeded. 
 
- Update the Red Hat OpenShift Logging Operator: - From the web console, click Operators → Installed Operators.
- 
								Select the openshift-loggingproject.
- Click the Red Hat OpenShift Logging Operator.
- Click Subscription → Channel.
- In the Change Subscription Update Channel window, select stable-5.x and click Save.
- Wait for a few seconds, then click Operators → Installed Operators. - Verify that the Red Hat OpenShift Logging Operator version is 5.x.x. - Wait for the Status field to report Succeeded. 
 
- Check the logging components: - Ensure that all Elasticsearch pods are in the Ready status: - oc get pod -n openshift-logging --selector component=elasticsearch - $ oc get pod -n openshift-logging --selector component=elasticsearch- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - NAME READY STATUS RESTARTS AGE elasticsearch-cdm-1pbrl44l-1-55b7546f4c-mshhk 2/2 Running 0 31m elasticsearch-cdm-1pbrl44l-2-5c6d87589f-gx5hk 2/2 Running 0 30m elasticsearch-cdm-1pbrl44l-3-88df5d47-m45jc 2/2 Running 0 29m - NAME READY STATUS RESTARTS AGE elasticsearch-cdm-1pbrl44l-1-55b7546f4c-mshhk 2/2 Running 0 31m elasticsearch-cdm-1pbrl44l-2-5c6d87589f-gx5hk 2/2 Running 0 30m elasticsearch-cdm-1pbrl44l-3-88df5d47-m45jc 2/2 Running 0 29m- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Ensure that the Elasticsearch cluster is healthy: - oc exec -n openshift-logging -c elasticsearch elasticsearch-cdm-1pbrl44l-1-55b7546f4c-mshhk -- health - $ oc exec -n openshift-logging -c elasticsearch elasticsearch-cdm-1pbrl44l-1-55b7546f4c-mshhk -- health- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - { "cluster_name" : "elasticsearch", "status" : "green", }- { "cluster_name" : "elasticsearch", "status" : "green", }- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Ensure that the Elasticsearch cron jobs are created: - oc project openshift-logging - $ oc project openshift-logging- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - oc get cronjob - $ oc get cronjob- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - NAME SCHEDULE SUSPEND ACTIVE LAST SCHEDULE AGE elasticsearch-im-app */15 * * * * False 0 <none> 56s elasticsearch-im-audit */15 * * * * False 0 <none> 56s elasticsearch-im-infra */15 * * * * False 0 <none> 56s - NAME SCHEDULE SUSPEND ACTIVE LAST SCHEDULE AGE elasticsearch-im-app */15 * * * * False 0 <none> 56s elasticsearch-im-audit */15 * * * * False 0 <none> 56s elasticsearch-im-infra */15 * * * * False 0 <none> 56s- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Verify that the log store is updated to 5.x and the indices are - green:- oc exec -c elasticsearch <any_es_pod_in_the_cluster> -- indices - $ oc exec -c elasticsearch <any_es_pod_in_the_cluster> -- indices- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Verify that the output includes the - app-00000x,- infra-00000x,- audit-00000x,- .securityindices.- Example 10.3. Sample output with indices in a green status - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Verify that the log collector is updated to 5.x: - oc get ds fluentd -o json | grep fluentd-init - $ oc get ds fluentd -o json | grep fluentd-init- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Verify that the output includes a - fluentd-initcontainer:- "containerName": "fluentd-init" - "containerName": "fluentd-init"- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Verify that the log visualizer is updated to 5.x using the Kibana CRD: - oc get kibana kibana -o json - $ oc get kibana kibana -o json- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Verify that the output includes a Kibana pod with the - readystatus:- Example 10.4. Sample output with a ready Kibana pod - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
 
Chapter 11. Viewing cluster dashboards
The Logging/Elasticsearch Nodes and Openshift Logging dashboards in the OpenShift Container Platform web console show in-depth details about your Elasticsearch instance and the individual Elasticsearch nodes that you can use to prevent and diagnose problems.
The OpenShift Logging dashboard contains charts that show details about your Elasticsearch instance at a cluster level, including cluster resources, garbage collection, shards in the cluster, and Fluentd statistics.
The Logging/Elasticsearch Nodes dashboard contains charts that show details about your Elasticsearch instance, many at node level, including details on indexing, shards, resources, and so forth.
For more detailed data, click the Grafana UI link in a dashboard to launch the Grafana dashboard. Grafana is shipped with OpenShift cluster monitoring.
11.1. Accessing the Elastisearch and OpenShift Logging dashboards
You can view the Logging/Elasticsearch Nodes and OpenShift Logging dashboards in the OpenShift Container Platform web console.
Procedure
To launch the dashboards:
- In the OpenShift Container Platform web console, click Monitoring → Dashboards.
- On the Dashboards page, select Logging/Elasticsearch Nodes or OpenShift Logging from the Dashboard menu. - For the Logging/Elasticsearch Nodes dashboard, you can select the Elasticsearch node you want to view and set the data resolution. - The appropriate dashboard is displayed, showing multiple charts of data. 
- Optionally, select a different time range to display or refresh rate for the data from the Time Range and Refresh Interval menus.
For more detailed data, click the Grafana UI link to launch the Grafana dashboard.
For information on the dashboard charts, see About the OpenShift Logging dashboard and About the Logging/Elastisearch Nodes dashboard.
11.2. About the OpenShift Logging dashboard
The OpenShift Logging dashboard contains charts that show details about your Elasticsearch instance at a cluster-level that you can use to diagnose and anticipate problems.
| Metric | Description | 
|---|---|
| Elastic Cluster Status | The current Elasticsearch status: 
 | 
| Elastic Nodes | The total number of Elasticsearch nodes in the Elasticsearch instance. | 
| Elastic Shards | The total number of Elasticsearch shards in the Elasticsearch instance. | 
| Elastic Documents | The total number of Elasticsearch documents in the Elasticsearch instance. | 
| Total Index Size on Disk | The total disk space that is being used for the Elasticsearch indices. | 
| Elastic Pending Tasks | The total number of Elasticsearch changes that have not been completed, such as index creation, index mapping, shard allocation, or shard failure. | 
| Elastic JVM GC time | The amount of time that the JVM spent executing Elasticsearch garbage collection operations in the cluster. | 
| Elastic JVM GC Rate | The total number of times that JVM executed garbage activities per second. | 
| Elastic Query/Fetch Latency Sum | 
 Fetch latency typically takes less time than query latency. If fetch latency is consistently increasing, it might indicate slow disks, data enrichment, or large requests with too many results. | 
| Elastic Query Rate | The total queries executed against the Elasticsearch instance per second for each Elasticsearch node. | 
| CPU | The amount of CPU used by Elasticsearch, Fluentd, and Kibana, shown for each component. | 
| Elastic JVM Heap Used | The amount of JVM memory used. In a healthy cluster, the graph shows regular drops as memory is freed by JVM garbage collection. | 
| Elasticsearch Disk Usage | The total disk space used by the Elasticsearch instance for each Elasticsearch node. | 
| File Descriptors In Use | The total number of file descriptors used by Elasticsearch, Fluentd, and Kibana. | 
| FluentD emit count | The total number of Fluentd messages per second for the Fluentd default output, and the retry count for the default output. | 
| FluentD Buffer Availability | The percent of the Fluentd buffer that is available for chunks. A full buffer might indicate that Fluentd is not able to process the number of logs received. | 
| Elastic rx bytes | The total number of bytes that Elasticsearch has received from FluentD, the Elasticsearch nodes, and other sources. | 
| Elastic Index Failure Rate | The total number of times per second that an Elasticsearch index fails. A high rate might indicate an issue with indexing. | 
| FluentD Output Error Rate | The total number of times per second that FluentD is not able to output logs. | 
11.3. Charts on the Logging/Elasticsearch nodes dashboard
The Logging/Elasticsearch Nodes dashboard contains charts that show details about your Elasticsearch instance, many at node-level, for further diagnostics.
- Elasticsearch status
- The Logging/Elasticsearch Nodes dashboard contains the following charts about the status of your Elasticsearch instance.
| Metric | Description | 
|---|---|
| Cluster status | The cluster health status during the selected time period, using the Elasticsearch green, yellow, and red statuses: 
 | 
| Cluster nodes | The total number of Elasticsearch nodes in the cluster. | 
| Cluster data nodes | The number of Elasticsearch data nodes in the cluster. | 
| Cluster pending tasks | The number of cluster state changes that are not finished and are waiting in a cluster queue, for example, index creation, index deletion, or shard allocation. A growing trend indicates that the cluster is not able to keep up with changes. | 
- Elasticsearch cluster index shard status
- Each Elasticsearch index is a logical group of one or more shards, which are basic units of persisted data. There are two types of index shards: primary shards, and replica shards. When a document is indexed into an index, it is stored in one of its primary shards and copied into every replica of that shard. The number of primary shards is specified when the index is created, and the number cannot change during index lifetime. You can change the number of replica shards at any time.
The index shard can be in several states depending on its lifecycle phase or events occurring in the cluster. When the shard is able to perform search and indexing requests, the shard is active. If the shard cannot perform these requests, the shard is non–active. A shard might be non-active if the shard is initializing, reallocating, unassigned, and so forth.
Index shards consist of a number of smaller internal blocks, called index segments, which are physical representations of the data. An index segment is a relatively small, immutable Lucene index that is created when Lucene commits newly-indexed data. Lucene, a search library used by Elasticsearch, merges index segments into larger segments in the background to keep the total number of segments low. If the process of merging segments is slower than the speed at which new segments are created, it could indicate a problem.
When Lucene performs data operations, such as a search operation, Lucene performs the operation against the index segments in the relevant index. For that purpose, each segment contains specific data structures that are loaded in the memory and mapped. Index mapping can have a significant impact on the memory used by segment data structures.
The Logging/Elasticsearch Nodes dashboard contains the following charts about the Elasticsearch index shards.
| Metric | Description | 
|---|---|
| Cluster active shards | The number of active primary shards and the total number of shards, including replicas, in the cluster. If the number of shards grows higher, the cluster performance can start degrading. | 
| Cluster initializing shards | The number of non-active shards in the cluster. A non-active shard is one that is initializing, being reallocated to a different node, or is unassigned. A cluster typically has non–active shards for short periods. A growing number of non–active shards over longer periods could indicate a problem. | 
| Cluster relocating shards | The number of shards that Elasticsearch is relocating to a new node. Elasticsearch relocates nodes for multiple reasons, such as high memory use on a node or after a new node is added to the cluster. | 
| Cluster unassigned shards | The number of unassigned shards. Elasticsearch shards might be unassigned for reasons such as a new index being added or the failure of a node. | 
- Elasticsearch node metrics
- Each Elasticsearch node has a finite amount of resources that can be used to process tasks. When all the resources are being used and Elasticsearch attempts to perform a new task, Elasticsearch put the tasks into a queue until some resources become available.
The Logging/Elasticsearch Nodes dashboard contains the following charts about resource usage for a selected node and the number of tasks waiting in the Elasticsearch queue.
| Metric | Description | 
|---|---|
| ThreadPool tasks | The number of waiting tasks in individual queues, shown by task type. A long–term accumulation of tasks in any queue could indicate node resource shortages or some other problem. | 
| CPU usage | The amount of CPU being used by the selected Elasticsearch node as a percentage of the total CPU allocated to the host container. | 
| Memory usage | The amount of memory being used by the selected Elasticsearch node. | 
| Disk usage | The total disk space being used for index data and metadata on the selected Elasticsearch node. | 
| Documents indexing rate | The rate that documents are indexed on the selected Elasticsearch node. | 
| Indexing latency | The time taken to index the documents on the selected Elasticsearch node. Indexing latency can be affected by many factors, such as JVM Heap memory and overall load. A growing latency indicates a resource capacity shortage in the instance. | 
| Search rate | The number of search requests run on the selected Elasticsearch node. | 
| Search latency | The time taken to complete search requests on the selected Elasticsearch node. Search latency can be affected by many factors. A growing latency indicates a resource capacity shortage in the instance. | 
| Documents count (with replicas) | The number of Elasticsearch documents stored on the selected Elasticsearch node, including documents stored in both the primary shards and replica shards that are allocated on the node. | 
| Documents deleting rate | The number of Elasticsearch documents being deleted from any of the index shards that are allocated to the selected Elasticsearch node. | 
| Documents merging rate | The number of Elasticsearch documents being merged in any of index shards that are allocated to the selected Elasticsearch node. | 
- Elasticsearch node fielddata
- Fielddata is an Elasticsearch data structure that holds lists of terms in an index and is kept in the JVM Heap. Because fielddata building is an expensive operation, Elasticsearch caches the fielddata structures. Elasticsearch can evict a fielddata cache when the underlying index segment is deleted or merged, or if there is not enough JVM HEAP memory for all the fielddata caches.
The Logging/Elasticsearch Nodes dashboard contains the following charts about Elasticsearch fielddata.
| Metric | Description | 
|---|---|
| Fielddata memory size | The amount of JVM Heap used for the fielddata cache on the selected Elasticsearch node. | 
| Fielddata evictions | The number of fielddata structures that were deleted from the selected Elasticsearch node. | 
- Elasticsearch node query cache
- If the data stored in the index does not change, search query results are cached in a node-level query cache for reuse by Elasticsearch.
The Logging/Elasticsearch Nodes dashboard contains the following charts about the Elasticsearch node query cache.
| Metric | Description | 
|---|---|
| Query cache size | The total amount of memory used for the query cache for all the shards allocated to the selected Elasticsearch node. | 
| Query cache evictions | The number of query cache evictions on the selected Elasticsearch node. | 
| Query cache hits | The number of query cache hits on the selected Elasticsearch node. | 
| Query cache misses | The number of query cache misses on the selected Elasticsearch node. | 
- Elasticsearch index throttling
- When indexing documents, Elasticsearch stores the documents in index segments, which are physical representations of the data. At the same time, Elasticsearch periodically merges smaller segments into a larger segment as a way to optimize resource use. If the indexing is faster then the ability to merge segments, the merge process does not complete quickly enough, which can lead to issues with searches and performance. To prevent this situation, Elasticsearch throttles indexing, typically by reducing the number of threads allocated to indexing down to a single thread.
The Logging/Elasticsearch Nodes dashboard contains the following charts about Elasticsearch index throttling.
| Metric | Description | 
|---|---|
| Indexing throttling | The amount of time that Elasticsearch has been throttling the indexing operations on the selected Elasticsearch node. | 
| Merging throttling | The amount of time that Elasticsearch has been throttling the segment merge operations on the selected Elasticsearch node. | 
- Node JVM Heap statistics
- The Logging/Elasticsearch Nodes dashboard contains the following charts about JVM Heap operations.
| Metric | Description | 
|---|---|
| Heap used | The amount of the total allocated JVM Heap space that is used on the selected Elasticsearch node. | 
| GC count | The number of garbage collection operations that have been run on the selected Elasticsearch node, by old and young garbage collection. | 
| GC time | The amount of time that the JVM spent running garbage collection operations on the selected Elasticsearch node, by old and young garbage collection. | 
Chapter 12. Troubleshooting Logging
12.1. Viewing OpenShift Logging status
You can view the status of the Red Hat OpenShift Logging Operator and for a number of OpenShift Logging components.
12.1.1. Viewing the status of the Red Hat OpenShift Logging Operator
You can view the status of your Red Hat OpenShift Logging Operator.
Prerequisites
- OpenShift Logging and Elasticsearch must be installed.
Procedure
- Change to the - openshift-loggingproject.- oc project openshift-logging - $ oc project openshift-logging- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- To view the OpenShift Logging status: - Get the OpenShift Logging status: - oc get clusterlogging instance -o yaml - $ oc get clusterlogging instance -o yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
 
12.1.1.1. Example condition messages
						The following are examples of some condition messages from the Status.Nodes section of the OpenShift Logging instance.
					
A status message similar to the following indicates a node has exceeded the configured low watermark and no shard will be allocated to this node:
Example output
A status message similar to the following indicates a node has exceeded the configured high watermark and shards will be relocated to other nodes:
Example output
A status message similar to the following indicates the Elasticsearch node selector in the CR does not match any nodes in the cluster:
Example output
A status message similar to the following indicates that the requested PVC could not bind to PV:
Example output
A status message similar to the following indicates that the Fluentd pods cannot be scheduled because the node selector did not match any nodes:
Example output
12.1.2. Viewing the status of OpenShift Logging components
You can view the status for a number of OpenShift Logging components.
Prerequisites
- OpenShift Logging and Elasticsearch must be installed.
Procedure
- Change to the - openshift-loggingproject.- oc project openshift-logging - $ oc project openshift-logging- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- View the status of the OpenShift Logging environment: - oc describe deployment cluster-logging-operator - $ oc describe deployment cluster-logging-operator- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- View the status of the OpenShift Logging replica set: - Get the name of a replica set: - Example output - oc get replicaset - $ oc get replicaset- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Get the status of the replica set: - oc describe replicaset cluster-logging-operator-574b8987df - $ oc describe replicaset cluster-logging-operator-574b8987df- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
 
12.2. Viewing the status of the log store
You can view the status of the OpenShift Elasticsearch Operator and for a number of Elasticsearch components.
12.2.1. Viewing the status of the log store
You can view the status of your log store.
Prerequisites
- OpenShift Logging and Elasticsearch must be installed.
Procedure
- Change to the - openshift-loggingproject.- oc project openshift-logging - $ oc project openshift-logging- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- To view the status: - Get the name of the log store instance: - oc get Elasticsearch - $ oc get Elasticsearch- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - NAME AGE elasticsearch 5h9m - NAME AGE elasticsearch 5h9m- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Get the log store status: - oc get Elasticsearch <Elasticsearch-instance> -o yaml - $ oc get Elasticsearch <Elasticsearch-instance> -o yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - For example: - oc get Elasticsearch elasticsearch -n openshift-logging -o yaml - $ oc get Elasticsearch elasticsearch -n openshift-logging -o yaml- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - The output includes information similar to the following: - Example output - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - 1
- In the output, the cluster status fields appear in thestatusstanza.
- 2
- The status of the log store:- The number of active primary shards.
- The number of active shards.
- The number of shards that are initializing.
- The number of log store data nodes.
- The total number of log store nodes.
- The number of pending tasks.
- 
													The log store status: green,red,yellow.
- The number of unassigned shards.
 
- 3
- Any status conditions, if present. The log store status indicates the reasons from the scheduler if a pod could not be placed. Any events related to the following conditions are shown:- Container Waiting for both the log store and proxy containers.
- Container Terminated for both the log store and proxy containers.
- Pod unschedulable. Also, a condition is shown for a number of issues; see Example condition messages.
 
- 4
- The log store nodes in the cluster, withupgradeStatus.
- 5
- The log store client, data, and master pods in the cluster, listed under 'failed`,notReady, orreadystate.
 
 
12.2.1.1. Example condition messages
						The following are examples of some condition messages from the Status section of the Elasticsearch instance.
					
The following status message indicates that a node has exceeded the configured low watermark, and no shard will be allocated to this node.
The following status message indicates that a node has exceeded the configured high watermark, and shards will be relocated to other nodes.
The following status message indicates that the log store node selector in the CR does not match any nodes in the cluster:
The following status message indicates that the log store CR uses a non-existent persistent volume claim (PVC).
The following status message indicates that your log store cluster does not have enough nodes to support the redundancy policy.
This status message indicates your cluster has too many control plane nodes (also known as the master nodes):
The following status message indicates that Elasticsearch storage does not support the change you tried to make.
For example:
						The reason and type fields specify the type of unsupported change:
					
- StorageClassNameChangeIgnored
- Unsupported change to the storage class name.
- StorageSizeChangeIgnored
- Unsupported change the storage size.
- StorageStructureChangeIgnored
- Unsupported change between ephemeral and persistent storage structures. Important- If you try to configure the - ClusterLoggingcustom resource (CR) to switch from ephemeral to persistent storage, the OpenShift Elasticsearch Operator creates a persistent volume claim (PVC) but does not create a persistent volume (PV). To clear the- StorageStructureChangeIgnoredstatus, you must revert the change to the- ClusterLoggingCR and delete the PVC.
12.2.2. Viewing the status of the log store components
You can view the status for a number of the log store components.
- Elasticsearch indices
- You can view the status of the Elasticsearch indices. - Get the name of an Elasticsearch pod: - oc get pods --selector component=elasticsearch -o name - $ oc get pods --selector component=elasticsearch -o name- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - pod/elasticsearch-cdm-1godmszn-1-6f8495-vp4lw pod/elasticsearch-cdm-1godmszn-2-5769cf-9ms2n pod/elasticsearch-cdm-1godmszn-3-f66f7d-zqkz7 - pod/elasticsearch-cdm-1godmszn-1-6f8495-vp4lw pod/elasticsearch-cdm-1godmszn-2-5769cf-9ms2n pod/elasticsearch-cdm-1godmszn-3-f66f7d-zqkz7- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Get the status of the indices: - oc exec elasticsearch-cdm-4vjor49p-2-6d4d7db474-q2w7z -- indices - $ oc exec elasticsearch-cdm-4vjor49p-2-6d4d7db474-q2w7z -- indices- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
 
- Log store pods
- You can view the status of the pods that host the log store. - Get the name of a pod: - oc get pods --selector component=elasticsearch -o name - $ oc get pods --selector component=elasticsearch -o name- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - pod/elasticsearch-cdm-1godmszn-1-6f8495-vp4lw pod/elasticsearch-cdm-1godmszn-2-5769cf-9ms2n pod/elasticsearch-cdm-1godmszn-3-f66f7d-zqkz7 - pod/elasticsearch-cdm-1godmszn-1-6f8495-vp4lw pod/elasticsearch-cdm-1godmszn-2-5769cf-9ms2n pod/elasticsearch-cdm-1godmszn-3-f66f7d-zqkz7- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Get the status of a pod: - oc describe pod elasticsearch-cdm-1godmszn-1-6f8495-vp4lw - $ oc describe pod elasticsearch-cdm-1godmszn-1-6f8495-vp4lw- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - The output includes the following status information: - Example output - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
 
- Log storage pod deployment configuration
- You can view the status of the log store deployment configuration. - Get the name of a deployment configuration: - oc get deployment --selector component=elasticsearch -o name - $ oc get deployment --selector component=elasticsearch -o name- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Example output - deployment.extensions/elasticsearch-cdm-1gon-1 deployment.extensions/elasticsearch-cdm-1gon-2 deployment.extensions/elasticsearch-cdm-1gon-3 - deployment.extensions/elasticsearch-cdm-1gon-1 deployment.extensions/elasticsearch-cdm-1gon-2 deployment.extensions/elasticsearch-cdm-1gon-3- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Get the deployment configuration status: - oc describe deployment elasticsearch-cdm-1gon-1 - $ oc describe deployment elasticsearch-cdm-1gon-1- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - The output includes the following status information: - Example output - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
 
- Log store replica set
- You can view the status of the log store replica set. - Get the name of a replica set: - oc get replicaSet --selector component=elasticsearch -o name - $ oc get replicaSet --selector component=elasticsearch -o name replicaset.extensions/elasticsearch-cdm-1gon-1-6f8495 replicaset.extensions/elasticsearch-cdm-1gon-2-5769cf replicaset.extensions/elasticsearch-cdm-1gon-3-f66f7d- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Get the status of the replica set: - oc describe replicaSet elasticsearch-cdm-1gon-1-6f8495 - $ oc describe replicaSet elasticsearch-cdm-1gon-1-6f8495- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - The output includes the following status information: - Example output - Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
 
12.3. Understanding OpenShift Logging alerts
All of the logging collector alerts are listed on the Alerting UI of the OpenShift Container Platform web console.
12.3.1. Viewing logging collector alerts
Alerts are shown in the OpenShift Container Platform web console, on the Alerts tab of the Alerting UI. Alerts are in one of the following states:
- Firing. The alert condition is true for the duration of the timeout. Click the Options menu at the end of the firing alert to view more information or silence the alert.
- Pending The alert condition is currently true, but the timeout has not been reached.
- Not Firing. The alert is not currently triggered.
Procedure
To view OpenShift Logging and other OpenShift Container Platform alerts:
- In the OpenShift Container Platform console, click Monitoring → Alerting.
- Click the Alerts tab. The alerts are listed, based on the filters selected.
12.3.2. About logging collector alerts
The following alerts are generated by the logging collector. You can view these alerts in the OpenShift Container Platform web console, on the Alerts page of the Alerting UI.
| Alert | Message | Description | Severity | 
|---|---|---|---|
| 
									 | 
									 | The number of FluentD output errors is high, by default more than 10 in the previous 15 minutes. | Warning | 
| 
									 | 
									 | Fluentd is reporting that Prometheus could not scrape a specific Fluentd instance. | Critical | 
| 
									 | 
									 | Fluentd is reporting that the queue size is increasing. | Critical | 
| 
									 | 
									 | The number of FluentD output errors is very high, by default more than 25 in the previous 15 minutes. | Critical | 
12.3.3. About Elasticsearch alerting rules
You can view these alerting rules in Prometheus.
| Alert | Description | Severity | 
|---|---|---|
| 
									 | The cluster health status has been RED for at least 2 minutes. The cluster does not accept writes, shards may be missing, or the master node hasn’t been elected yet. | Critical | 
| 
									 | The cluster health status has been YELLOW for at least 20 minutes. Some shard replicas are not allocated. | Warning | 
| 
									 | The cluster is expected to be out of disk space within the next 6 hours. | Critical | 
| 
									 | The cluster is predicted to be out of file descriptors within the next hour. | Warning | 
| 
									 | The JVM Heap usage on the specified node is high. | Alert | 
| 
									 | The specified node has hit the low watermark due to low free disk space. Shards can not be allocated to this node anymore. You should consider adding more disk space to the node. | Info | 
| 
									 | The specified node has hit the high watermark due to low free disk space. Some shards will be re-allocated to different nodes if possible. Make sure more disk space is added to the node or drop old indices allocated to this node. | Warning | 
| 
									 | The specified node has hit the flood watermark due to low free disk space. Every index that has a shard allocated on this node is enforced a read-only block. The index block must be manually released when the disk use falls below the high watermark. | Critical | 
| 
									 | The JVM Heap usage on the specified node is too high. | Alert | 
| 
									 | Elasticsearch is experiencing an increase in write rejections on the specified node. This node might not be keeping up with the indexing speed. | Warning | 
| 
									 | The CPU used by the system on the specified node is too high. | Alert | 
| 
									 | The CPU used by Elasticsearch on the specified node is too high. | Alert | 
12.4. Collecting logging data for Red Hat Support
When opening a support case, it is helpful to provide debugging information about your cluster to Red Hat Support.
				The must-gather tool enables you to collect diagnostic information for project-level resources, cluster-level resources, and each of the OpenShift Logging components.
			
For prompt support, supply diagnostic information for both OpenShift Container Platform and OpenShift Logging.
					Do not use the hack/logging-dump.sh script. The script is no longer supported and does not collect data.
				
12.4.1. About the must-gather tool
					The oc adm must-gather CLI command collects the information from your cluster that is most likely needed for debugging issues.
				
					For your OpenShift Logging environment, must-gather collects the following information:
				
- Project-level resources, including pods, configuration maps, service accounts, roles, role bindings, and events at the project level
- Cluster-level resources, including nodes, roles, and role bindings at the cluster level
- 
							OpenShift Logging resources in the openshift-loggingandopenshift-operators-redhatnamespaces, including health status for the log collector, the log store, and the log visualizer
					When you run oc adm must-gather, a new pod is created on the cluster. The data is collected on that pod and saved in a new directory that starts with must-gather.local. This directory is created in the current working directory.
				
12.4.2. Prerequisites
- OpenShift Logging and Elasticsearch must be installed.
12.4.3. Collecting OpenShift Logging data
					You can use the oc adm must-gather CLI command to collect information about your OpenShift Logging environment.
				
Procedure
						To collect OpenShift Logging information with must-gather:
					
- 
							Navigate to the directory where you want to store the must-gatherinformation.
- Run the - oc adm must-gathercommand against the OpenShift Logging image:- oc adm must-gather --image=$(oc -n openshift-logging get deployment.apps/cluster-logging-operator -o jsonpath='{.spec.template.spec.containers[?(@.name == "cluster-logging-operator")].image}')- $ oc adm must-gather --image=$(oc -n openshift-logging get deployment.apps/cluster-logging-operator -o jsonpath='{.spec.template.spec.containers[?(@.name == "cluster-logging-operator")].image}')- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - The - must-gathertool creates a new directory that starts with- must-gather.localwithin the current directory. For example:- must-gather.local.4157245944708210408.
- Create a compressed file from the - must-gatherdirectory that was just created. For example, on a computer that uses a Linux operating system, run the following command:- tar -cvaf must-gather.tar.gz must-gather.local.4157245944708210408 - $ tar -cvaf must-gather.tar.gz must-gather.local.4157245944708210408- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Attach the compressed file to your support case on the Red Hat Customer Portal.
12.5. Troubleshooting for Critical Alerts
12.5.1. Elasticsearch Cluster Health is Red
At least one primary shard and its replicas are not allocated to a node.
Troubleshooting
- Check the Elasticsearch cluster health and verify that the cluster - statusis red.- oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- health - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- health- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- List the nodes that have joined the cluster. - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=_cat/nodes?v - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=_cat/nodes?v- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- List the Elasticsearch pods and compare them with the nodes in the command output from the previous step. - oc -n openshift-logging get pods -l component=elasticsearch - oc -n openshift-logging get pods -l component=elasticsearch- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- If some of the Elasticsearch nodes have not joined the cluster, perform the following steps. - Confirm that Elasticsearch has an elected control plane node. - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=_cat/master?v - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=_cat/master?v- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Review the pod logs of the elected control plane node for issues. - oc logs <elasticsearch_master_pod_name> -c elasticsearch -n openshift-logging - oc logs <elasticsearch_master_pod_name> -c elasticsearch -n openshift-logging- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Review the logs of nodes that have not joined the cluster for issues. - oc logs <elasticsearch_node_name> -c elasticsearch -n openshift-logging - oc logs <elasticsearch_node_name> -c elasticsearch -n openshift-logging- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
 
- If all the nodes have joined the cluster, perform the following steps, check if the cluster is in the process of recovering. - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=_cat/recovery?active_only=true - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=_cat/recovery?active_only=true- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - If there is no command output, the recovery process might be delayed or stalled by pending tasks. 
- Check if there are pending tasks. - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- health |grep number_of_pending_tasks - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- health |grep number_of_pending_tasks- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- If there are pending tasks, monitor their status. - If their status changes and indicates that the cluster is recovering, continue waiting. The recovery time varies according to the size of the cluster and other factors. - Otherwise, if the status of the pending tasks does not change, this indicates that the recovery has stalled. 
- If it seems like the recovery has stalled, check if - cluster.routing.allocation.enableis set to- none.- oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=_cluster/settings?pretty - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=_cluster/settings?pretty- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- If - cluster.routing.allocation.enableis set to- none, set it to- all.- oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=_cluster/settings?pretty -X PUT -d '{"persistent": {"cluster.routing.allocation.enable":"all"}}'- oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=_cluster/settings?pretty -X PUT -d '{"persistent": {"cluster.routing.allocation.enable":"all"}}'- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Check which indices are still red. - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=_cat/indices?v - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=_cat/indices?v- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- If any indices are still red, try to clear them by performing the following steps. - Clear the cache. - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=<elasticsearch_index_name>/_cache/clear?pretty - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=<elasticsearch_index_name>/_cache/clear?pretty- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Increase the max allocation retries. - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=<elasticsearch_index_name>/_settings?pretty -X PUT -d '{"index.allocation.max_retries":10}'- oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=<elasticsearch_index_name>/_settings?pretty -X PUT -d '{"index.allocation.max_retries":10}'- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Delete all the scroll items. - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=_search/scroll/_all -X DELETE - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=_search/scroll/_all -X DELETE- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Increase the timeout. - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=<elasticsearch_index_name>/_settings?pretty -X PUT -d '{"index.unassigned.node_left.delayed_timeout":"10m"}'- oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=<elasticsearch_index_name>/_settings?pretty -X PUT -d '{"index.unassigned.node_left.delayed_timeout":"10m"}'- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
 
- If the preceding steps do not clear the red indices, delete the indices individually. - Identify the red index name. - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=_cat/indices?v - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=_cat/indices?v- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Delete the red index. - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=<elasticsearch_red_index_name> -X DELETE - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=<elasticsearch_red_index_name> -X DELETE- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
 
- If there are no red indices and the cluster status is red, check for a continuous heavy processing load on a data node. - Check if the Elasticsearch JVM Heap usage is high. - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=_nodes/stats?pretty - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=_nodes/stats?pretty- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - In the command output, review the - node_name.jvm.mem.heap_used_percentfield to determine the JVM Heap usage.
- Check for high CPU utilization.
 
12.5.2. Elasticsearch Cluster Health is Yellow
Replica shards for at least one primary shard are not allocated to nodes.
Troubleshooting
- 
							Increase the node count by adjusting nodeCountin theClusterLoggingCR.
12.5.3. Elasticsearch Node Disk Low Watermark Reached
Elasticsearch does not allocate shards to nodes that reach the low watermark.
Troubleshooting
- Identify the node on which Elasticsearch is deployed. - oc -n openshift-logging get po -o wide - oc -n openshift-logging get po -o wide- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Check if there are - unassigned shards.- oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=_cluster/health?pretty | grep unassigned_shards - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=_cluster/health?pretty | grep unassigned_shards- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- If there are unassigned shards, check the disk space on each node. - for pod in `oc -n openshift-logging get po -l component=elasticsearch -o jsonpath='{.items[*].metadata.name}'`; do echo $pod; oc -n openshift-logging exec -c elasticsearch $pod -- df -h /elasticsearch/persistent; done- for pod in `oc -n openshift-logging get po -l component=elasticsearch -o jsonpath='{.items[*].metadata.name}'`; do echo $pod; oc -n openshift-logging exec -c elasticsearch $pod -- df -h /elasticsearch/persistent; done- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Check the - nodes.node_name.fsfield to determine the free disk space on that node.- If the used disk percentage is above 85%, the node has exceeded the low watermark, and shards can no longer be allocated to this node. 
- Try to increase the disk space on all nodes.
- If increasing the disk space is not possible, try adding a new data node to the cluster.
- If adding a new data node is problematic, decrease the total cluster redundancy policy. - Check the current - redundancyPolicy.- oc -n openshift-logging get es elasticsearch -o jsonpath='{.spec.redundancyPolicy}'- oc -n openshift-logging get es elasticsearch -o jsonpath='{.spec.redundancyPolicy}'- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow Note- If you are using a - ClusterLoggingCR, enter:- oc -n openshift-logging get cl -o jsonpath='{.items[*].spec.logStore.elasticsearch.redundancyPolicy}'- oc -n openshift-logging get cl -o jsonpath='{.items[*].spec.logStore.elasticsearch.redundancyPolicy}'- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- 
									If the cluster redundancyPolicyis higher thanSingleRedundancy, set it toSingleRedundancyand save this change.
 
- If the preceding steps do not fix the issue, delete the old indices. - Check the status of all indices on Elasticsearch. - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- indices - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- indices- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Identify an old index that can be deleted.
- Delete the index. - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=<elasticsearch_index_name> -X DELETE - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=<elasticsearch_index_name> -X DELETE- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
 
12.5.4. Elasticsearch Node Disk High Watermark Reached
Elasticsearch attempts to relocate shards away from a node that has reached the high watermark.
Troubleshooting
- Identify the node on which Elasticsearch is deployed. - oc -n openshift-logging get po -o wide - oc -n openshift-logging get po -o wide- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Check the disk space on each node. - for pod in `oc -n openshift-logging get po -l component=elasticsearch -o jsonpath='{.items[*].metadata.name}'`; do echo $pod; oc -n openshift-logging exec -c elasticsearch $pod -- df -h /elasticsearch/persistent; done- for pod in `oc -n openshift-logging get po -l component=elasticsearch -o jsonpath='{.items[*].metadata.name}'`; do echo $pod; oc -n openshift-logging exec -c elasticsearch $pod -- df -h /elasticsearch/persistent; done- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Check if the cluster is rebalancing. - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=_cluster/health?pretty | grep relocating_shards - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=_cluster/health?pretty | grep relocating_shards- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - If the command output shows relocating shards, the High Watermark has been exceeded. The default value of the High Watermark is 90%. - The shards relocate to a node with low disk usage that has not crossed any watermark threshold limits. 
- To allocate shards to a particular node, free up some space.
- Try to increase the disk space on all nodes.
- If increasing the disk space is not possible, try adding a new data node to the cluster.
- If adding a new data node is problematic, decrease the total cluster redundancy policy. - Check the current - redundancyPolicy.- oc -n openshift-logging get es elasticsearch -o jsonpath='{.spec.redundancyPolicy}'- oc -n openshift-logging get es elasticsearch -o jsonpath='{.spec.redundancyPolicy}'- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow Note- If you are using a - ClusterLoggingCR, enter:- oc -n openshift-logging get cl -o jsonpath='{.items[*].spec.logStore.elasticsearch.redundancyPolicy}'- oc -n openshift-logging get cl -o jsonpath='{.items[*].spec.logStore.elasticsearch.redundancyPolicy}'- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- 
									If the cluster redundancyPolicyis higher thanSingleRedundancy, set it toSingleRedundancyand save this change.
 
- If the preceding steps do not fix the issue, delete the old indices. - Check the status of all indices on Elasticsearch. - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- indices - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- indices- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Identify an old index that can be deleted.
- Delete the index. - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=<elasticsearch_index_name> -X DELETE - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=<elasticsearch_index_name> -X DELETE- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
 
12.5.5. Elasticsearch Node Disk Flood Watermark Reached
Elasticsearch enforces a read-only index block on every index that has both of these conditions:
- One or more shards are allocated to the node.
- One or more disks exceed the flood stage.
Troubleshooting
- Check the disk space of the Elasticsearch node. - for pod in `oc -n openshift-logging get po -l component=elasticsearch -o jsonpath='{.items[*].metadata.name}'`; do echo $pod; oc -n openshift-logging exec -c elasticsearch $pod -- df -h /elasticsearch/persistent; done- for pod in `oc -n openshift-logging get po -l component=elasticsearch -o jsonpath='{.items[*].metadata.name}'`; do echo $pod; oc -n openshift-logging exec -c elasticsearch $pod -- df -h /elasticsearch/persistent; done- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow - Check the - nodes.node_name.fsfield to determine the free disk space on that node.
- If the used disk percentage is above 95%, it signifies that the node has crossed the flood watermark. Writing is blocked for shards allocated on this particular node.
- Try to increase the disk space on all nodes.
- If increasing the disk space is not possible, try adding a new data node to the cluster.
- If adding a new data node is problematic, decrease the total cluster redundancy policy. - Check the current - redundancyPolicy.- oc -n openshift-logging get es elasticsearch -o jsonpath='{.spec.redundancyPolicy}'- oc -n openshift-logging get es elasticsearch -o jsonpath='{.spec.redundancyPolicy}'- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow Note- If you are using a - ClusterLoggingCR, enter:- oc -n openshift-logging get cl -o jsonpath='{.items[*].spec.logStore.elasticsearch.redundancyPolicy}'- oc -n openshift-logging get cl -o jsonpath='{.items[*].spec.logStore.elasticsearch.redundancyPolicy}'- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- 
									If the cluster redundancyPolicyis higher thanSingleRedundancy, set it toSingleRedundancyand save this change.
 
- If the preceding steps do not fix the issue, delete the old indices. - Check the status of all indices on Elasticsearch. - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- indices - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- indices- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Identify an old index that can be deleted.
- Delete the index. - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=<elasticsearch_index_name> -X DELETE - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=<elasticsearch_index_name> -X DELETE- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
 
- Continue freeing up and monitoring the disk space until the used disk space drops below 90%. Then, unblock write to this particular node. - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=_all/_settings?pretty -X PUT -d '{"index.blocks.read_only_allow_delete": null}'- oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=_all/_settings?pretty -X PUT -d '{"index.blocks.read_only_allow_delete": null}'- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
12.5.6. Elasticsearch JVM Heap Use is High
The Elasticsearch node JVM Heap memory used is above 75%.
Troubleshooting
Consider increasing the heap size.
12.5.7. Aggregated Logging System CPU is High
System CPU usage on the node is high.
Troubleshooting
Check the CPU of the cluster node. Consider allocating more CPU resources to the node.
12.5.8. Elasticsearch Process CPU is High
Elasticsearch process CPU usage on the node is high.
Troubleshooting
Check the CPU of the cluster node. Consider allocating more CPU resources to the node.
12.5.9. Elasticsearch Disk Space is Running Low
The Elasticsearch Cluster is predicted to be out of disk space within the next 6 hours based on current disk usage.
Troubleshooting
- Get the disk space of the Elasticsearch node. - for pod in `oc -n openshift-logging get po -l component=elasticsearch -o jsonpath='{.items[*].metadata.name}'`; do echo $pod; oc -n openshift-logging exec -c elasticsearch $pod -- df -h /elasticsearch/persistent; done- for pod in `oc -n openshift-logging get po -l component=elasticsearch -o jsonpath='{.items[*].metadata.name}'`; do echo $pod; oc -n openshift-logging exec -c elasticsearch $pod -- df -h /elasticsearch/persistent; done- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- 
							In the command output, check the nodes.node_name.fsfield to determine the free disk space on that node.
- Try to increase the disk space on all nodes.
- If increasing the disk space is not possible, try adding a new data node to the cluster.
- If adding a new data node is problematic, decrease the total cluster redundancy policy. - Check the current - redundancyPolicy.- oc -n openshift-logging get es elasticsearch -o jsonpath='{.spec.redundancyPolicy}'- oc -n openshift-logging get es elasticsearch -o jsonpath='{.spec.redundancyPolicy}'- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow Note- If you are using a - ClusterLoggingCR, enter:- oc -n openshift-logging get cl -o jsonpath='{.items[*].spec.logStore.elasticsearch.redundancyPolicy}'- oc -n openshift-logging get cl -o jsonpath='{.items[*].spec.logStore.elasticsearch.redundancyPolicy}'- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- 
									If the cluster redundancyPolicyis higher thanSingleRedundancy, set it toSingleRedundancyand save this change.
 
- If the preceding steps do not fix the issue, delete the old indices. - Check the status of all indices on Elasticsearch. - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- indices - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- indices- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
- Identify an old index that can be deleted.
- Delete the index. - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=<elasticsearch_index_name> -X DELETE - oc exec -n openshift-logging -c elasticsearch <elasticsearch_pod_name> -- es_util --query=<elasticsearch_index_name> -X DELETE- Copy to Clipboard Copied! - Toggle word wrap Toggle overflow 
 
12.5.10. Elasticsearch FileDescriptor Usage is high
Based on current usage trends, the predicted number of file descriptors on the node is insufficient.
Troubleshooting
						Check and, if needed, configure the value of max_file_descriptors for each node, as described in the Elasticsearch File descriptors topic.
					
Chapter 13. Uninstalling OpenShift Logging
You can remove OpenShift Logging from your OpenShift Container Platform cluster.
13.1. Uninstalling OpenShift Logging from OpenShift Container Platform
				You can stop log aggregation by deleting the ClusterLogging custom resource (CR). After deleting the CR, there are other OpenShift Logging components that remain, which you can optionally remove.
			
				Deleting the ClusterLogging CR does not remove the persistent volume claims (PVCs). To preserve or delete the remaining PVCs, persistent volumes (PVs), and associated data, you must take further action.
			
Prerequisites
- OpenShift Logging and Elasticsearch must be installed.
Procedure
To remove OpenShift Logging:
- Use the OpenShift Container Platform web console to remove the - ClusterLoggingCR:- Switch to the Administration → Custom Resource Definitions page.
- On the Custom Resource Definitions page, click ClusterLogging.
- On the Custom Resource Definition Details page, click Instances.
- 
								Click the Options menu 
								 next to the instance and select Delete ClusterLogging. next to the instance and select Delete ClusterLogging.
 
- Optional: Delete the custom resource definitions (CRD): - Switch to the Administration → Custom Resource Definitions page.
- 
								Click the Options menu 
								 next to ClusterLogForwarder and select Delete Custom Resource Definition. next to ClusterLogForwarder and select Delete Custom Resource Definition.
- 
								Click the Options menu 
								 next to ClusterLogging and select Delete Custom Resource Definition. next to ClusterLogging and select Delete Custom Resource Definition.
- 
								Click the Options menu 
								 next to Elasticsearch and select Delete Custom Resource Definition. next to Elasticsearch and select Delete Custom Resource Definition.
 
- Optional: Remove the Red Hat OpenShift Logging Operator and OpenShift Elasticsearch Operator: - Switch to the Operators → Installed Operators page.
- 
								Click the Options menu 
								 next to the Red Hat OpenShift Logging Operator and select Uninstall Operator. next to the Red Hat OpenShift Logging Operator and select Uninstall Operator.
- 
								Click the Options menu 
								 next to the OpenShift Elasticsearch Operator and select Uninstall Operator. next to the OpenShift Elasticsearch Operator and select Uninstall Operator.
 
- Optional: Remove the OpenShift Logging and Elasticsearch projects. - Switch to the Home → Projects page.
- 
								Click the Options menu 
								 next to the openshift-logging project and select Delete Project. next to the openshift-logging project and select Delete Project.
- 
								Confirm the deletion by typing openshift-loggingin the dialog box and click Delete.
- Click the Options menu  next to the openshift-operators-redhat project and select Delete Project.
							Important next to the openshift-operators-redhat project and select Delete Project.
							Important- Do not delete the - openshift-operators-redhatproject if other global operators are installed in this namespace.
- 
								Confirm the deletion by typing openshift-operators-redhatin the dialog box and click Delete.
 
- To keep the PVCs for reuse with other pods, keep the labels or PVC names that you need to reclaim the PVCs.
- Optional: If you do not want to keep the PVCs, you can delete them. Warning- Releasing or deleting PVCs can delete PVs and cause data loss. - Switch to the Storage → Persistent Volume Claims page.
- 
								Click the Options menu 
								 next to each PVC and select Delete Persistent Volume Claim. next to each PVC and select Delete Persistent Volume Claim.
- If you want to recover storage space, you can delete the PVs.
 
Chapter 14. Log Record Fields
The following fields can be present in log records exported by OpenShift Logging. Although log records are typically formatted as JSON objects, the same data model can be applied to other encodings.
			To search these fields from Elasticsearch and Kibana, use the full dotted field name when searching. For example, with an Elasticsearch /_search URL, to look for a Kubernetes pod name, use /_search/q=kubernetes.pod_name:name-of-my-pod.
		
The top level fields may be present in every record.
Chapter 15. message
			The original log entry text, UTF-8 encoded. This field may be absent or empty if a non-empty structured field is present. See the description of structured for more.
		
| Data type | text | 
| Example value | 
							 | 
Chapter 16. structured
			Original log entry as a structured object. This field may be present if the forwarder was configured to parse structured JSON logs. If the original log entry was a valid structured log, this field will contain an equivalent JSON structure. Otherwise this field will be empty or absent, and the message field will contain the original log message. The structured field can have any subfields that are included in the log message, there are no restrictions defined here.
		
| Data type | group | 
| Example value | map[message:starting fluentd worker pid=21631 ppid=21618 worker=0 pid:21631 ppid:21618 worker:0] | 
Chapter 17. @timestamp
A UTC value that marks when the log payload was created or, if the creation time is not known, when the log payload was first collected. The “@” prefix denotes a field that is reserved for a particular use. By default, most tools look for “@timestamp” with ElasticSearch.
| Data type | date | 
| Example value | 
							 | 
Chapter 18. hostname
			The name of the host where this log message originated. In a Kubernetes cluster, this is the same as kubernetes.host.
		
| Data type | keyword | 
Chapter 19. ipaddr4
The IPv4 address of the source server. Can be an array.
| Data type | ip | 
Chapter 20. ipaddr6
The IPv6 address of the source server, if available. Can be an array.
| Data type | ip | 
Chapter 21. level
			The logging level from various sources, including rsyslog(severitytext property), a Python logging module, and others.
		
			The following values come from syslog.h, and are preceded by their numeric equivalents:
		
- 
					0=emerg, system is unusable.
- 
					1=alert, action must be taken immediately.
- 
					2=crit, critical conditions.
- 
					3=err, error conditions.
- 
					4=warn, warning conditions.
- 
					5=notice, normal but significant condition.
- 
					6=info, informational.
- 
					7=debug, debug-level messages.
			The two following values are not part of syslog.h but are widely used:
		
- 
					8=trace, trace-level messages, which are more verbose thandebugmessages.
- 
					9=unknown, when the logging system gets a value it doesn’t recognize.
			Map the log levels or priorities of other logging systems to their nearest match in the preceding list. For example, from python logging, you can match CRITICAL with crit, ERROR with err, and so on.
		
| Data type | keyword | 
| Example value | 
							 | 
Chapter 22. pid
The process ID of the logging entity, if available.
| Data type | keyword | 
Chapter 23. service
			The name of the service associated with the logging entity, if available. For example, syslog’s APP-NAME and rsyslog’s programname properties are mapped to the service field.
		
| Data type | keyword | 
Chapter 24. tags
Optional. An operator-defined list of tags placed on each log by the collector or normalizer. The payload can be a string with whitespace-delimited string tokens or a JSON list of string tokens.
| Data type | text | 
Chapter 25. file
			The path to the log file from which the collector reads this log entry. Normally, this is a path in the /var/log file system of a cluster node.
		
| Data type | text | 
Chapter 26. offset
The offset value. Can represent bytes to the start of the log line in the file (zero- or one-based), or log line numbers (zero- or one-based), so long as the values are strictly monotonically increasing in the context of a single log file. The values are allowed to wrap, representing a new version of the log file (rotation).
| Data type | long | 
Chapter 27. kubernetes
The namespace for Kubernetes-specific metadata
| Data type | group | 
27.1. kubernetes.pod_name
The name of the pod
| Data type | keyword | 
27.2. kubernetes.pod_id
The Kubernetes ID of the pod
| Data type | keyword | 
27.3. kubernetes.namespace_name
The name of the namespace in Kubernetes
| Data type | keyword | 
27.4. kubernetes.namespace_id
The ID of the namespace in Kubernetes
| Data type | keyword | 
27.5. kubernetes.host
The Kubernetes node name
| Data type | keyword | 
27.6. kubernetes.container_name
The name of the container in Kubernetes
| Data type | keyword | 
27.7. kubernetes.annotations
Annotations associated with the Kubernetes object
| Data type | group | 
27.8. kubernetes.labels
Labels present on the original Kubernetes Pod
| Data type | group | 
27.9. kubernetes.event
				The Kubernetes event obtained from the Kubernetes master API. This event description loosely follows type Event in Event v1 core.
			
| Data type | group | 
27.9.1. kubernetes.event.verb
					The type of event, ADDED, MODIFIED, or DELETED
				
| Data type | keyword | 
| Example value | 
									 | 
27.9.2. kubernetes.event.metadata
Information related to the location and time of the event creation
| Data type | group | 
27.9.2.1. kubernetes.event.metadata.name
The name of the object that triggered the event creation
| Data type | keyword | 
| Example value | 
										 | 
27.9.2.2. kubernetes.event.metadata.namespace
						The name of the namespace where the event originally occurred. Note that it differs from kubernetes.namespace_name, which is the namespace where the eventrouter application is deployed.
					
| Data type | keyword | 
| Example value | 
										 | 
27.9.2.3. kubernetes.event.metadata.selfLink
A link to the event
| Data type | keyword | 
| Example value | 
										 | 
27.9.2.4. kubernetes.event.metadata.uid
The unique ID of the event
| Data type | keyword | 
| Example value | 
										 | 
27.9.2.5. kubernetes.event.metadata.resourceVersion
A string that identifies the server’s internal version of the event. Clients can use this string to determine when objects have changed.
| Data type | integer | 
| Example value | 
										 | 
27.9.3. kubernetes.event.involvedObject
The object that the event is about.
| Data type | group | 
27.9.3.1. kubernetes.event.involvedObject.kind
The type of object
| Data type | keyword | 
| Example value | 
										 | 
27.9.3.2. kubernetes.event.involvedObject.namespace
						The namespace name of the involved object. Note that it may differ from kubernetes.namespace_name, which is the namespace where the eventrouter application is deployed.
					
| Data type | keyword | 
| Example value | 
										 | 
27.9.3.3. kubernetes.event.involvedObject.name
The name of the object that triggered the event
| Data type | keyword | 
| Example value | 
										 | 
27.9.3.4. kubernetes.event.involvedObject.uid
The unique ID of the object
| Data type | keyword | 
| Example value | 
										 | 
27.9.3.5. kubernetes.event.involvedObject.apiVersion
The version of kubernetes master API
| Data type | keyword | 
| Example value | 
										 | 
27.9.3.6. kubernetes.event.involvedObject.resourceVersion
A string that identifies the server’s internal version of the pod that triggered the event. Clients can use this string to determine when objects have changed.
| Data type | keyword | 
| Example value | 
										 | 
27.9.4. kubernetes.event.reason
A short machine-understandable string that gives the reason for generating this event
| Data type | keyword | 
| Example value | 
									 | 
27.9.5. kubernetes.event.source_component
The component that reported this event
| Data type | keyword | 
| Example value | 
									 | 
27.9.6. kubernetes.event.firstTimestamp
The time at which the event was first recorded
| Data type | date | 
| Example value | 
									 | 
27.9.7. kubernetes.event.count
The number of times this event has occurred
| Data type | integer | 
| Example value | 
									 | 
27.9.8. kubernetes.event.type
					The type of event, Normal or Warning. New types could be added in the future.
				
| Data type | keyword | 
| Example value | 
									 | 
Chapter 28. OpenShift
The namespace for openshift-logging specific metadata
| Data type | group | 
28.1. openshift.labels
Labels added by the Cluster Log Forwarder configuration
| Data type | group | 
        Legal Notice
        
          
            
          
        
      
 
Copyright © 2025 Red Hat
OpenShift documentation is licensed under the Apache License 2.0 (https://www.apache.org/licenses/LICENSE-2.0).
Modified versions must remove all Red Hat trademarks.
Portions adapted from https://github.com/kubernetes-incubator/service-catalog/ with modifications by Red Hat.
Red Hat, Red Hat Enterprise Linux, the Red Hat logo, the Shadowman logo, JBoss, OpenShift, Fedora, the Infinity logo, and RHCE are trademarks of Red Hat, Inc., registered in the United States and other countries.
Linux® is the registered trademark of Linus Torvalds in the United States and other countries.
Java® is a registered trademark of Oracle and/or its affiliates.
XFS® is a trademark of Silicon Graphics International Corp. or its subsidiaries in the United States and/or other countries.
MySQL® is a registered trademark of MySQL AB in the United States, the European Union and other countries.
Node.js® is an official trademark of Joyent. Red Hat Software Collections is not formally related to or endorsed by the official Joyent Node.js open source or commercial project.
The OpenStack® Word Mark and OpenStack logo are either registered trademarks/service marks or trademarks/service marks of the OpenStack Foundation, in the United States and other countries and are used with the OpenStack Foundation’s permission. We are not affiliated with, endorsed or sponsored by the OpenStack Foundation, or the OpenStack community.
All other trademarks are the property of their respective owners.