Managing overcloud observability
Tracking physical and virtual resources, and collecting metrics
Making open source more inclusive
Red Hat is committed to replacing problematic language in our code, documentation, and web properties. We are beginning with these four terms: master, slave, blacklist, and whitelist. Because of the enormity of this endeavor, these changes will be implemented gradually over several upcoming releases. For more details, see our CTO Chris Wright’s message.
Providing feedback on Red Hat documentation
We appreciate your input on our documentation. Tell us how we can make it better.
Providing documentation feedback in Jira
Use the Create Issue form to provide feedback on the documentation for Red Hat OpenStack Services on OpenShift (RHOSO) or earlier releases of Red Hat OpenStack Platform (RHOSP). When you create an issue for RHOSO or RHOSP documents, the issue is recorded in the RHOSO Jira project, where you can track the progress of your feedback.
To complete the Create Issue form, ensure that you are logged in to Jira. If you do not have a Red Hat Jira account, you can create an account at https://issues.redhat.com.
- Click the following link to open a Create Issue page: Create Issue
- Complete the Summary and Description fields. In the Description field, include the documentation URL, chapter or section number, and a detailed description of the issue. Do not modify any other fields in the form.
- Click Create.
Chapter 1. Introduction to operational measurements
You can use observability components such as Ceilometer, collectd, and the logging service to collect data from your Red Hat OpenStack Platform (RHOSP) environment. You can store the data that you collect in Gnocchi for the autoscaling use case, or you can use the metrics_qdr service to forward the data to Service Telemetry Framework (STF).
For more information about autoscaling, see Auto Scaling for Instances.
For more information about STF, see Service Telemetry Framework 1.5.
1.1. Observability architecture
Red Hat OpenStack Platform (RHOSP) Observability provides user-level usage data for OpenStack-based clouds. You can configure observability components to collect data from notifications sent by existing RHOSP components, such as Compute usage events, or by polling RHOSP infrastructure resources, such as libvirt. Ceilometer publishes collected data to various targets such as data stores and message queues, including Service Telemetry Framework (STF).
Observability consists of the following components:
- Data collection: Observability uses Ceilometer to gather metric and event data. For more information, see Section 1.2.1, “Ceilometer”.
- Storage: Observability stores metric data in Gnocchi. For more information, see Section 1.3, “Storage with Gnocchi”.
- Alarm service: Observability uses the Alarming service (Aodh) to trigger actions based on defined rules against metric or event data collected by Ceilometer.
After you collect the data, you can use a third-party tool to display and analyze metric data, and you can use the Alarming service to configure alarms for events.
Figure 1.1. Observability architecture
1.1.1. Support status of monitoring components
Use this table to view the support status of monitoring components in Red Hat OpenStack Platform (RHOSP).
Component | Fully supported since | Deprecated in | Removed since | Note
---|---|---|---|---
Aodh | RHOSP 9 | RHOSP 15 | | Supported for the autoscaling use case.
Ceilometer | RHOSP 4 | | | Supported for collection of metrics and events for RHOSP in the autoscaling and Service Telemetry Framework (STF) use cases.
Collectd | RHOSP 11 | RHOSP 17.1 | | Supported for collection of infrastructure metrics for STF.
Gnocchi | RHOSP 9 | RHOSP 15 | | Supported for storage of metrics for the autoscaling use case.
Panko | RHOSP 11 | RHOSP 12, not installed by default since RHOSP 14 | RHOSP 17.0 |
QDR | RHOSP 13 | RHOSP 17.1 | | Supported for transmission of metrics and events data from RHOSP to STF.
1.2. Data collection in Red Hat OpenStack Platform
Red Hat OpenStack Platform (RHOSP) supports two types of data collection:
- Ceilometer for the RHOSP component-level monitoring. For more information, see Section 1.2.1, “Ceilometer”.
- collectd for infrastructure monitoring. For more information, see Section 1.2.2, “collectd”.
1.2.1. Ceilometer
Ceilometer is the default data collection component of Red Hat OpenStack Platform (RHOSP). It normalizes and transforms data across all of the current RHOSP core components, and collects metering and event data relating to RHOSP services.
The Ceilometer service uses three agents to collect data from Red Hat OpenStack Platform (RHOSP) components:
- A compute agent (ceilometer-agent-compute): Runs on each Compute node and polls for resource utilization statistics. This agent is the polling agent ceilometer-polling running with the parameter --polling-namespaces compute.
- A central agent (ceilometer-agent-central): Runs on a central management server to poll for resource utilization statistics for resources that are not tied to instances or Compute nodes. You can start multiple agents to scale services horizontally. This is the polling agent ceilometer-polling running with the parameter --polling-namespaces central.
- A notification agent (ceilometer-agent-notification): Runs on a central management server and consumes messages from the message queues to build event and metering data. Data publishes to defined targets. Gnocchi is the default target. These services use the RHOSP notification bus to communicate.
The Ceilometer agents use publishers to send data to the corresponding endpoints, for example Gnocchi or AMQP version 1 (QDR).
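For example, the following is a minimal sketch of a Ceilometer pipeline.yaml sink that publishes metering data to Gnocchi. The source and sink names are illustrative, and your deployed pipeline file might differ:

# Sketch of a Ceilometer pipeline (names are illustrative):
# the source matches all meters and routes them to a sink
# whose publisher sends the data to Gnocchi.
sources:
  - name: meter_source
    meters:
      - "*"
    sinks:
      - meter_sink
sinks:
  - name: meter_sink
    publishers:
      - gnocchi://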
1.2.2. collectd
Collectd is another data collection agent that you can use to provide infrastructure metrics. It repeatedly pulls data from configured sources. You can forward metrics to Service Telemetry Framework (STF) to store and visualize the data.
1.3. Storage with Gnocchi
Gnocchi is an open-source time-series database. You can use Gnocchi to store metrics and to provide access to metrics and resources to operators and users. Gnocchi uses an archive policy to define which aggregations to compute and how many aggregates to retain, and an indexer driver to store the index of all resources, archive policies, and metrics.
The use of Gnocchi in Red Hat OpenStack Platform (RHOSP) is supported for the autoscaling use case. For more information about autoscaling, see Auto Scaling for Instances.
1.3.1. Archive policies: Storing both short and long-term data in a time-series database
An archive policy defines which aggregations to compute and how many aggregates to retain. Gnocchi supports different aggregation methods, such as minimum, maximum, average, Nth percentile, and standard deviation. These aggregations are computed over a period of time called granularity and retained for a specific timespan.
The archive policy defines how the metrics are aggregated and for how long they are stored. Each archive policy is defined as the number of points over a timespan.
For example, if your archive policy defines a policy of 10 points with a granularity of 1 second, the time-series archive keeps up to 10 points, each representing an aggregation over 1 second. This means that the time series retains, at a maximum, 10 seconds of data between the most recent point and the oldest point.
The archive policy also defines which aggregation methods are used. The default is set by the default_aggregation_methods parameter, whose values are set to mean, min, max, sum, std, count by default. So, depending on the use case, the archive policy and the granularity vary.
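For example, the following command is a sketch of how you might create the 10-point, 1-second policy described above with the metric CLI; the policy name example-10s is illustrative:

$ openstack metric archive-policy create example-10s \
  --definition granularity:1s,points:10 \
  --aggregation-method mean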
Additional resources
- For more information about archive policies, see Section 2.3, “Planning and managing archive policies”.
1.3.2. Indexer driver
The indexer is responsible for storing the index of all resources, archive policies, and metrics along with their definitions, types, and properties. It is also responsible for linking resources with metrics. Red Hat OpenStack Platform director installs the indexer driver by default. You need a database to index all the resources and metrics that Gnocchi handles. The supported driver is MySQL.
1.3.3. Gnocchi terminology
This table contains definitions of the commonly used terms for Gnocchi features.
Term | Definition |
---|---|
Aggregation method | A function used to aggregate multiple measures into an aggregate. For example, the min aggregation method aggregates the values of different measures to the minimum value of all the measures in the time range. |
Aggregate | A data point tuple generated from several measures according to the archive policy. An aggregate is composed of a timestamp and a value. |
Archive policy | An aggregate storage policy attached to a metric. An archive policy determines how long aggregates are kept in a metric and how aggregates are aggregated (the aggregation method). |
Granularity | The time between two aggregates in an aggregated time series of a metric. |
Measure | An incoming data point tuple sent to the Time series database by the API. A measure is composed of a timestamp and a value. |
Metric | An entity storing aggregates identified by a UUID. A metric can be attached to a resource using a name. How a metric stores its aggregates is defined by the archive policy to which the metric is associated.
Resource | An entity representing anything in your infrastructure that you associate a metric with. A resource is identified by a unique ID and can contain attributes. |
Time series | A list of aggregates ordered by time. |
Timespan | The time period for which a metric keeps its aggregates. It is used in the context of archive policy. |
Chapter 2. Planning for operational measurements
You can use Ceilometer or collectd to collect telemetry data for autoscaling or Service Telemetry Framework (STF).
2.1. Collectd measurements
The following are the default collectd measurements:
- cpu
- disk free
- disk usage
- hugepages
- interface
- load
- memory
- unixsock
- uptime
2.2. Planning for data storage
Gnocchi stores a collection of data points, where each data point is an aggregate. The storage format is compressed using different techniques. As a result, to calculate the size of a time-series database, you must estimate the size based on the worst-case scenario.
The use of Red Hat OpenStack Platform (RHOSP) Object Storage (swift) for time series database (Gnocchi) storage is only supported for small and non-production environments.
Procedure
Calculate the number of data points:
number of points = timespan / granularity
For example, if you want to retain a year of data with one-minute resolution, use the formula:
number of data points = (365 days X 24 hours X 60 minutes) / 1 minute
number of data points = 525600
Calculate the size of the time-series database:
size in bytes = number of data points X 8 bytes
If you apply this formula to the example, the result is approximately 4.2 MB:
size in bytes = 525600 points X 8 bytes = 4204800 bytes = 4.2 MB
This value is an estimated storage requirement for a single aggregated time-series database. If your archive policy uses multiple aggregation methods (min, max, mean, sum, std, count), multiply this value by the number of aggregation methods you use.
2.3. Planning and managing archive policies
You can use an archive policy to configure how you aggregate the metrics and for how long you store the metrics in the time-series database. An archive policy is defined as the number of points over a timespan.
If your archive policy defines a policy of 10 points with a granularity of 1 second, the time-series archive keeps up to 10 points, each representing an aggregation over 1 second. This means that the time series retains, at a maximum, 10 seconds of data between the most recent point and the oldest point. The archive policy also defines the aggregation method to use. The default is set by the default_aggregation_methods parameter, where the default values are set to mean, min, max, sum, std, count. So, depending on the use case, the archive policy and the granularity can vary.
To plan an archive policy, ensure that you are familiar with the following concepts:
- Metrics. For more information, see Section 2.3.1, “Metrics”.
- Measures. For more information, see Section 2.3.2, “Creating custom measures”.
2.3.1. Metrics
Gnocchi provides an object type called metric. A metric is anything that you can measure, for example, the CPU usage of a server, the temperature of a room, or the number of bytes sent by a network interface. A metric has the following properties:
- A UUID to identify it
- A name
- The archive policy used to store and aggregate the measures
Additional resources
- For terminology definitions, see Gnocchi Metric-as-a-Service terminology.
2.3.2. Creating custom measures
A measure is an incoming tuple that the API sends to Gnocchi. It consists of a timestamp and a value. You can create your own custom measures.
Procedure
Create a custom measure:
$ openstack metric measures add -m <MEASURE1> -m <MEASURE2> .. -r <RESOURCE_NAME> <METRIC_NAME>
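For example, the following sketch adds two measures to a metric. Measures use the <timestamp>@<value> format; the resource and metric names here are illustrative:

$ openstack metric measures add \
  -m 2015-03-06T14:33:57@43.11 \
  -m 2015-03-06T14:34:12@12.41 \
  -r my-instance cpu_util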
2.3.3. Verifying the metric status
You can use the openstack metric command to verify a successful deployment.
Procedure
Verify the deployment:
(overcloud) [stack@undercloud-0 ~]$ openstack metric status
+-----------------------------------------------------+-------+
| Field                                               | Value |
+-----------------------------------------------------+-------+
| storage/number of metric having measures to process | 0     |
| storage/total number of measures to process         | 0     |
+-----------------------------------------------------+-------+
If there are no error messages, your deployment is successful.
2.3.4. Creating an archive policy
You can create an archive policy to define how you aggregate the metrics and for how long you store the metrics in the time-series database.
Procedure
Create an archive policy. Replace <archive-policy-name> with the name of the policy and replace <aggregation-method> with the method of aggregation.
$ openstack metric archive-policy create <archive-policy-name> \
  --definition <definition> \
  --aggregation-method <aggregation-method>

Note: <definition> is the policy definition. Separate multiple attributes with a comma (,). Separate the name and value of the archive policy definition with a colon (:).
2.3.5. Viewing an archive policy
Use the following steps to examine your archive policies.
Procedure
List the archive policies.
$ openstack metric archive-policy list
View the details of an archive policy:
$ openstack metric archive-policy show <archive-policy-name>
2.3.6. Deleting an archive policy
Use the following step if you want to delete an archive policy.
Procedure
Delete the archive policy. Replace <archive-policy-name> with the name of the policy that you want to delete.
$ openstack metric archive-policy delete <archive-policy-name>
Verification
Check that the archive policy that you deleted is absent from the list of archive policies.
$ openstack metric archive-policy list
2.3.7. Creating an archive policy rule
You can use an archive policy rule to configure the mapping between a metric and an archive policy.
Procedure
Create an archive policy rule. Replace <rule-name> with the name of the rule and replace <archive-policy-name> with the name of the archive policy:
$ openstack metric archive-policy-rule create <rule-name> \
  --archive-policy-name <archive-policy-name>
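For example, the following sketch maps all metrics whose names start with cpu to an archive policy named low. The rule and policy names are illustrative, and the --metric-pattern option selects metrics by wildcard:

$ openstack metric archive-policy-rule create cpu_rule \
  --archive-policy-name low \
  --metric-pattern 'cpu*'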
Chapter 3. Installing and configuring the logs service
You can use log messages for troubleshooting and monitoring system events. The log collection agent Rsyslog collects logs on the client side and sends these log records to a storage system that is separate from your supported Red Hat OpenStack Platform (RHOSP) environment, for example a remote Elasticsearch storage system.
3.1. The log system architecture and components
Monitoring tools use a client-server model, with the client deployed onto the Red Hat OpenStack Platform (RHOSP) overcloud nodes. The Rsyslog service provides client-side logging.
Examples of logs in RHOSP include the following:
- Operating system logs, such as syslog and audit log files.
- Logs from the infrastructure components, such as RabbitMQ and MariaDB.
- Logs from the RHOSP services, such as Identity (keystone) and Compute (nova).
The log files record actions, errors, warnings, and other events. In a distributed environment, collecting the various logs in one location can help you with debugging and administration.
RHOSP director does not deploy the server-side components for logging.
3.2. Enabling logging with Elasticsearch
Elasticsearch is a server-side database that you can use to store logs. To enable the logs service for Elasticsearch, you must authenticate the logs service for Elasticsearch.
The Rsyslog service only uses Elasticsearch as a data store for logging.
Prerequisites
- You have deployed Elasticsearch.
- You have the username, password, and URL of the server.
Procedure
Create a file in your custom templates directory, such as $HOME/custom_templates/logging-connector.yaml, that you can edit to configure the RsyslogElasticsearchSetting parameters for your environment, such as in the following example:

parameter_defaults:
  RsyslogElasticsearchSetting:
    uid: "elastic"
    pwd: "yourownpassword"
    skipverifyhost: "on"
    allowunsignedcerts: "on"
    server: "https://openstack-log-storage.elasticsearch.tld"
    serverport: 443
Add the file paths of the logging-environment-rsyslog.yaml and the logging-connector.yaml environment files to the overcloud deploy command:

$ openstack overcloud deploy \
  <overcloud_environment_files> \
  -e <filepath>/logging-environment-rsyslog.yaml \
  -e $HOME/custom_templates/logging-connector.yaml
- Replace <overcloud_environment_files> with the list of environment files in your existing deployment.
- Replace <filepath> with the file path to your logging-environment-rsyslog.yaml file, for example /usr/share/openstack-tripleo-heat-templates/environments/.
3.3. Configurable logging parameters
This table contains descriptions of logging parameters that you use to configure logging features in Red Hat OpenStack Platform (RHOSP). You can find these parameters in the /usr/share/openstack-tripleo-heat-templates/deployment/logging/rsyslog-container-puppet.yaml file.
Parameter | Description
---|---
RsyslogElasticsearchSetting | Configuration for the rsyslog-elasticsearch plugin.
RsyslogElasticsearchTlsCACert | Contains the content of the CA cert for the CA that issued the Elasticsearch server cert.
RsyslogElasticsearchTlsClientCert | Contains the content of the client cert for doing client cert authorization against Elasticsearch.
RsyslogElasticsearchTlsClientKey | Contains the content of the private key corresponding to the cert RsyslogElasticsearchTlsClientCert.
RsyslogReopenOnTruncate | Prompts rsyslog to reopen the input file when the file size on disk is less than the current offset in memory.
RsyslogMaxMessageSize | Sets the limit for the size of the log message.
3.4. Overriding the default path for a log file
If you modify the default containers to include the path to the service log file, you must also modify the default log file path. Every composable service has a <service_name>LoggingSource
parameter. For example, for the nova-compute
service, the parameter is NovaComputeLoggingSource
.
Procedure
To override the default path for the nova-compute service, add the path to the NovaComputeLoggingSource parameter in your configuration file:

NovaComputeLoggingSource:
  tag: openstack.nova.compute
  file: <filepath>/nova-compute.log

- Replace <filepath> with the file path to your nova-compute.log file.
- Ensure that you define the values of the tag and file parameters for the service. You can use the default values for the other parameters.
You can modify the format for a specific service. The format passes to the Rsyslog configuration. The following example shows the basic syntax:
<service_name>LoggingSource:
  tag: <service_name>.tag
  path: <service_name>.path
  format: <service_name>.format
The following example shows a more complex transformation:
ServiceLoggingSource:
  tag: openstack.Service
  path: /var/log/containers/service/service.log
  format: multiline
  format_firstline: '/^\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}.\d{3} \d+ \S+ \S+ \[(req-\S+ \S+ \S+ \S+ \S+ \S+|-)\]/'
  format1: '/^(?<Timestamp>\S+ \S+) (?<Pid>\d+) (?<log_level>\S+) (?<python_module>\S+) (\[(req-(?<request_id>\S+) (?<user_id>\S+) (?<tenant_id>\S+) (?<domain_id>\S+) (?<user_domain>\S+) (?<project_domain>\S+)|-)\])? (?<Payload>.*)?$/'
If you enabled centralized logging, you can use the following definition in a custom template to forward additional log files, for example, /var/log/messages:
parameter_defaults:
  ExtraConfig:
    tripleo_logging_sources_messages:
      - tag: openstack.host.messages
        file: /var/log/host/messages
        startmsg.regex: "^[a-zA-Z]{3} [ 1-9][0-9] [:0-9]{8}"
3.5. Modifying the format of a log record
You can modify the format of the start of the log record for a specific service. This passes directly to the Rsyslog configuration.
The default format for the Red Hat OpenStack Platform (RHOSP) log record is ('^[0-9]{4}-[0-9]{2}-[0-9]{2} [0-9]{2}:[0-9]{2}:[0-9]{2}(.[0-9]+ [0-9]+)? (DEBUG|INFO|WARNING|ERROR) ').
Procedure
To add a different regular expression for parsing the start of log records, add startmsg.regex to the configuration:

NovaComputeLoggingSource:
  tag: openstack.nova.compute
  file: /some/other/path/nova-compute.log
  startmsg.regex: `^[0-9]{4}-[0-9]{2}-[0-9]{2} [0-9]{2}:[0-9]{2}:[0-9]{2}(.[0-9]+ \\+[0-9]+)? [A-Z]+ \\([a-z]+\\)`
3.6. Verifying the connection between Rsyslog and Elasticsearch
On the client side, you can test and verify the communication between Rsyslog and Elasticsearch.
Procedure
- Navigate to the Elasticsearch connection log file, which is /var/log/rsyslog/omelasticsearch.log in the Rsyslog container or /var/log/containers/rsyslog/omelasticsearch.log on the host. If this log file does not exist, or if the log file exists but does not contain logs, there is no connection problem. If the log file is present and contains logs, Rsyslog has not connected successfully.
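As a quick client-side check, you can test for the presence of a non-empty error log file on the host. This is a sketch, and the path might differ in your environment:

# test -s succeeds only if the file exists and is not empty
$ sudo test -s /var/log/containers/rsyslog/omelasticsearch.log \
  && echo "Rsyslog reported Elasticsearch connection errors" \
  || echo "No connection errors logged"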
To test the connection from the server side, view the Elasticsearch logs for connection issues.
3.7. Tracebacks
If you are troubleshooting an issue, you can use a traceback log to diagnose the issue. In log files, tracebacks usually have several lines of information, all relating to the same issue.
Rsyslog provides a regular expression to define how a log record starts. Each log record usually starts with a timestamp and the first line of the traceback is the only line that contains this information. Rsyslog bundles the indented records with the first line and sends them as one log record.
This behavior is configured with the startmsg.regex option in each <service>LoggingSource parameter. The following regular expression is the default value for all <service>LoggingSource parameters in director:

startmsg.regex='^[0-9]{4}-[0-9]{2}-[0-9]{2} [0-9]{2}:[0-9]{2}:[0-9]{2}(.[0-9]+ [0-9]+)? (DEBUG|INFO|WARNING|ERROR) '

When this default does not match the log records of your added or modified LoggingSource, you must change startmsg.regex accordingly.
3.8. Location of log files for Red Hat OpenStack Platform services
Each Red Hat OpenStack Platform (RHOSP) component has a separate logging directory containing files specific to a running service.
3.8.1. Bare Metal Provisioning (ironic) log files
Service | Service name | Log path |
---|---|---|
OpenStack Ironic API | openstack-ironic-api.service | /var/log/containers/ironic/ironic-api.log |
OpenStack Ironic Conductor | openstack-ironic-conductor.service | /var/log/containers/ironic/ironic-conductor.log |
3.8.2. Block Storage (cinder) log files
Service | Service name | Log path |
---|---|---|
Block Storage API | openstack-cinder-api.service | /var/log/containers/cinder/cinder-api.log |
Block Storage Backup | openstack-cinder-backup.service | /var/log/containers/cinder/backup.log |
Informational messages | The cinder-manage command | /var/log/containers/cinder/cinder-manage.log |
Block Storage Scheduler | openstack-cinder-scheduler.service | /var/log/containers/cinder/scheduler.log |
Block Storage Volume | openstack-cinder-volume.service | /var/log/containers/cinder/volume.log |
3.8.3. Compute (nova) log files
Service | Service name | Log path |
---|---|---|
OpenStack Compute API service | openstack-nova-api.service | /var/log/containers/nova/nova-api.log |
OpenStack Compute certificate server | openstack-nova-cert.service | /var/log/containers/nova/nova-cert.log |
OpenStack Compute service | openstack-nova-compute.service | /var/log/containers/nova/nova-compute.log |
OpenStack Compute Conductor service | openstack-nova-conductor.service | /var/log/containers/nova/nova-conductor.log |
OpenStack Compute VNC console authentication server | openstack-nova-consoleauth.service | /var/log/containers/nova/nova-consoleauth.log |
Informational messages | nova-manage command | /var/log/containers/nova/nova-manage.log |
OpenStack Compute NoVNC Proxy service | openstack-nova-novncproxy.service | /var/log/containers/nova/nova-novncproxy.log |
OpenStack Compute Scheduler service | openstack-nova-scheduler.service | /var/log/containers/nova/nova-scheduler.log |
3.8.4. Dashboard (horizon) log files
Service | Service name | Log path |
---|---|---|
Log of certain user interactions | Dashboard interface | /var/log/containers/horizon/horizon.log |
The Apache HTTP server uses additional log files for the Dashboard web interface and for the API services that you access by using a web browser or command-line client, for example, keystone and nova. You can track Dashboard usage and diagnose faults with the following log files:
Purpose | Log path |
---|---|
All processed HTTP requests | /var/log/containers/httpd/horizon_access.log |
HTTP errors | /var/log/containers/httpd/horizon_error.log |
Admin-role API requests | /var/log/containers/httpd/keystone_wsgi_admin_access.log |
Admin-role API errors | /var/log/containers/httpd/keystone_wsgi_admin_error.log |
Member-role API requests | /var/log/containers/httpd/keystone_wsgi_main_access.log |
Member-role API errors | /var/log/containers/httpd/keystone_wsgi_main_error.log |
There is also /var/log/containers/httpd/default_error.log, which stores errors reported by other web services that are running on the same host.
3.8.5. Identity Service (keystone) log files
Service | Service name | Log Path |
---|---|---|
OpenStack Identity Service | openstack-keystone.service | /var/log/containers/keystone/keystone.log |
3.8.6. Image Service (glance) log files
Service | Service name | Log path |
---|---|---|
OpenStack Image Service API server | openstack-glance-api.service | /var/log/containers/glance/api.log |
OpenStack Image Service Registry server | openstack-glance-registry.service | /var/log/containers/glance/registry.log |
3.8.7. Networking (neutron) log files
Service | Service name | Log path |
---|---|---|
OpenStack Neutron DHCP Agent | neutron-dhcp-agent.service | /var/log/containers/neutron/dhcp-agent.log |
OpenStack Networking Layer 3 Agent | neutron-l3-agent.service | /var/log/containers/neutron/l3-agent.log |
Metadata agent service | neutron-metadata-agent.service | /var/log/containers/neutron/metadata-agent.log |
Metadata namespace proxy | n/a | /var/log/containers/neutron/neutron-ns-metadata-proxy-UUID.log |
Open vSwitch agent | neutron-openvswitch-agent.service | /var/log/containers/neutron/openvswitch-agent.log |
OpenStack Networking service | neutron-server.service | /var/log/containers/neutron/server.log |
3.8.8. Object Storage (swift) log files
OpenStack Object Storage sends logs to the system logging facility only.
By default, all Object Storage log files go to /var/log/containers/swift/swift.log, using the local0, local1, and local2 syslog facilities.
The Object Storage log messages are either from REST API services or background daemons.
- The API service messages contain one line per API request. The front-end and back-end services both post messages.
- The daemon messages contain human-readable information about daemon tasks. The source identity is always at the beginning of the line.
Here is an example of a proxy message:
Apr 20 15:20:34 rhev-a24c-01 proxy-server: 127.0.0.1 127.0.0.1 20/Apr/2015/19/20/34 GET /v1/AUTH_zaitcev%3Fformat%3Djson%26marker%3Dtestcont HTTP/1.0 200 - python-swiftclient-2.1.0 AUTH_tk737d6... - 2 - txc454fa8ea4844d909820a-0055355182 - 0.0162 - - 1429557634.806570053 1429557634.822791100
Here is an example of daemon messages:
Apr 27 17:08:15 rhev-a24c-02 object-auditor: Object audit (ZBF). Since Mon Apr 27 21:08:15 2015: Locally: 1 passed, 0 quarantined, 0 errors files/sec: 4.34 , bytes/sec: 0.00, Total time: 0.23, Auditing time: 0.00, Rate: 0.00
Apr 27 17:08:16 rhev-a24c-02 object-auditor: Object audit (ZBF) "forever" mode completed: 0.56s. Total quarantined: 0, Total errors: 0, Total files/sec: 14.31, Total bytes/sec: 0.00, Auditing time: 0.02, Rate: 0.04
Apr 27 17:08:16 rhev-a24c-02 account-replicator: Beginning replication run
Apr 27 17:08:16 rhev-a24c-02 account-replicator: Replication run OVER
Apr 27 17:08:16 rhev-a24c-02 account-replicator: Attempted to replicate 5 dbs in 0.12589 seconds (39.71876/s)
Apr 27 17:08:16 rhev-a24c-02 account-replicator: Removed 0 dbs
Apr 27 17:08:16 rhev-a24c-02 account-replicator: 10 successes, 0 failures
3.8.9. Orchestration (heat) log files
Service | Service name | Log path |
---|---|---|
OpenStack Heat API Service | openstack-heat-api.service | /var/log/containers/heat/heat-api.log |
OpenStack Heat Engine Service | openstack-heat-engine.service | /var/log/containers/heat/heat-engine.log |
Orchestration service events | n/a | /var/log/containers/heat/heat-manage.log |
3.8.11. Telemetry (ceilometer) log files
Service | Service name | Log path |
---|---|---|
OpenStack ceilometer notification agent | ceilometer_agent_notification | /var/log/containers/ceilometer/agent-notification.log |
OpenStack ceilometer central agent | ceilometer_agent_central | /var/log/containers/ceilometer/central.log |
OpenStack ceilometer collection | openstack-ceilometer-collector.service | /var/log/containers/ceilometer/collector.log |
OpenStack ceilometer compute agent | ceilometer_agent_compute | /var/log/containers/ceilometer/compute.log |
3.8.12. Log files for supporting services
The following services are used by the core RHOSP components and have their own log directories and files.
Service | Service name | Log path
---|---|---
Message broker (RabbitMQ) | rabbitmq-server.service | /var/log/rabbitmq/rabbit@short_hostname.log
Database server (MariaDB) | mariadb.service | /var/log/mariadb/mariadb.log
Virtual network switch (Open vSwitch) | openvswitch-nonetwork.service | /var/log/openvswitch/ovsdb-server.log
3.8.13. aodh (alarming service) log files
Service | Container name | Log path |
---|---|---|
Alarming API | aodh_api | /var/log/containers/httpd/aodh-api/aodh_wsgi_access.log |
Alarm evaluator log | aodh_evaluator | /var/log/containers/aodh/aodh-evaluator.log |
Alarm listener | aodh_listener | /var/log/containers/aodh/aodh-listener.log |
Alarm notification | aodh_notifier | /var/log/containers/aodh/aodh-notifier.log |
3.8.14. gnocchi (metric storage) log files
Service | Container name | Log path |
---|---|---|
Gnocchi API | gnocchi_api | /var/log/containers/httpd/gnocchi-api/gnocchi_wsgi_access.log |
Gnocchi metricd | gnocchi_metricd | /var/log/containers/gnocchi/gnocchi-metricd.log |
Gnocchi statsd | gnocchi_statsd | /var/log/containers/gnocchi/gnocchi-statsd.log |
Chapter 4. collectd plugins
You can configure multiple collectd plugins depending on your Red Hat OpenStack Platform (RHOSP) environment.
The following list of plugins shows the available heat template ExtraConfig parameters that you can set to override the default values. Each section provides the general configuration name for the ExtraConfig option. For example, if there is a collectd plugin called example_plugin, the format of the plugin title is collectd::plugin::example_plugin.
Reference the tables of available parameters for specific plugins, such as in the following example:
ExtraConfig:
  collectd::plugin::example_plugin::<parameter>: <value>
Reference the metrics tables of specific plugins for Prometheus or Grafana queries.
4.1. collectd::plugin::aggregation
You can aggregate several values into one with the aggregation plugin. Use aggregation functions such as sum, average, min, and max to calculate metrics, for example average and total CPU statistics.
Parameter | Type |
---|---|
host | String |
plugin | String |
plugininstance | Integer |
agg_type | String |
typeinstance | String |
sethost | String |
setplugin | String |
setplugininstance | Integer |
settypeinstance | String |
groupby | Array of Strings |
calculatesum | Boolean |
calculatenum | Boolean |
calculateaverage | Boolean |
calculateminimum | Boolean |
calculatemaximum | Boolean |
calculatestddev | Boolean |
Example configuration:
Deploy three aggregate configurations to create the following files:
- aggregator-calcCpuLoadAvg.conf: average CPU load for all CPU cores grouped by host and state
- aggregator-calcCpuLoadMinMax.conf: minimum and maximum CPU load grouped by host and state
- aggregator-calcMemoryTotalMaxAvg.conf: maximum, average, and total for memory grouped by type

The aggregation configurations use the default cpu and memory plugin configurations.
parameter_defaults:
  CollectdExtraPlugins:
    - aggregation
  ExtraConfig:
    collectd::plugin::aggregation::aggregators:
      calcCpuLoadAvg:
        plugin: "cpu"
        agg_type: "cpu"
        groupby:
          - "Host"
          - "TypeInstance"
        calculateaverage: True
      calcCpuLoadMinMax:
        plugin: "cpu"
        agg_type: "cpu"
        groupby:
          - "Host"
          - "TypeInstance"
        calculatemaximum: True
        calculateminimum: True
      calcMemoryTotalMaxAvg:
        plugin: "memory"
        agg_type: "memory"
        groupby:
          - "TypeInstance"
        calculatemaximum: True
        calculateaverage: True
        calculatesum: True
4.2. collectd::plugin::amqp1
Use the amqp1 plugin to write values to an amqp1 message bus, for example, AMQ Interconnect.
Parameter | Type |
---|---|
manage_package | Boolean |
transport | String |
host | String |
port | Integer |
user | String |
password | String |
address | String |
instances | Hash |
retry_delay | Integer |
send_queue_limit | Integer |
interval | Integer |
Use the send_queue_limit parameter to limit the length of the outgoing metrics queue.
If there is no AMQP1 connection, the plugin continues to queue messages to send, which can result in unbounded memory consumption. The default value is 0, which disables the outgoing metrics queue.
Increase the value of the send_queue_limit parameter if metrics are missing.
Example configuration:
parameter_defaults:
  CollectdExtraPlugins:
    - amqp1
  ExtraConfig:
    collectd::plugin::amqp1::send_queue_limit: 5000
4.3. collectd::plugin::apache
Use the apache plugin to collect Apache data from the mod_status plugin that is provided by the Apache web server. Each instance provided has a per-interval value specified in seconds. If you provide the timeout interval parameter for an instance, the value is in milliseconds.
Parameter | Type |
---|---|
instances | Hash |
interval | Integer |
manage_package | Boolean |
package_install_options | List |
Parameter | Type |
---|---|
url | HTTP URL |
user | String |
password | String |
verifypeer | Boolean |
verifyhost | Boolean |
cacert | AbsolutePath |
sslciphers | String |
timeout | Integer |
Example configuration:
In this example, the instance name is localhost, which connects to the Apache web server at http://10.0.0.111/mod_status?auto. You must append ?auto to the end of the URL to prevent the status page returning as a type that is incompatible with the plugin.
parameter_defaults:
  CollectdExtraPlugins:
    - apache
  ExtraConfig:
    collectd::plugin::apache::instances:
      localhost:
        url: "http://10.0.0.111/mod_status?auto"
Additional resources
For more information about configuring the apache plugin, see apache.
4.4. collectd::plugin::battery
Use the battery plugin to report the remaining capacity, power, or voltage of laptop batteries.
Parameter | Type |
---|---|
values_percentage | Boolean |
report_degraded | Boolean |
query_state_fs | Boolean |
interval | Integer |
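Example configuration. This is a sketch that enables the plugin and reports values as percentages; the parameters are listed in the table above and the values are illustrative:

parameter_defaults:
  CollectdExtraPlugins:
    - battery
  ExtraConfig:
    collectd::plugin::battery::values_percentage: true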
Additional resources
For more information about configuring the battery plugin, see battery.
4.5. collectd::plugin::bind
Use the bind plugin to retrieve encoded statistics about queries and responses from a DNS server, and submit those values to collectd.
Parameter | Type |
---|---|
url | HTTP URL |
memorystats | Boolean |
opcodes | Boolean |
parsetime | Boolean |
qtypes | Boolean |
resolverstats | Boolean |
serverstats | Boolean |
zonemaintstats | Boolean |
views | Array |
interval | Integer |
Parameter | Type |
---|---|
name | String |
qtypes | Boolean |
resolverstats | Boolean |
cacherrsets | Boolean |
zones | List of strings |
Example configuration:
parameter_defaults:
  CollectdExtraPlugins:
    - bind
  ExtraConfig:
    collectd::plugins::bind:
      url: http://localhost:8053/
      memorystats: true
      opcodes: true
      parsetime: false
      qtypes: true
      resolverstats: true
      serverstats: true
      zonemaintstats: true
      views:
        - name: internal
          qtypes: true
          resolverstats: true
          cacherrsets: true
        - name: external
          qtypes: true
          resolverstats: true
          cacherrsets: true
          zones:
            - "example.com/IN"
4.6. collectd::plugin::ceph
Use the ceph plugin to gather data from Ceph daemons.
Parameter | Type |
---|---|
daemons | Array |
longrunavglatency | Boolean |
convertspecialmetrictypes | Boolean |
package_name | String |
Example configuration:
parameter_defaults:
  ExtraConfig:
    collectd::plugin::ceph::daemons:
      - ceph-osd.0
      - ceph-osd.1
      - ceph-osd.2
      - ceph-osd.3
      - ceph-osd.4
If an Object Storage Daemon (OSD) is not on every node, you must list the OSDs.
When you deploy collectd, the ceph plugin is added to the Ceph nodes. Do not add the ceph plugin on Ceph nodes to CollectdExtraPlugins because this results in a deployment failure.
Additional resources
For more information about configuring the ceph plugin, see ceph.
4.7. collectd::plugin::cgroups
Use the cgroups plugin to collect information for processes in a cgroup.
Parameter | Type |
---|---|
ignore_selected | Boolean |
interval | Integer |
cgroups | List |
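Example configuration. This is a sketch using the parameters listed above; the cgroup name is illustrative:

parameter_defaults:
  CollectdExtraPlugins:
    - cgroups
  ExtraConfig:
    collectd::plugin::cgroups::cgroups:
      - "libvirt"
    collectd::plugin::cgroups::ignore_selected: false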
Additional resources
For more information about configuring the cgroups plugin, see cgroups.
4.8. collectd::plugin::connectivity
Use the connectivity plugin to monitor the state of network interfaces.
If no interfaces are listed, all interfaces are monitored by default.
Parameter | Type |
---|---|
interfaces | Array |
Example configuration:
parameter_defaults:
  ExtraConfig:
    collectd::plugin::connectivity::interfaces:
      - eth0
      - eth1
Additional resources
For more information about configuring the connectivity plugin, see connectivity.
4.9. collectd::plugin::conntrack
Use the conntrack plugin to track the number of entries in the Linux connection-tracking table. There are no parameters for this plugin.
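Example configuration. Because the plugin takes no parameters, a minimal sketch only enables it through CollectdExtraPlugins, following the same pattern as the other plugins in this chapter:

parameter_defaults:
  CollectdExtraPlugins:
    - conntrack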
4.10. collectd::plugin::contextswitch
Use the ContextSwitch plugin to collect the number of context switches that the system handles. The only parameter available is interval, which is a polling interval defined in seconds.
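Example configuration. This is a sketch; the interval value of 30 seconds is illustrative:

parameter_defaults:
  CollectdExtraPlugins:
    - contextswitch
  ExtraConfig:
    collectd::plugin::contextswitch::interval: 30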
Additional resources
For more information about configuring the contextswitch plugin, see contextswitch.
4.11. collectd::plugin::cpu
Use the cpu plugin to monitor the time that the CPU spends in various states, for example, idle, executing user code, executing system code, waiting for IO-operations, and other states.
The cpu plugin collects jiffies, not percentage values. The value of a jiffy depends on the clock frequency of your hardware platform, and therefore is not an absolute time interval unit.
To report a percentage value, set the Boolean parameters reportbycpu and reportbystate to true, and then set the Boolean parameter valuespercentage to true.
This plugin is enabled by default.
Name | Description | Query
---|---|---
idle | Amount of idle time |
interrupt | CPU blocked by interrupts |
nice | Amount of time running low priority processes |
softirq | Amount of cycles spent in servicing interrupt requests |
steal | The percentage of time a virtual CPU waits for a real CPU while the hypervisor is servicing another virtual processor |
system | Amount of time spent on system level (kernel) |
user | Jiffies that user processes use |
wait | CPU waiting on outstanding I/O request |
Parameter | Type | Defaults |
---|---|---|
reportbystate | Boolean | true |
valuespercentage | Boolean | true |
reportbycpu | Boolean | true |
reportnumcpu | Boolean | false |
reportgueststate | Boolean | false |
subtractgueststate | Boolean | true |
interval | Integer | 120 |
Example configuration:
parameter_defaults:
  CollectdExtraPlugins:
    - cpu
  ExtraConfig:
    collectd::plugin::cpu::reportbystate: true
Additional resources
For more information about configuring the cpu plugin, see cpu.
4.12. collectd::plugin::cpufreq
Use the cpufreq plugin to collect the current CPU frequency. There are no parameters for this plugin.
4.13. collectd::plugin::csv
Use the csv plugin to write values to a local file in CSV format.
Parameter | Type |
---|---|
datadir | String |
storerates | Boolean |
interval | Integer |
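Example configuration. This is a sketch using the parameters listed above; the data directory path is illustrative:

parameter_defaults:
  CollectdExtraPlugins:
    - csv
  ExtraConfig:
    collectd::plugin::csv::datadir: '/var/lib/collectd/csv'
    collectd::plugin::csv::storerates: true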
4.14. collectd::plugin::df
Use the df plugin to collect disk space usage information for file systems.
This plugin is enabled by default.
Name | Description | Query
---|---|---
free | Amount of free disk space |
reserved | Amount of reserved disk space |
used | Amount of used disk space |
Parameter | Type | Defaults
---|---|---
devices | Array |
fstypes | Array |
ignoreselected | Boolean | true
mountpoints | Array |
reportbydevice | Boolean | true
reportinodes | Boolean | true
reportreserved | Boolean | true
valuesabsolute | Boolean | true
valuespercentage | Boolean | false
Example configuration:
parameter_defaults:
  ExtraConfig:
    collectd::plugin::df::fstypes: ['tmpfs','xfs']
Additional resources
For more information about configuring the df plugin, see df.
4.15. collectd::plugin::disk
Use the disk plugin to collect performance statistics of hard disks and, if supported, partitions.
The disk plugin monitors all disks by default. You can use the ignoreselected parameter to ignore a list of disks. The example configuration ignores the sda, sdb, and sdc disks, and monitors all disks not included in the list.
This plugin is enabled by default.
Parameter | Type | Defaults
---|---|---
disks | Array |
ignoreselected | Boolean | false
udevnameattr | String | <undefined>
Name | Description |
---|---|
merged | The number of queued operations that can be merged together, for example, one physical disk access served two or more logical operations. |
time | The average time an I/O-operation takes to complete. The values might not be accurate. |
io_time | Time spent doing I/Os (ms). You can use this metric as a device load percentage. A value of 1 second matches 100% of load. |
weighted_io_time | Measure of both I/O completion time and the backlog that might be accumulating. |
pending_operations | Shows queue size of pending I/O operations. |
Example configuration:
parameter_defaults:
  ExtraConfig:
    collectd::plugin::disk::disks: ['sda', 'sdb', 'sdc']
    collectd::plugin::disk::ignoreselected: true
Additional resources
For more information about configuring the disk plugin, see disk.
4.16. collectd::plugin::hugepages
Use the hugepages plugin to collect hugepages information.
This plugin is enabled by default.
Parameter | Type | Defaults |
---|---|---|
report_per_node_hp | Boolean | true |
report_root_hp | Boolean | true |
values_pages | Boolean | true |
values_bytes | Boolean | false |
values_percentage | Boolean | false |
Example configuration:
parameter_defaults:
  ExtraConfig:
    collectd::plugin::hugepages::values_percentage: true
Additional resources
- For more information about configuring the hugepages plugin, see hugepages.
4.17. collectd::plugin::interface
Use the interface plugin to measure interface traffic in octets, packets per second, and error rate per second.
This plugin is enabled by default.
Parameter | Type | Default
---|---|---
interfaces | Array |
ignoreselected | Boolean | false
reportinactive | Boolean | true
Example configuration:
parameter_defaults:
  ExtraConfig:
    collectd::plugin::interface::interfaces:
      - lo
    collectd::plugin::interface::ignoreselected: true
Additional resources
- For more information about configuring the interfaces plugin, see interfaces.
4.18. collectd::plugin::load
Use the load plugin to collect the system load and an overview of the system use.
This plugin is enabled by default.
Parameter | Type | Default |
---|---|---|
report_relative | Boolean | true |
Example configuration:
parameter_defaults:
  ExtraConfig:
    collectd::plugin::load::report_relative: false
Additional resources
- For more information about configuring the load plugin, see load.
4.19. collectd::plugin::mcelog
Use the mcelog plugin to send notifications and statistics that are relevant to Machine Check Exceptions when they occur. Configure mcelog to run in daemon mode and enable logging capabilities.
Parameter | Type |
---|---|
Mcelogfile | String |
Memory | Hash
Example configuration:
parameter_defaults:
  CollectdExtraPlugins: mcelog
  CollectdEnableMcelog: true
Additional resources
- For more information about configuring the mcelog plugin, see mcelog.
4.20. collectd::plugin::memcached
Use the memcached plugin to retrieve information about memcached cache usage, memory, and other related information.
Parameter | Type |
---|---|
instances | Hash |
interval | Integer |
Example configuration:
parameter_defaults:
  CollectdExtraPlugins:
    - memcached
  ExtraConfig:
    collectd::plugin::memcached::instances:
      local:
        host: "%{hiera('fqdn_canonical')}"
        port: 11211
Additional resources
- For more information about configuring the memcached plugin, see memcached.
4.21. collectd::plugin::memory
Use the memory plugin to retrieve information about the memory of the system.
This plugin is enabled by default.
Parameter | Type | Defaults
---|---|---
valuesabsolute | Boolean | true
valuespercentage | Boolean |
Example configuration:
parameter_defaults:
  ExtraConfig:
    collectd::plugin::memory::valuesabsolute: true
    collectd::plugin::memory::valuespercentage: false
Additional resources
- For more information about configuring the memory plugin, see memory.
4.22. collectd::plugin::ntpd
Use the ntpd plugin to query a local NTP server that is configured to allow access to statistics, and retrieve information about the configured parameters and the time sync status.
Parameter | Type |
---|---|
host | Hostname |
port | Port number (Integer) |
reverselookups | Boolean |
includeunitid | Boolean |
interval | Integer |
Example configuration:
parameter_defaults:
  CollectdExtraPlugins:
    - ntpd
  ExtraConfig:
    collectd::plugin::ntpd::host: localhost
    collectd::plugin::ntpd::port: 123
    collectd::plugin::ntpd::reverselookups: false
    collectd::plugin::ntpd::includeunitid: false
Additional resources
- For more information about configuring the ntpd plugin, see ntpd.
4.23. collectd::plugin::ovs_stats
Use the ovs_stats plugin to collect statistics of OVS-connected interfaces. The ovs_stats plugin uses the OVSDB management protocol (RFC7047) monitor mechanism to get statistics from OVSDB.
Parameter | Type |
---|---|
address | String |
bridges | List |
port | Integer |
socket | String |
Example configuration:
The following example shows how to enable the ovs_stats plugin. If you deploy your overcloud with OVS, you do not need to enable the ovs_stats plugin.
parameter_defaults:
  CollectdExtraPlugins:
    - ovs_stats
  ExtraConfig:
    collectd::plugin::ovs_stats::socket: '/run/openvswitch/db.sock'
Additional resources
- For more information about configuring the ovs_stats plugin, see ovs_stats.
4.24. collectd::plugin::processes
The processes plugin provides information about system processes. If you do not specify custom process matching, the plugin collects only the number of processes by state and the process fork rate.
To collect more details about specific processes, you can use the process parameter to specify a process name or the process_match option to specify process names that match a regular expression. The statistics for a process_match output are grouped by process name.
Parameter | Type | Defaults |
---|---|---|
processes | Array | <undefined> |
process_matches | Array | <undefined> |
collect_context_switch | Boolean | <undefined> |
collect_file_descriptor | Boolean | <undefined> |
collect_memory_maps | Boolean | <undefined> |
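Example configuration. This is a sketch; the process name and match expression are illustrative, and each process_matches entry is assumed to take a name and a regex:

parameter_defaults:
  CollectdExtraPlugins:
    - processes
  ExtraConfig:
    collectd::plugin::processes::processes:
      - collectd
    collectd::plugin::processes::process_matches:
      - name: qemu
        regex: 'qemu-kvm'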
Additional resources
- For more information about configuring the processes plugin, see processes.
4.25. collectd::plugin::smart
Use the smart plugin to collect SMART (self-monitoring, analysis and reporting technology) information from physical disks on the node. You must also set the parameter CollectdContainerAdditionalCapAdd to CAP_SYS_RAWIO to allow the smart plugin to read SMART telemetry. If you do not set the CollectdContainerAdditionalCapAdd parameter, the following message is written to the collectd error logs:

smart plugin: Running collectd as root, but the CAP_SYS_RAWIO capability is missing. The plugin's read function will probably fail. Is your init system dropping capabilities?
Parameter | Type |
---|---|
disks | Array |
ignoreselected | Boolean |
interval | Integer |
Example configuration:
parameter_defaults:
  CollectdExtraPlugins:
    - smart
  CollectdContainerAdditionalCapAdd: "CAP_SYS_RAWIO"
Additional resources
- For more information about configuring the smart plugin, see smart.
4.26. collectd::plugin::swap
Use the swap plugin to collect information about the available and used swap space.
Parameter | Type |
---|---|
reportbydevice | Boolean |
reportbytes | Boolean |
valuesabsolute | Boolean |
valuespercentage | Boolean |
reportio | Boolean |
Example configuration:
parameter_defaults:
  CollectdExtraPlugins:
    - swap
  ExtraConfig:
    collectd::plugin::swap::reportbydevice: false
    collectd::plugin::swap::reportbytes: true
    collectd::plugin::swap::valuesabsolute: true
    collectd::plugin::swap::valuespercentage: false
    collectd::plugin::swap::reportio: true
4.27. collectd::plugin::tcpconns
Use the tcpconns plugin to collect information about the number of TCP connections inbound or outbound from the configured port. The local port configuration represents ingress connections. The remote port configuration represents egress connections.
Parameter | Type |
---|---|
localports | Port (Array) |
remoteports | Port (Array) |
listening | Boolean |
allportssummary | Boolean |
Example configuration:
parameter_defaults:
  CollectdExtraPlugins:
    - tcpconns
  ExtraConfig:
    collectd::plugin::tcpconns::listening: false
    collectd::plugin::tcpconns::localports:
      - 22
    collectd::plugin::tcpconns::remoteports:
      - 22
4.28. collectd::plugin::thermal
Use the thermal plugin to retrieve ACPI thermal zone information.
Parameter | Type |
---|---|
devices | Array |
ignoreselected | Boolean |
interval | Integer |
Example configuration:
parameter_defaults:
  CollectdExtraPlugins:
    - thermal
4.29. collectd::plugin::uptime
Use the uptime plugin to collect information about system uptime.
This plugin is enabled by default.
Parameter | Type |
---|---|
interval | Integer |
4.30. collectd::plugin::virt
Use the virt plugin to collect CPU, disk, network load, and other metrics through the libvirt API for virtual machines on the host.
This plugin is enabled by default on compute hosts.
Parameter | Type |
---|---|
connection | String |
refresh_interval | Hash |
domain | String |
block_device | String |
interface_device | String |
ignore_selected | Boolean |
plugin_instance_format | String |
hostname_format | String |
interface_format | String |
extra_stats | String |
Example configuration:
ExtraConfig:
  collectd::plugin::virt::hostname_format: "name uuid hostname"
  collectd::plugin::virt::plugin_instance_format: metadata
Additional resources
For more information about configuring the virt plugin, see virt.
4.31. collectd::plugin::vmem
Use the vmem plugin to collect information about virtual memory from the kernel subsystem.
Parameter | Type |
---|---|
verbose | Boolean |
interval | Integer |
Example configuration:
parameter_defaults:
  CollectdExtraPlugins:
    - vmem
  ExtraConfig:
    collectd::plugin::vmem::verbose: true
4.32. collectd::plugin::write_http
Use the write_http output plugin to submit values to an HTTP server by using POST requests and encoding metrics with JSON, or by using the PUTVAL command.
Parameter | Type |
---|---|
ensure | Enum[present, absent] |
nodes | Hash[String, Hash[String, Scalar]] |
urls | Hash[String, Hash[String, Scalar]] |
manage_package | Boolean |
Example configuration:
parameter_defaults:
  CollectdExtraPlugins:
    - write_http
  ExtraConfig:
    collectd::plugin::write_http::nodes:
      collectd:
        url: "http://collectd.tld.org/collectd"
        metrics: true
        header: "X-Custom-Header: custom_value"
Additional resources
- For more information about configuring the write_http plugin, see write_http.
4.33. collectd::plugin::write_kafka
Use the write_kafka plugin to send values to a Kafka topic. Configure the write_kafka plugin with one or more topic blocks. For each topic block, you must specify a unique name and one Kafka producer. You can use the following per-topic parameters inside the topic block:
Parameter | Type |
---|---|
kafka_hosts | Array[String] |
topics | Hash |
properties | Hash |
meta | Hash |
Example configuration:
parameter_defaults:
  CollectdExtraPlugins:
    - write_kafka
  ExtraConfig:
    collectd::plugin::write_kafka::kafka_hosts:
      - remote.tld:9092
    collectd::plugin::write_kafka::topics:
      mytopic:
        format: JSON
Additional resources
For more information about how to configure the write_kafka plugin, see write_kafka.
4.34. Other collectd plugins
The following plugins are available, but their use cases are not defined for many Red Hat OpenStack Platform (RHOSP) environments.
Plugin | Notes
---|---
cURL | You can use this plugin to read files with libcurl.
cURL-JSON | You can use this plugin to query JSON data using the cURL library and parse the data according to the configuration.
DNS | You can use this plugin to interpret the packets and collect statistics of your DNS traffic on UDP port 53.
Entropy | You can use this plugin to report the available entropy on a system.
Ethstat | You can use this plugin to read performance statistics from the Ethernet cards.
Exec | You can use this plugin to execute scripts or applications and print to STDOUT.
fhcount | You can use this plugin to provide statistics about used, unused, and total number of file handles on Linux.
FileCount | You can use this plugin to count the number of files in a directory and its subdirectories.
FSCache | You can use this plugin to collect information about the file-system-based caching infrastructure for network file systems and other slow media.
HDDTemp | You can use this plugin to collect the temperature of hard disks.
IntelRDT | You can use this plugin to collect information provided by monitoring features of Intel Resource Director Technology.
IPMI | You can use this plugin to read hardware sensors from servers using the Intelligent Platform Management Interface (IPMI).
IRQ | You can use this plugin to collect the number of times each interrupt is handled by the operating system.
LogFile | You can use this plugin to write to a log file.
MySQL | You can use this plugin to connect to a MySQL database and issue a SHOW STATS command periodically. The command returns the server status variables, many of which are collected.
Netlink | You can use this plugin to get statistics for interfaces, qdiscs, classes, or filters.
Network | You can use this plugin to interact with other collectd instances.
NFS | You can use this plugin to get information about the usage of the Network File System (NFS), version 2, 3, and 4.
numa | You can use this plugin to report statistics of the Non-Uniform Memory Access (NUMA) subsystem of Linux.
OpenLDAP | You can use this plugin to report the status of OpenLDAP.
OpenVPN | You can use this plugin to read the status file printed by OpenVPN.
OVS Events | You can use this plugin to monitor the link status of OpenvSwitch-connected interfaces and send notifications when the link state changes.
PCIe Errors | You can use this plugin to monitor and report PCI Express errors.
Ping | You can use this plugin to measure network latency using Internet Control Message Protocol (ICMP) echo requests.
procevent | You can use this plugin to monitor process starts and exits.
Python | You can use this plugin to bind to Python.
Sensors | You can use this plugin to read hardware sensors using lm-sensors.
Serial | You can use this plugin to collect traffic on the serial interface.
SNMP | You can use this plugin to read values from network devices using the Simple Network Management Protocol (SNMP).
SNMP Agent | You can use this plugin to handle queries from the principal SNMP agent and return the data collected by read plugins.
StatsD | You can use this plugin to implement the StatsD network protocol.
sysevent | You can use this plugin to listen for incoming rsyslog messages.
SysLog | You can use this plugin to receive log messages from the collectd daemon and dispatch them to syslog.
Table | You can use this plugin to parse plain text files in table format.
Tail | You can use this plugin to tail log files, in a similar way to the tail -F command.
Tail CSV | You can use this plugin to tail CSV files.
threshold | You can use this plugin to generate notifications on given thresholds.
turbostat | You can use this plugin to read CPU frequency and C-state residency on modern Intel turbo-capable processors.
UnixSock | You can use this plugin to communicate with the collectd daemon through a UNIX socket.
Users | You can use this plugin to count the number of users currently logged into the system.
UUID | You can use this plugin to determine the unique identifier (UUID) of the system that it is running on.
Write Graphite | You can use this plugin to store values in Carbon, the storage layer of Graphite.
Write HTTP | You can use this plugin to send the values collected by collectd to a web server by using HTTP POST requests.
Write Log | You can use this plugin to write metrics as INFO log messages.
Write Prometheus | You can use this plugin to implement a web server that can be scraped by Prometheus.