Este contenido no está disponible en el idioma seleccionado.
Chapter 1. Administrator metrics
1.1. OpenShift Serverless administrator metrics Copiar enlaceEnlace copiado en el portapapeles!
Metrics enable cluster administrators to monitor how OpenShift Serverless cluster components and workloads are performing.
To view different metrics for OpenShift Serverless, you can check the OpenShift Container Platform monitoring documentation.
1.1.1. Prerequisites for OpenShift Serverless administrator metrics Copiar enlaceEnlace copiado en el portapapeles!
- You have enabled metrics for your cluster.
- You have access to an account with cluster administrator access (or dedicated administrator access for OpenShift Dedicated or Red Hat OpenShift Service on AWS).
If Service Mesh is enabled with mTLS, metrics for Knative Serving are disabled by default because Service Mesh prevents Prometheus from scraping metrics.
To resolve this issue, see the Enabling Knative Serving metrics when using Service Mesh with mTLS section in the Service Mesh integration documentation.
Scraping the metrics does not affect autoscaling of a Knative service, because scraping requests do not go through the activator. so, no scraping takes place if no pods are running.
1.2. Serverless controller metrics Copiar enlaceEnlace copiado en el portapapeles!
Any component that implements controller logic emits the following metrics. These metrics show details about reconciliation operations and the work queue behavior that adds reconciliation requests to the queue.
| Metric name | Description | Type | Tags | Unit |
|---|---|---|---|---|
|
| The depth of the work queue. | Gauge |
| Integer (no units) |
|
| The number of reconcile operations. | Counter |
| Integer (no units) |
|
| The latency of reconcile operations. | Histogram |
| Milliseconds |
|
| The total number of add actions handled by the work queue. | Counter |
| Integer (no units) |
|
| The length of time an item stays in the work queue before being requested. | Histogram |
| Seconds |
|
| The total number of retries that have been handled by the work queue. | Counter |
| Integer (no units) |
|
| The length of time it takes to process and item from the work queue. | Histogram |
| Seconds |
|
| The length of time that outstanding work queue items have been in progress. | Histogram |
| Seconds |
|
| The length of time that the longest outstanding work queue items has been in progress. | Histogram |
| Seconds |
1.3. Webhook metrics Copiar enlaceEnlace copiado en el portapapeles!
Webhook metrics report useful information about operations. For example, if a large number of operations fail, this might indicate an issue with a user-created resource.
| Metric name | Description | Type | Tags | Unit |
|---|---|---|---|---|
|
| The number of requests that are routed to the webhook. | Counter |
| Integer (no units) |
|
| The response time for a webhook request. | Histogram |
| Milliseconds |
1.4. Knative Eventing metrics Copiar enlaceEnlace copiado en el portapapeles!
Cluster administrators can view the following metrics for Knative Eventing components. By aggregating the metrics from HTTP code, you can separate the events into two categories; successful events (2xx) and failed events (5xx).
1.4.1. Broker ingress metrics Copiar enlaceEnlace copiado en el portapapeles!
You can use the following metrics to debug the broker ingress, evaluate its performance, and identify that events the ingress component dispatches.
| Metric name | Description | Type | Tags | Unit |
|---|---|---|---|---|
|
| Number of events received by a broker. | Counter |
| Integer (no units) |
|
| The time taken to dispatch an event to a channel. | Histogram |
| Milliseconds |
1.4.2. Broker filter metrics Copiar enlaceEnlace copiado en el portapapeles!
You can use the following metrics to debug broker filters, evaluate their performance, and confirm that the filters dispatch events. You can also measure the latency of event filtering.
| Metric name | Description | Type | Tags | Unit |
|---|---|---|---|---|
|
| Number of events received by a broker. | Counter |
| Integer (no units) |
|
| The time taken to dispatch an event to a channel. | Histogram |
| Milliseconds |
|
| The time required to process an event before dispatching it to a trigger subscriber. | Histogram |
| Milliseconds |
1.4.3. InMemoryChannel dispatcher metrics Copiar enlaceEnlace copiado en el portapapeles!
You can use the following metrics to debug InMemoryChannel channels, evaluate their performance, and identify the events that the channels dispatch.
| Metric name | Description | Type | Tags | Unit |
|---|---|---|---|---|
|
|
Number of events dispatched by | Counter |
| Integer (no units) |
|
|
The time taken to dispatch an event from an | Histogram |
| Milliseconds |
1.4.4. Event source metrics Copiar enlaceEnlace copiado en el portapapeles!
You can use the following metrics to verify that the event source delivered events to the connected event sink.
| Metric name | Description | Type | Tags | Unit |
|---|---|---|---|---|
|
| Number of events sent by the event source. | Counter |
| Integer (no units) |
|
| The event source retries and sends events that failed during the initial delivery trial. | Counter |
| Integer (no units) |
1.4.5. Knative Kafka broker metrics Copiar enlaceEnlace copiado en el portapapeles!
You can use the following metrics to debug and visualize the performance of Kafka broker.
| Metric name | Description | Type | Tags | Unit |
|---|---|---|---|---|
|
| Number of events received by a broker | Counter |
| Dimensionless |
|
| The time spent dispatching an event to a Kafka cluster | Histogram |
| Milliseconds |
|
| Number of expected replicas for a given Kafka consumer group resource | Gauge |
Note In this context, resources refer to user facing entities such as Kafka source, trigger, and subscription. Avoid using internal or generated names when using these resources. | Dimensionless |
|
| Number of ready replicas for a given Kafka consumer group resource | Gauge |
Note In this context, resources refer to user facing entities such as Kafka source, trigger, and subscription. Avoid using internal or generated names when using these resources. | Dimensionless |
1.4.6. Knative Kafka trigger metrics Copiar enlaceEnlace copiado en el portapapeles!
You can use the following metrics to debug and visualize the performance of Kafka triggers.
| Metric name | Description | Type | Tags | Unit |
|---|---|---|---|---|
|
| Number of events dispatched by a trigger to a subscriber | Counter |
| Dimensionless |
|
| The time spent dispatching an event to a subscriber | Histogram |
| Milliseconds |
|
| The time spent processing and filtering an event | Histogram |
| Milliseconds |
1.4.7. Knative Kafka channel metrics Copiar enlaceEnlace copiado en el portapapeles!
You can use the following metrics to debug and visualize the performance of Kafka channel.
| Metric name | Description | Type | Tags | Unit |
|---|---|---|---|---|
|
| Number of events received by a Kafka channel | Counter |
| Dimensionless |
|
| The time spent dispatching an event to a Kafka cluster | Histogram |
| Milliseconds |
1.4.8. Knative Kafka subscription metrics Copiar enlaceEnlace copiado en el portapapeles!
You can use the following metrics to debug and visualize the performance of subscriptions associated with the Kafka channel.
| Metric name | Description | Type | Tags | Unit |
|---|---|---|---|---|
|
| Number of events dispatched by a subscription to a subscriber | Counter |
| Dimensionless |
|
| The time spent dispatching an event to a subscriber | Histogram |
| Milliseconds |
|
| The time spent processing an event | Histogram |
| Dimensionless |
1.4.9. Knative Kafka source metrics Copiar enlaceEnlace copiado en el portapapeles!
You can use the following metrics to debug and visualize the performance of Kafka sources.
| Metric name | Description | Type | Tags | Unit |
|---|---|---|---|---|
|
| Number of events dispatched by a Kafka source | Counter |
| Dimensionless |
|
| The time spent dispatching an event to a sink | Histogram |
| Milliseconds |
|
| The time spent processing an event | Histogram |
| Milliseconds |
|
| Number of expected replicas for a given Kafka consumer group resource | Gauge |
Note In this context, resources refer to user facing entities such as Kafka source,trigger, and subscription. Avoid using internal or generated names when using these resources. | Dimensionless |
|
| Number of ready replicas for a given Kafka consumer group resource | Gauge |
Note In this context, resources refer to user facing entities such as Kafka source,trigger, and subscription. Avoid using internal or generated names when using these resources. | Dimensionless |
1.4.10. Knative Kafka sink metrics Copiar enlaceEnlace copiado en el portapapeles!
You can use the following metrics to debug and visualize the performance of Kafka sinks.
| Metric name | Description | Type | Tags | Unit |
|---|---|---|---|---|
|
| Number of events received by a broker | Counter |
| Dimensionless |
|
| The time spent dispatching an event to a Kafka cluster | Histogram |
| Milliseconds |
1.5. Knative Serving metrics Copiar enlaceEnlace copiado en el portapapeles!
Cluster administrators can view the following metrics for Knative Serving components.
1.5.1. Activator metrics Copiar enlaceEnlace copiado en el portapapeles!
You can use the following metrics to understand how applications respond when traffic passes through the activator.
| Metric name | Description | Type | Tags | Unit |
|---|---|---|---|---|
|
| The number of concurrent requests that routes to the activator, or average concurrency over a reporting period. | Gauge |
| Integer (no units) |
|
| The number of requests that route to the activator. The activator handler processes these requests. | Counter |
| Integer (no units) |
|
| The response time in milliseconds for a fulfilled, routed request. | Histogram |
| Milliseconds |
1.5.2. Autoscaler metrics Copiar enlaceEnlace copiado en el portapapeles!
The autoscaler component exposes several metrics related to autoscaler behavior for each revision. For example, you can monitor the number of pods that the autoscaler targets for a service. You can also monitor the average requests per second during the stable window and whether the autoscaler enters panic mode when using the Knative pod autoscaler (KPA).
| Metric name | Description | Type | Tags | Unit |
|---|---|---|---|---|
|
| The number of pods the autoscaler tries to assign for a service. | Gauge |
| Integer (no units) |
|
| The excess burst capacity served over the stable window. | Gauge |
| Integer (no units) |
|
| The average number of requests for each observed pod over the stable window. | Gauge |
| Integer (no units) |
|
| The average number of requests for each observed pod over the panic window. | Gauge |
| Integer (no units) |
|
| The number of concurrent requests that the autoscaler tries to send to each pod. | Gauge |
| Integer (no units) |
|
| The average number of requests-per-second for each observed pod over the stable window. | Gauge |
| Integer (no units) |
|
| The average number of requests-per-second for each observed pod over the panic window. | Gauge |
| Integer (no units) |
|
| The number of requests-per-second that the autoscaler targets for each pod. | Gauge |
| Integer (no units) |
|
|
This value is | Gauge |
| Integer (no units) |
|
| The number of pods that the autoscaler has requested from the Kubernetes cluster. | Gauge |
| Integer (no units) |
|
| The number of pods the system allocates that are currently ready. | Gauge |
| Integer (no units) |
|
| The number of pods that have a not ready state. | Gauge |
| Integer (no units) |
|
| The number of pods that are currently pending. | Gauge |
| Integer (no units) |
|
| The number of pods that are currently terminating. | Gauge |
| Integer (no units) |
1.5.3. Go runtime metrics Copiar enlaceEnlace copiado en el portapapeles!
Each Knative Serving control plane process emits many Go runtime memory statistics, as defined in MemStats.
The name tag for each metric is an empty tag.
| Metric name | Description | Type | Tags | Unit |
|---|---|---|---|---|
|
|
The number of bytes of allocated heap objects. This metric is the same as | Gauge |
| Integer (no units) |
|
| The cumulative bytes allocated for heap objects. | Gauge |
| Integer (no units) |
|
| The total bytes of memory obtained from the operating system. | Gauge |
| Integer (no units) |
|
| The number of pointer lookups performed by the runtime. | Gauge |
| Integer (no units) |
|
| The cumulative count of heap objects allocated. | Gauge |
| Integer (no units) |
|
| The cumulative count of heap objects that are free. | Gauge |
| Integer (no units) |
|
| The number of bytes of allocated heap objects. | Gauge |
| Integer (no units) |
|
| The number of bytes of heap memory obtained from the operating system. | Gauge |
| Integer (no units) |
|
| The number of bytes in idle, unused spans. | Gauge |
| Integer (no units) |
|
| The number of bytes in spans that are currently in use. | Gauge |
| Integer (no units) |
|
| The number of bytes of physical memory returned to the operating system. | Gauge |
| Integer (no units) |
|
| The number of allocated heap objects. | Gauge |
| Integer (no units) |
|
| The number of bytes in stack spans that are currently in use. | Gauge |
| Integer (no units) |
|
| The number of bytes of stack memory obtained from the operating system. | Gauge |
| Integer (no units) |
|
|
The number of bytes of allocated | Gauge |
| Integer (no units) |
|
|
The number of bytes of memory obtained from the operating system for | Gauge |
| Integer (no units) |
|
|
The number of bytes of allocated | Gauge |
| Integer (no units) |
|
|
The number of bytes of memory obtained from the operating system for | Gauge |
| Integer (no units) |
|
| The number of bytes of memory in profiling bucket hash tables. | Gauge |
| Integer (no units) |
|
| The number of bytes of memory in garbage collection metadata. | Gauge |
| Integer (no units) |
|
| The number of bytes of memory in miscellaneous, off-heap runtime allocations. | Gauge |
| Integer (no units) |
|
| The target heap size of the next garbage collection cycle. | Gauge |
| Integer (no units) |
|
| The time that the last garbage collection was completed. | Gauge |
| Nanoseconds |
|
| The cumulative time in garbage collection stop-the-world pauses since the program started. | Gauge |
| Nanoseconds |
|
| The number of completed garbage collection cycles. | Gauge |
| Integer (no units) |
|
| The number of garbage collection cycles that were forced due to an application calling the garbage collection function. | Gauge |
| Integer (no units) |
|
| The fraction of the available CPU time of the program that has been used by the garbage collector since the program started. | Gauge |
| Integer (no units) |