Chapter 8. Analyzing outliers and errors with exemplars


Use exemplars to connect a metric to a recorded trace to analyze the root cause of errors or latencies.

Metrics are aggregations over several events, and show you if your system is operating within defined bounds. They are great to monitor error rates or tail latencies and to set up alerting or drive performance optimizations. Still, the aggregation makes it difficult to find root causes for latencies or errors reported in metrics.

Root causes for errors and latencies can be found by enabling tracing. To connect a metric to a recorded trace, there is the concept of exemplars.

Once exemplars are set up, Red Hat build of Keycloak reports metrics with their last recorded trace as an exemplar. A dashboard tool like Grafana can link the exemplar from a metrics dashboard to a trace view.

Metrics that support exemplars are:

  • http_server_requests_seconds_count (including histograms)
    See the chapter HTTP metrics for details on this metric.
  • keycloak_credentials_password_hashing_validations_total
    See the chapter Self-provided metrics for details on this metric.
  • keycloak_user_events_total
    See the chapter Self-provided metrics for details on this metric.

See below for a screenshot of a heatmap visualization for latencies that is showing an exemplar when hovering over one of the pink indicators.

Figure 8.1. Heatmap diagram with exemplar

8.1. Setting up exemplars

To benefit from exemplars, perform the following steps:

  1. Enable metrics for Red Hat build of Keycloak as described in chapter Gaining insights with metrics.
  2. Enable tracing for Red Hat build of Keycloak as described in chapter Root cause analysis with tracing.
  3. Enable exemplar storage in your monitoring system.

    For Prometheus, this is a preview feature that you need to enable.

  4. Scrape the metrics using the OpenMetricsText1.0.0 protocol, which is not enabled by default in Prometheus.

    If you are using PodMonitors or similar in a Kubernetes environment, this can be achieved by adding it to the spec of the custom resource:

    apiVersion: monitoring.coreos.com/v1
    kind: PodMonitor
    metadata:
      ...
    spec:
      scrapeProtocols:
        - OpenMetricsText1.0.0
    Copy to Clipboard Toggle word wrap
  5. Configure your metrics datasource where to link to for traces.

    When using Grafana and Prometheus, this would be setting up a exemplarTraceIdDestinations for the Prometheus datasource, which then points to your tracing datasource that is provided by tools like Jaeger or Tempo.

  6. Enable exemplars in your dashboards.

    Enable the Exemplars toggle in each query on each dashboard where you want to show exemplars. When set up correctly, you will notice little dots or stars in your dashboards that you can click on to view the traces.

Note
  • If you do not specify the scrape protocol, Prometheus will by default not send it in the content negotiation, and Keycloak will then fall back to the PrometheusText protocol which will not contain the exemplars.
  • If you enabled tracing and metrics, but the request sampling did not record a trace, the exposed metric will not contain any exemplars.
  • If you access the metrics endpoint with your browser, the content negotiation will lead to the format PrometheusText being returned, and you will not see any exemplars.

8.2. Verifying that exemplars work as expected

Perform the following steps to verify that Red Hat build of Keycloak is set up correctly for exemplars:

  1. Follow the instructions to set up metrics and tracing for Red Hat build of Keycloak.
  2. For test purposes, record all traces by setting the tracing ration to 1.0. See Root cause analysis with tracing for recommended sampling settings in production systems.
  3. Log in to the Keycloak instance to create some traces.
  4. Scrape the metrics with a command similar to the following and search for those metrics that have an exemplar set:

    $ curl -s http://localhost:9000/metrics \
    -H 'Accept: application/openmetrics-text; version=1.0.0; charset=utf-8' \
    | grep "#.*trace_id"
    Copy to Clipboard Toggle word wrap

    This should result in an output similar to the following. Note the additional # after which the span and trace IDs are added:

    http_server_requests_seconds_count {...} ... # {span_id="...",trace_id="..."} ...
    Copy to Clipboard Toggle word wrap
Back to top
Red Hat logoGithubredditYoutubeTwitter

Learn

Try, buy, & sell

Communities

About Red Hat Documentation

We help Red Hat users innovate and achieve their goals with our products and services with content they can trust. Explore our recent updates.

Making open source more inclusive

Red Hat is committed to replacing problematic language in our code, documentation, and web properties. For more details, see the Red Hat Blog.

About Red Hat

We deliver hardened solutions that make it easier for enterprises to work across platforms and environments, from the core datacenter to the network edge.

Theme

© 2025 Red Hat