Understand primary workloads for automation controller
The primary workloads for automation controller include the following:
- Managing automation content through automation controller projects
- Initiating automation by executing jobs
Automation controller project synchronization
Users define the source of automation content, such as Ansible Playbooks, within automation controller projects. The primary workload for these projects is synchronization. Project update jobs in the API manage synchronization; these jobs are also known as source control updates in the UI.
These project update jobs run only on the control plane and in task pods within the OpenShift Container Platform. Their role is to update the automation controller with the latest automation content. This content comes from its defined source, such as a Git repository.
Updating projects is not performance-sensitive, provided that they store only playbooks and Ansible-related text files. However, issues might arise when projects become excessively large.
Do not store large volumes of binary data within a project. If jobs require access to additional data, they should retrieve it from object storage or file storage. This retrieval must be done within the playbook’s scope.
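For example, a playbook task can fetch large data at run time instead of committing it to the project. The following sketch assumes a hypothetical S3-compatible endpoint; the URL and paths are placeholders:

```yaml
# Hypothetical task: retrieve a large artifact from object storage at
# run time rather than storing it in the project. URL and destination
# are placeholders.
- name: Retrieve deployment artifact from object storage
  hosts: all
  tasks:
    - name: Download artifact from an S3-compatible endpoint
      ansible.builtin.get_url:
        url: "https://objectstore.example.com/bucket/artifact.tar.gz"
        dest: /tmp/artifact.tar.gz
        mode: "0644"
```

This keeps the project small and fast to synchronize while still giving jobs access to the data they need.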
Jobs and automation workloads
Jobs are the primary workload for automation controller and run on the execution plane. They include the following job types:
- Standard jobs
- Workflow, sliced, and bulk jobs
- System jobs
Standard jobs
Standard jobs involve the execution of an Ansible Playbook from a project against a set of hosts from an inventory. Jobs are initiated by a control node, which then streams, processes, and stores job results.
A performance-sensitive part of this is the processing of the playbook output. The automation controller captures the output and serializes it into job events. A single Ansible task running against a host typically produces multiple job events, for example:
- Task start
- Host-specific details
- Task completion
Event volume varies significantly with the playbook’s configured verbosity level. For example, a simple debug task that prints Hello World on one host might produce 6 job events at verbosity 1, increasing to 34 job events at verbosity 3.
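As an illustration, the debug task described above might look like the following; the exact event counts depend on the configured verbosity and the platform version:

```yaml
# Minimal playbook whose single debug task still generates multiple job
# events (playbook start, play start, task start, runner events, stats).
- name: Event volume illustration
  hosts: localhost
  gather_facts: false
  tasks:
    - name: Print a message
      ansible.builtin.debug:
        msg: Hello World
```

Running the same playbook at higher verbosity multiplies the event volume without changing the automation performed.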
The dispatcher and the callback receiver collaborate to process, transmit, and store job events. These actions contribute to the platform’s storage and processing usage. Job events are processed on the control plane and stored in the database. The dispatcher processes job events, and the callback receiver stores them.
Workflow, sliced, and bulk jobs
To enable complex automation and orchestration, use the following job types to extend standard jobs:
- Sliced jobs: Split jobs to run against slices of the inventory in parallel
- Bulk jobs: Launch multiple jobs in a single request
- Workflow jobs: Coordinate multiple job templates
These job types coordinate the launch and management of multiple underlying standard jobs. They affect job scheduling, which occurs on the control plane, but otherwise have little additional impact on platform services.
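A sliced job can be configured through the `job_template` module in the `ansible.controller` collection. The following is a sketch under assumed names; the template, project, and inventory names are placeholders:

```yaml
# Sketch: define a job template that slices its inventory into 4 parallel
# jobs. All object names here are illustrative.
- name: Configure a sliced job template
  hosts: localhost
  gather_facts: false
  tasks:
    - name: Create a job template that runs against inventory slices
      ansible.controller.job_template:
        name: Patch fleet (sliced)
        organization: Default
        project: Example project
        playbook: patch.yml
        inventory: Example inventory
        job_slice_count: 4
```

Launching this template creates a workflow that manages four standard jobs, each running against one quarter of the inventory.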
System jobs
System jobs involve internal maintenance tasks, such as cleanup of old job event data. Schedules manage the execution frequency of system jobs. System jobs run on the control plane because they run management commands that interact with the database. These workloads involve key platform activities.
Reducing the frequency of system jobs or increasing the number of days of data to retain can degrade database performance. It is generally recommended to retain as few days of data as possible. Use external logging features for long-term audit data storage. Storing more data in the database can make queries that scan large tables more costly.
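A cleanup schedule can be managed as code with the `schedule` module in the `ansible.controller` collection. The following is a sketch: the system job template name, the recurrence rule, and the retention value are assumptions to adapt to your deployment:

```yaml
# Sketch: schedule the built-in job cleanup system job nightly, keeping
# only 7 days of job history. Template name, rrule, and days value are
# illustrative.
- name: Schedule aggressive job history cleanup
  hosts: localhost
  gather_facts: false
  tasks:
    - name: Run the job cleanup system job every night
      ansible.controller.schedule:
        name: Nightly job cleanup
        unified_job_template: Cleanup Job Details
        rrule: "DTSTART:20250101T040000Z RRULE:FREQ=DAILY;INTERVAL=1"
        extra_data:
          days: 7
```

A short retention window keeps the job event tables small, which keeps queries against them fast.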
Tune Event-Driven Ansible activations
Activations are used by Event-Driven Ansible to run instances of ansible-rulebook. These activations can either connect to external event sources or listen to an event stream for incoming payloads.
Activation and output management uses the following:
- Event-Driven Ansible hybrid nodes
- Platform gateway for event stream handling
- The WebSocket server in each API node or pod
- The database for audit event storage
Activations process discrete payloads called events. The activation’s resource usage is affected by the event arrival rate and the complexity of the rulebook’s rules.
When events match rules, they trigger actions, which launch jobs in automation controller. Event auditing stores audit events in the database and is enabled by default.
Each event is sent from the activation to the WebSocket server to be serialized and written to the database. This process can place significant load on the server and cause performance issues. Selecting Skip audit events in the UI for a given activation eliminates this workload.
When Skip audit events is selected, rules are still fired. However, the fire count in the API and UI is updated at a periodic interval (default 300 seconds) rather than immediately.
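For context, an activation runs a rulebook such as the following sketch. The webhook source, condition, and job template name are illustrative; each incoming payload is an event, and matching events trigger the action:

```yaml
# Sketch of a rulebook: a webhook-style source receives event payloads,
# and a matching rule launches a job template in automation controller.
# Source parameters, the condition, and all names are placeholders.
- name: React to service alerts
  hosts: all
  sources:
    - ansible.eda.webhook:
        host: 0.0.0.0
        port: 5000
  rules:
    - name: Restart the service when it reports as down
      condition: event.payload.status == "down"
      action:
        run_job_template:
          name: Restart service
          organization: Default
```

The event arrival rate on the source and the complexity of the conditions determine how much work the activation performs per event.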
Minimize the impact of collection syncing
Private automation hub can synchronize collections from remote ansible-galaxy repositories, such as galaxy.ansible.com or automation hub on console.redhat.com.
Pulp content workers and the database synchronize the repositories. The automation controller can download these collections during project updates or use them to build automation execution environments. Any other client can also download collections by using the ansible-galaxy CLI.
The performance of collection synchronization is impacted by the following:
- The number of collections listed in the requirements.yml file
- The number of versions synced
- The number of versions retained
Synchronization uses memory in direct proportion to the number of collections and versions synchronized. Using a targeted requirements.yml with specific versions can limit this impact.
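A targeted requirements.yml might look like the following sketch; the collection names and pinned versions are illustrative:

```yaml
# Targeted requirements.yml that pins specific collection versions to
# limit the number of collections and versions synchronized. Versions
# shown are examples only.
collections:
  - name: ansible.posix
    version: 1.5.4
  - name: community.general
    version: 8.6.0
```

Pinning versions avoids pulling every published version of each collection during synchronization.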
Hosting collections uses storage space. Manage the storage space that collections use by specifying the retained number of versions on the repository.
Pull hosted container images from private automation hub
Private automation hub hosts container images for automation execution and decision environments. Event-Driven Ansible and automation controller pull these images to run activations or jobs. The pull frequency for these containers is determined by the following:
- The frequency of job starts
- The pull policy configured for the automation execution environments and decision environments
The performance of pushing and pulling container images from automation hub depends on the disk performance of the underlying storage. This is because Pulp content workers store and fetch the layers of the container image from disk.
The size of layers can impact the memory used by the Pulp content workers. This is because they serve entire layers in a single operation.
The frequency of container image pulls is determined by the following factors:
- The pull policy on jobs and activations
- The frequency of job or activation starts
- The node or Container Group’s existing image status
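For Container Groups, the pull behavior can be set in the custom pod spec. The following excerpt is a sketch; the image reference is a placeholder, and `IfNotPresent` avoids re-pulling a layer set the node already has:

```yaml
# Excerpt of a custom Container Group pod spec. imagePullPolicy:
# IfNotPresent skips the pull when the node already has the image.
# The image reference is a placeholder.
apiVersion: v1
kind: Pod
spec:
  containers:
    - name: worker
      image: hub.example.com/ee-minimal-rhel8:latest
      imagePullPolicy: IfNotPresent
```

Reducing unnecessary pulls lowers the load on the Pulp content workers and the underlying storage.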
Reference workloads for growth topologies
The following table provides reference data for typical workloads, performance metrics, and capacity planning for the tested Ansible Automation Platform growth topologies.
| Component / Feature | Metric |
|---|---|
| REST API request rate | 8 requests per second (RPS) |
| REST API 50th percentile latency at 8 RPS | 500 milliseconds |
| REST API 99th percentile latency at 8 RPS | 1.5 seconds |
| Hosts in automation controller inventory | 1,000 hosts |
| Job start rate in automation controller (maximum burst rate with standard launch) | 20 jobs started per second |
| Concurrent jobs in automation controller | 10 concurrent jobs with default forks (5 forks is the default), plus 100 with forks=1 |
| Callback receiver event processing rate | 10,000 job events per second at peak |
| Job history with 30 days retention | 2 KB per event; 500 events per playbook run; 500 jobs per day; totals less than 60 GB (the minimum required disk on the database node) |
| Sync time (certified content) | Less than 30 minutes |
| Sync time (validated content) | Less than 5 minutes |
| Activation event processing with Skip audit events enabled (6 activations), events incoming through an event stream, and execution strategy set to sequential (default) in the rulebook | 1 actionable event per minute with a minimal payload, with a job template action on the local automation controller where each job completes in 1 minute |
Reference workloads for enterprise topologies
The following table provides reference data for typical workloads, performance metrics, and capacity planning for the tested Ansible Automation Platform enterprise topologies.
| Component / Feature | Metric |
|---|---|
| REST API request rate | 16 requests per second (RPS) |
| REST API 50th percentile latency at 16 RPS | 500 milliseconds |
| REST API 99th percentile latency at 16 RPS | 1.5 seconds |
| Hosts in automation controller inventory | 10,000 hosts |
| Job start rate in automation controller | 80 jobs started per second |
| Concurrent jobs in automation controller | 40 with default forks (5 forks is the default), plus 400 with forks=1 |
| Callback receiver event rate | 40,000 events per second at peak |
| Job history with 7 days retention | 2 KB per event; 500 events per playbook run; 2,000 jobs per day; totals less than 60 GB (the minimum required disk on the database node) |
| Sync time (certified content) | Less than 30 minutes |
| Sync time (validated content) | Less than 5 minutes |
| Activation event processing with Skip audit events enabled (6 activations), events incoming through an event stream, and execution strategy set to sequential (default) in the rulebook | 3 actionable events per minute with a minimal payload, with a job template action on the local automation controller where each job completes in 1 minute |