Chapter 14. Troubleshooting the Red Hat Ansible Automation Platform Operator on OpenShift Container Platform

14.1. Understanding automation controller operator logs
Copia collegamento

When the operator deploys an Automation Controller instance, it runs an installer role inside the operator container. If the automation controller’s status is Failed, you must check the automation-controller-operator container logs. These logs provide the installer role’s output and are a critical first step in debugging deployment issues.

14.2. Viewing events in the OpenShift Container Platform
Copia collegamento

You can view events in the OpenShift Container Platform web console to monitor for errors and troubleshoot issues. This helps you quickly diagnose problems by examining the status of custom resources and their related events.

You can debug by first reviewing the status conditions of the Ansible Automation Platform custom resource (CR) and then checking any nested CRs for errors.

Procedure

Log in to the OpenShift Container Platform web console.
In the navigation menu, select Home Events.
Select your project from the project list.
To view events for a specific resource, navigate to that resource’s page. Many resource pages, such as pods and deployments, have their own Events tab.
Select a resource to bring you to the Pod Details page.

Verification

Check the Conditions section on the Pod details page to confirm no errors are listed in the Message column.

14.3. Viewing operator logs
Copia collegamento

The following procedure is an example of how to view the logs for an automation-controller-operator pod.

Procedure

To find the pod name, run:
```
oc get pods | grep operator
```

to view the logs for the pod, run:

oc logs <operator-pod-name> -f

Alternatively, to view the logs without first getting the pod name, run:

oc logs deployments/automation-controller-operator-controller-manager -c automation-controller-manager -f

14.4. Configuring log verbosity
Copia collegamento

You can enable task output for debugging on any custom resources (CRs) by setting no_log to false in the component section of the AnsibleAutomationPlatform CR.

The logs then show output for any failed tasks that originally had no_log set to true. All Ansible Automation Platform components (automation controller, automation hub, and Event-Driven Ansible) support the no_log setting.

Procedure

Edit the Ansible Automation Platform CR and set the no_log field to false for the component you want to debug.

For automation controller:

apiVersion: aap.ansible.com/v1alpha1
kind: AnsibleAutomationPlatform
metadata:
  name: myaap
spec:
  controller:
    no_log: false

For automation hub:

apiVersion: aap.ansible.com/v1alpha1
kind: AnsibleAutomationPlatform
metadata:
  name: myaap
spec:
  hub:
    no_log: false

For Event-Driven Ansible:

apiVersion: aap.ansible.com/v1alpha1
kind: AnsibleAutomationPlatform
metadata:
  name: myaap
spec:
  eda:
    no_log: false

Note

This might expose sensitive data in the logs. On production clusters, this value must generally be set to true unless you are actively debugging an issue.

To increase the Ansible Playbook verbosity from the operator, set the verbosity level using an annotation on the Ansible Automation Platform CR:

apiVersion: aap.ansible.com/v1alpha1
kind: AnsibleAutomationPlatform
metadata:
  name: myaap
  annotations:
    ansible.sdk.operatorframework.io/verbosity: "4"
spec:
  # ... component configuration ...

14.5. Inspecting a OpenShift Container Platform resource
Copia collegamento

To inspect a OpenShift Container Platform resource, you must use the oc command to get a summary or the full YAML definition of the resource.

Procedure

To view a human-readable summary of a resource, run:

oc describe -n <namespace> <resource> <resource-name>

To view the complete YAML definition of a resource, use the -o yaml flag:
```
oc get -n <namespace> <resource> <resource-name> -o yaml
```
- For example, to get the YAML for the automationcontroller custom resource, run:
  oc get -n aap automationcontroller aap -o yaml

14.6. Core Ansible Automation Platform resources
Copia collegamento

The following table lists and describes the core custom resources (CRs) that the Ansible Automation Platform Operator manages. Understanding these resources will help you with advanced troubleshooting and configuration.

Expand

Resource name	Description
`ansibleautomationplatform`	CR for deploying the entire Ansible Automation Platform.
`ansibleautomationplatformbackup`	CR for creating backups of the entire Ansible Automation Platform instance.
`ansibleautomationplatformrestore`	CR for restoring the entire Ansible Automation Platform instance from a backup.
`automationcontroller`	CR defining the desired state of an automation controller instance.
`automationcontrollerbackup`	CR for creating backups of automation controller data and configuration.
`automationcontrollerrestore`	CR for restoring the automation controller from a backup.
`automationhub`	CR for deploying an automation hub (Galaxy) instance.
`automationhubbackup`	CR for creating backups of automation hub data and configuration.
`automationhubrestore`	CR for restoring automation hub from a backup.
`eda`	CR for deploying an Event-Driven Ansible (EDA) instance.
`edabackup`	CR for creating backups of EDA data and configuration.
`edarestore`	CR for restoring EDA from a backup.
`ansiblelightspeed`	CR for deploying an Red Hat Ansible Lightspeed instance.

14.7. Standard Kubernetes resources
Copia collegamento

Standard Kubernetes resources are a core part of the OpenShift Container Platform. The following table describes the standard resources you can inspect to troubleshoot the state and configuration of an application.

Expand

Resource name	Description
`pod`	Smallest deployable unit containing one or more containers running the application workloads.
`deployment`	Manages pod configuration and scaling.
`pvc`	A PersistentVolumeClaim (PVC) is a request for storage resources, used for persistent data storage.
`service`	Exposes pods as network services with stable IP addresses and DNS names within the cluster.
`ingress`	Manages external HTTP and HTTPS access to services within the cluster.
`route`	An OpenShift-specific resource for exposing services externally (similar to an ingress).
`secrets`	Stores sensitive data like passwords, tokens, and certificates.
`serviceaccount`	Provides identity for processes running in pods to access permissions to other Kubernetes resources.

14.8. Discovering custom resource definition configuration parameters
Copia collegamento

The Ansible Automation Platform Operator manages multiple custom resources (CRs), each with its own configuration parameters. Use the oc explain command to discover all available configuration options for the AnsibleAutomationPlatform CR and its nested components.

Procedure

To see all available configuration parameters for a top-level CR, run:
```
oc explain ansibleautomationplatform.spec
```

To view component-specific configuration options nested under the Ansible Automation Platform CR, query them through the component section:

oc explain ansibleautomationplatform.spec.controller.postgres_configuration_secret
oc explain ansibleautomationplatform.spec.controller.route_tls_termination_mechanism
oc explain ansibleautomationplatform.spec.hub.storage_type
oc explain ansibleautomationplatform.spec.eda.automation_server_url

To explore all nested fields for a specific component, use the --recursive flag:
```
oc explain ansibleautomationplatform.spec.controller --recursive
oc explain ansibleautomationplatform.spec.hub --recursive
oc explain ansibleautomationplatform.spec.eda --recursive
```
Note
You can also query individual component CRs directly if needed:
oc explain automationcontroller.spec oc explain automationhub.spec oc explain eda.spec
However, when configuring components through the Ansible Automation Platform CR (recommended approach), use the nested paths shown above.

14.9. Collecting Diagnostic Data
Copia collegamento

Use the oc adm must-gather command to collect comprehensive diagnostic data about your cluster and the Ansible Automation Platform components. This data is essential when contacting Red Hat Support.

Procedure

To start the must-gather tool, run:
```
oc adm must-gather --image=registry.redhat.io/ansible-automation-platform-26/aap-must-gather-rhel9
```
Note
For version 2.6, the base image name changes to registry.redhat.io/ansible-automation-platform-26/aap-must-gather-rhel9.
View the collected data, use the omc tool to query the must-gather tarball as if it were a live cluster.
```
omc use <path-to-must-gather>
omc get pods
```

14.10. Debugging crashing pods
Copia collegamento

If a pod is failing or crashing, use the oc debug command. This command creates a new pod with the same configuration and mounts as the pod you specified, allowing you to access it for debugging.

Procedure

To connect to the pod, run:
```
oc debug <pod-name>
```

14.11. Operator service account error
Copia collegamento

Manually modifying the aap_operator_service_account user in the Ansible Automation Platform database or UI removes the required is_superuser flag. This action causes a critical failure in the platform gateway operator’s reconciliation loop.

You see the following error:

TASK [ansibleautomationplatform : Create operator service account user] … CommandError: Error: That username is already taken

The Ansible Automation Platform operator automatically recreates the service account when the account is missing. To restore the required superuser privileges, you must remove the existing, incorrectly configured user.

After you delete the user, the platform gateway operator automatically runs its idempotency logic, recreates the account, and ensures it has the necessary is_superuser=True flag, restoring the reconciliation loop’s functionality.

Questo contenuto non è disponibile nella lingua selezionata.

14.1. Understanding automation controller operator logs
Copia collegamento

14.2. Viewing events in the OpenShift Container Platform
Copia collegamento

14.3. Viewing operator logs
Copia collegamento

14.4. Configuring log verbosity
Copia collegamento

14.5. Inspecting a OpenShift Container Platform resource
Copia collegamento

14.6. Core Ansible Automation Platform resources
Copia collegamento

14.7. Standard Kubernetes resources
Copia collegamento

14.8. Discovering custom resource definition configuration parameters
Copia collegamento

14.9. Collecting Diagnostic Data
Copia collegamento

14.10. Debugging crashing pods
Copia collegamento

14.11. Operator service account error
Copia collegamento

Formazione

Prova, acquista e vendi

Community

Informazioni sulla documentazione di Red Hat

Rendiamo l’open source più inclusivo

Informazioni su Red Hat

Theme

Red Hat legal and privacy links

Red Hat legal and privacy links

Questo contenuto non è disponibile nella lingua selezionata.

Chapter 14. Troubleshooting the Red Hat Ansible Automation Platform Operator on OpenShift Container Platform

14.1. Understanding automation controller operator logsCopia collegamentoCollegamento copiato negli appunti!

14.2. Viewing events in the OpenShift Container PlatformCopia collegamentoCollegamento copiato negli appunti!

14.3. Viewing operator logsCopia collegamentoCollegamento copiato negli appunti!

14.4. Configuring log verbosityCopia collegamentoCollegamento copiato negli appunti!

14.5. Inspecting a OpenShift Container Platform resourceCopia collegamentoCollegamento copiato negli appunti!

14.6. Core Ansible Automation Platform resourcesCopia collegamentoCollegamento copiato negli appunti!

14.7. Standard Kubernetes resourcesCopia collegamentoCollegamento copiato negli appunti!

14.8. Discovering custom resource definition configuration parametersCopia collegamentoCollegamento copiato negli appunti!

14.9. Collecting Diagnostic DataCopia collegamentoCollegamento copiato negli appunti!

14.10. Debugging crashing podsCopia collegamentoCollegamento copiato negli appunti!

14.11. Operator service account errorCopia collegamentoCollegamento copiato negli appunti!

Formazione

Prova, acquista e vendi

Community

Informazioni sulla documentazione di Red Hat

Rendiamo l’open source più inclusivo

Informazioni su Red Hat

Theme

Red Hat legal and privacy links

Red Hat legal and privacy links

14.1. Understanding automation controller operator logs
Copia collegamento

14.2. Viewing events in the OpenShift Container Platform
Copia collegamento

14.3. Viewing operator logs
Copia collegamento

14.4. Configuring log verbosity
Copia collegamento

14.5. Inspecting a OpenShift Container Platform resource
Copia collegamento

14.6. Core Ansible Automation Platform resources
Copia collegamento

14.7. Standard Kubernetes resources
Copia collegamento

14.8. Discovering custom resource definition configuration parameters
Copia collegamento

14.9. Collecting Diagnostic Data
Copia collegamento

14.10. Debugging crashing pods
Copia collegamento

14.11. Operator service account error
Copia collegamento