Performing a minor update of Red Hat OpenStack Platform
Apply the latest bug fixes and security improvements to Red Hat OpenStack Platform
Abstract
Providing feedback on Red Hat documentation Copy linkLink copied to clipboard!
We appreciate your feedback. Tell us how we can improve the documentation.
To provide documentation feedback for Red Hat OpenStack Platform (RHOSP), create a Jira issue in the OSPRH Jira project.
Procedure
- Log in to the Red Hat Atlassian Jira.
- Click the following link to open a Create Issue page: Create issue
- Complete the Summary and Description fields. In the Description field, include the documentation URL, chapter or section number, and a detailed description of the issue.
- Click Create.
- Review the details of the bug you created.
Chapter 1. Preparing for a minor update Copy linkLink copied to clipboard!
Keep your Red Hat OpenStack Platform (RHOSP) 17.1 environment updated with the latest packages and containers.
Use the upgrade path for the following versions:
| Old RHOSP Version | New RHOSP Version |
|---|---|
| Red Hat OpenStack Platform 17.0.z | Red Hat OpenStack Platform 17.1 latest |
| Red Hat OpenStack Platform 17.1.z | Red Hat OpenStack Platform 17.1 latest |
Minor update workflow
A minor update of your RHOSP environment involves updating the RPM packages and containers on the undercloud and overcloud host, and the service configuration, if needed. The data plane and control plane are fully available during the minor update. You must complete each of the following steps to update your RHOSP environment:
| Update step | Description |
|---|---|
| Undercloud update | Director packages are updated, containers are replaced, and the undercloud is rebooted. |
|
|
All |
| ha-image-update external | Updates container image names of Pacemaker-controlled services. There is no service disruption. This step applies to only customers that are updating their system from version 17.0.z to the latest 17.1 release. |
| Overcloud update of Controller nodes and composable nodes that contain Pacemaker services | During an Overcloud update, the Pacemaker services are stopped for each host. While the Pacemaker services are stopped, the RPMs on the host, the container configuration data, and the containers are updated. When the Pacemaker services restart, the host is added again. |
| Overcloud update of composable nodes without Pacemaker services | Networker, ObjectStorage, BlockStorage, or any other role that does not include Pacemaker services are updated one node at a time. |
| Overcloud update of Compute nodes | Multiple nodes are updated in parallel. The default value for running nodes in parallel is 25. |
| Overcloud update of Ceph nodes | Ceph nodes are updated one node at a time. |
| Ceph cluster update |
Ceph services are updated by using |
If you have a multistack infrastructure, update each overcloud stack completely, one at a time. If you have a distributed compute node (DCN) infrastructure, update the overcloud at the central location completely, and then update the overcloud at each edge site, one at a time.
Additionally, an administrator can perform the following operations during a minor update:
- Migrate your virtual machine
- Create a virtual machine network
- Run additional cloud operations
The following operations are not supported during a minor update:
- Replacing a Controller node
- Scaling in or scaling out any role
Considerations before you update your RHOSP environment
To help guide you during the update process, consider the following information:
- Red Hat recommends backing up the undercloud and overcloud control planes. For more information about backing up nodes, see Backing up and restoring the undercloud and control plane nodes.
- Familiarize yourself with the known issues that might block an update.
- Familiarize yourself with the possible update and upgrade paths before you begin your update. For more information, see Section 1.1, “Upgrade paths for long life releases”.
-
To identify your current maintenance release, run
$ cat /etc/rhosp-release. You can also run this command after updating your environment to validate the update. - If your packages require a reboot after you complete the update, plan a maintenance window to minimize disruption to your workloads. For more information about how to determine whether your packages require a reboot, see the Red Hat Knowledgebase article Identify packages that will require a system reboot after an update.
Updates with a single Controller node are not supported.
Procedure
To prepare your RHOSP environment for the minor update, complete the following procedures:
1.1. Upgrade paths for long life releases Copy linkLink copied to clipboard!
Familiarize yourself with the possible update and upgrade paths before you begin an update.
You can view your current RHOSP and RHEL versions in the /etc/rhosp-release and /etc/redhat-release files.
| Current version | Target version |
|---|---|
| RHOSP 17.0.x on RHEL 9.0 | RHOSP 17.0 latest on RHEL 9.0 latest |
| RHOSP 17.1.x on RHEL 9.2 | RHOSP 17.1 latest on RHEL 9.2 latest |
| Current version | Target version |
|---|---|
| RHOSP 10 on RHEL 7.7 | RHOSP 13 latest on RHEL 7.9 latest |
| RHOSP 13 on RHEL 7.9 | RHOSP 16.1 latest on RHEL 8.2 latest |
| RHOSP 13 on RHEL 7.9 | RHOSP 16.2 latest on RHEL 8.4 latest |
| RHOSP 16 on RHEL 8.4 | RHOSP 17.1 latest on RHEL 9.0 latest |
For more information, see Framework for upgrades (16.2 to 17.1).
1.2. Known issues that might block a minor update Copy linkLink copied to clipboard!
Review the following known issues that might affect a successful update.
During an update from Red Hat OpenStack Platform (RHOSP) 17.1 GA, 17.1.1, and 17.1.2 to RHOSP 17.1.4, when you use an Open Virtual Network (OVN) back end, there is a possibility of a short network API outage during the external run of the OVN update.
If your Red Hat OpenStack Platform (RHOSP) 17.0 environment is deployed with ML2/OVN, you cannot update your environment directly from RHOSP 17.0 to 17.1.4. You must update to RHOSP 17.0.1 first. For more information, see Keeping Red Hat OpenStack Platform Updated.
If your RHOSP 17.1.3 or earlier deployment includes a filter rule in nftables or iptables with a LOG action, and the kernel command line (/proc/cmdline) has console=tty50, logging actions can cause substantial latency in packet transmission. Before updating to 17.1.4, you must apply the workaround in the Red Hat Knowledgebase solution Sometimes receiving packet(e.g. ICMP echo) has latency, around 190(ms).
If you are performing an update of your RHOSP environment to 17.1.x, the pre-update package_version validation fails because the validation cannot find a matching podman version.
Workaround: To skip the package_version validation, use the --skiplist package-version option when you run the pre-update validation:
$ validation run -i ~/overcloud-deploy/<stack>/tripleo-ansible-inventory.yaml --group pre-update --skiplist package-version
-
Replace
<stack>with the name of your stack.
During an update to RHOSP 17.1.4, when you run the pre-update validation, the undercloud-service-status validation fails. The failure occurs because the validation cannot find the undercloud-service-status service.
Workaround: Skip the undercloud-service-status validation when you run the pre-update validation:
$ validation run \
--group pre-update \
--inventory /home/stack/overcloud-deploy/<stack>/config-download/<stack>/tripleo-ansible-inventory.yaml
--skiplist undercloud-service-status
-
Replace
<stack>with the name of your stack.
1.3. Repository adjustments Copy linkLink copied to clipboard!
You must adjust which repositories you enabled based on the current phase of the Red Hat Enterprise Linux Life Cycle. You should select Enhanced Extended Update Service (EEUS) if it is available in your entitlements to minimize the need for future adjustments. If you do not have the EEUS repositories, then select the Extended Update Support (EUS) repositories. If your entitlements do not currently provide the right repositories, contact Red Hat support.
Depending on your Red Hat OpenStack Platform subscription entitlements, you have one of the following repository types:
RHEL 8 or Multi-RHEL environments:
Telecommunications Update Service (TUS)
-
rhel-8-for-x86_64-baseos-tus-rpms -
rhel-8-for-x86_64-appstream-tus-rpms -
rhel-8-for-x86_64-highavailability-tus-rpms
-
Advanced Update Stream (AUS)
-
rhel-8-for-x86_64-baseos-aus-rpms -
rhel-8-for-x86_64-appstream-aus-rpms -
rhel-8-for-x86_64-highavailability-aus-rpms
-
RHEL 9 environments:
Extended Update Support (EUS)
-
rhel-9-for-x86_64-baseos-eus-rpms -
rhel-9-for-x86_64-appstream-eus-rpms -
rhel-9-for-x86_64-highavailability-eus-rpms
-
Enhanced Extended Update Service (EEUS)
-
rhel-9-for-x86_64-baseos-e4s-rpms -
rhel-9-for-x86_64-appstream-e4s-rpms -
rhel-9-for-x86_64-highavailability-e4s-rpms
-
Advanced Update Stream (AUS)
-
rhel-9-for-x86_64-baseos-aus-rpms -
rhel-9-for-x86_64-appstream-aus-rpms -
rhel-9-for-x86_64-highavailability-aus-rpms
-
To learn more about the repositories that are included with your subscription entitlement, see Red Hat Enterprise Linux Life Cycle.
1.4. Locking the environment to a Red Hat Enterprise Linux release Copy linkLink copied to clipboard!
Red Hat OpenStack Platform (RHOSP) 17.1 is supported on Red Hat Enterprise Linux (RHEL) 9.2. Before you perform the update, lock the undercloud and overcloud repositories to the RHEL 9.2 release to avoid upgrading the operating system to a newer minor release.
Procedure
-
Log in to the undercloud host as the
stackuser. Source the
stackrcundercloud credentials file:$ source ~/stackrc-
Edit your overcloud subscription management environment file, which is the file that contains the
RhsmVarsparameter. The default name for this file is usuallyrhsm.yml. Check if your subscription management configuration includes the
rhsm_releaseparameter. If therhsm_releaseparameter is not present, add it and set it to 9.2:parameter_defaults: RhsmVars: … rhsm_username: "myusername" rhsm_password: "p@55w0rd!" rhsm_org_id: "1234567" rhsm_pool_ids: "1a85f9223e3d5e43013e3d6e8ff506fd" rhsm_method: "portal" rhsm_release: "9.2"- Save the overcloud subscription management environment file.
Create a playbook that contains a task to lock the operating system version to RHEL 9.2 on all nodes:
$ cat > ~/set_release.yaml <<'EOF' - hosts: all gather_facts: false tasks: - name: set release to 9.2 command: subscription-manager release --set=9.2 become: true EOFRun the
set_release.yamlplaybook:$ ansible-playbook -i ~/overcloud-deploy/<stack>/tripleo-ansible-inventory.yaml -f 25 ~/set_release.yaml --limit <undercloud>, <Controller>, <Compute>-
Replace
<stack>with the name of your stack. -
Use the
--limitoption to apply the content to all RHOSP nodes. Replace <undercloud>, <Controller>, <Compute> with the Ansible groups in your environment that contain those nodes. Do not run this playbook against Ceph Storage nodes because you might have a different subscription for these nodes.
-
Replace
To manually lock a node to a version, log in to the node and run the subscription-manager release command:
$ sudo subscription-manager release --set=9.2
1.5. Updating Red Hat Openstack Platform repositories Copy linkLink copied to clipboard!
Update your repositories to use Red Hat OpenStack Platform (RHOSP) 17.1.
Procedure
-
Log in to the undercloud host as the
stackuser. Source the
stackrcundercloud credentials file:$ source ~/stackrc-
Edit your overcloud subscription management environment file, which is the file that contains the
RhsmVarsparameter. The default name for this file is usuallyrhsm.yml. Check the
rhsm_reposparameter in your subscription management configuration. If therhsm_reposparameter is using the RHOSP 17.1 repositories, change the repository to the correct versions:parameter_defaults: RhsmVars: rhsm_repos: - rhel-9-for-x86_64-baseos-e4s-rpms - rhel-9-for-x86_64-appstream-e4s-rpms - rhel-9-for-x86_64-highavailability-e4s-rpms - openstack-17.1-for-rhel-9-x86_64-rpms - fast-datapath-for-rhel-9-x86_64-rpms- Save the overcloud subscription management environment file.
Create a playbook that contains a task to set the repositories to RHOSP 17.1 on all nodes:
$ cat > ~/update_rhosp_repos.yaml <<'EOF' - hosts: all gather_facts: false tasks: - name: change osp repos command: subscription-manager repos --enable=openstack-17.1-for-rhel-9-x86_64-rpms become: true EOFRun the
update_rhosp_repos.yamlplaybook:$ ansible-playbook -i ~/overcloud-deploy/<stack>/tripleo-ansible-inventory.yaml -f 25 ~/update_rhosp_repos.yaml --limit <undercloud>,<Controller>,<Compute>-
Replace
<stack>with the name of your stack. -
Use the
--limitoption to apply the content to all RHOSP nodes. Replace <undercloud>, <Controller>, and <Compute> with the Ansible groups in your environment that contain those nodes. Do not run this playbook against Ceph Storage nodes because they usually use a different subscription.
-
Replace
Create a playbook that contains a task to set the repositories to RHOSP 17.1 on all ceph storage nodes:
$ cat > ~/update_ceph_repos.yaml <<'EOF' - hosts: all gather_facts: false tasks: - name: change ceph repos command: subscription-manager repos --enable=openstack-17.1-deployment-tools-for-rhel-9-x86_64-rpms become: true EOFRun the
update_ceph_repos.yamlplaybook:$ ansible-playbook -i ~/overcloud-deploy/<stack>/tripleo-ansible-inventory.yaml -f 25 ~/update_ceph_repos.yaml --limit CephStorageUse the
--limitoption to apply the content to Ceph Storage nodes.
1.6. Updating the container image preparation file Copy linkLink copied to clipboard!
The container preparation file is the file that contains the ContainerImagePrepare parameter. You use this file to define the rules for obtaining container images for the undercloud and overcloud.
Before you update your environment, check the file to ensure that you obtain the correct image versions.
Procedure
-
Edit the container preparation file. The default name for this file is usually
containers-prepare-parameter.yaml. Ensure that the
tagparameter is set to17.1for each rule set:parameter_defaults: ContainerImagePrepare: - push_destination: true set: ... tag: '17.1' tag_from_label: '{version}-{release}'NoteIf you do not want to use a specific tag for the update, such as
17.1or17.1.1, remove thetagkey-value pair and specifytag_from_labelonly. This uses the installed Red Hat OpenStack Platform version to determine the value for the tag to use as part of the update process. For more information about version tagging, see Guidelines for container image tagging in Customizing your Red Hat OpenStack Platform deployment.- Save this file.
1.7. Disabling fencing in the overcloud Copy linkLink copied to clipboard!
Before you update the overcloud, ensure that fencing is disabled.
If fencing is deployed in your environment during the Controller nodes update process, the overcloud might detect certain nodes as disabled and attempt fencing operations, which can cause unintended results.
If you have enabled fencing in the overcloud, you must temporarily disable fencing for the duration of the update.
Procedure
-
Log in to the undercloud host as the
stackuser. Source the
stackrcundercloud credentials file:$ source ~/stackrcFor each Controller node, log in to the Controller node and run the Pacemaker command to disable fencing:
$ ssh tripleo-admin@<controller_ip> "sudo pcs property set stonith-enabled=false"-
Replace
<controller_ip>with the IP address of a Controller node. You can find the IP addresses of your Controller nodes at/etc/hostsor/var/lib/mistral.
-
Replace
If you use SBD fencing, disable SBD fencing on the
pacemaker_remotenodes:# pcs property set stonith-watchdog-timeout=0 --forceNoteEnsure that you take note of the original value of the watchdog timer device interval. You must reset the watchdog timer device interval to its original value after you upgrade the control plane nodes. For more information, see Re-enabling fencing in the overcloud.
-
In the
fencing.yamlenvironment file, set theEnableFencingparameter tofalseto ensure that fencing stays disabled during the update process.
Additional Resources
1.8. Firewall rule change Copy linkLink copied to clipboard!
In Red Hat OpenStack Platform (RHOSP) 17.1.4, iptables rules were replaced with nftables rules. If your RHOSP templates include firewall rules, for example, tripleo::tripleo_firewall::firewall_rules, you must redefine them by using the ExtraFirewallRules parameter. For more information about using the ExtraFirewallRules parameter, see Adding services to the overcloud firewall in Hardening Red Hat OpenStack Platform.
Chapter 2. Updating the undercloud Copy linkLink copied to clipboard!
You can use director to update the main packages on the undercloud node. To update the undercloud and its overcloud images to the latest Red Hat OpenStack Platform (RHOSP) 17.1 version, complete the following procedures:
Prerequisites
- Before you can update the undercloud to the latest RHOSP 17.1 version, ensure that you complete all the update preparation procedures. For more information, see Chapter 1, Preparing for a minor update
2.1. Validating RHOSP before the undercloud update Copy linkLink copied to clipboard!
Before you update your Red Hat OpenStack Platform (RHOSP) environment, validate your undercloud with the tripleo-validations playbooks.
For more information about validations, see Using the validation framework in Installing and managing Red Hat OpenStack Platform with director.
Procedure
-
Log in to the undercloud host as the
stackuser. Source the
stackrcundercloud credentials file:$ source ~/stackrcInstall the packages for validation:
$ sudo dnf -y update openstack-tripleo-validations python3-validations-libs validations-commonRun the validation:
$ validation run -i ~/overcloud-deploy/<stack>/config-download/<stack>/tripleo-ansible-inventory.yaml --group pre-update- Replace <stack> with the name of the stack.
Verification
- To view the results of the validation report, see Viewing validation history in Installing and managing Red Hat OpenStack Platform with director.
If a host is not found when you run a validation, the command reports the status as SKIPPED. A status of SKIPPED means that the validation is not executed, which is expected. Additionally, if a validation’s pass criteria is not met, the command reports the status as FAILED. A FAILED validation does not prevent you from using your updated RHOSP environment. However, a FAILED validation can indicate an issue with your environment.
2.2. Validating your SSH key size Copy linkLink copied to clipboard!
Starting with Red Hat Enterprise Linux (RHEL) 9.1, a minimum SSH key size of 2048 bits is required. If your current SSH key on Red Hat OpenStack Platform (RHOSP) director is less than 2048 bits, you can lose access to the overcloud. You must verify that your SSH key meets the required bit size.
Procedure
Validate your SSH key size:
ssh-keygen -l -f ~/.ssh/id_rsa.pubExample output:
1024 SHA256:Xqz0Xz0/aJua6B3qRD7VsLr6n/V3zhmnGSkcFR6FlJw stack@director.example.local (RSA)- If your SSH key is less than 2048 bits, you must rotate out the SSH key before continuing. For more information, see Updating SSH keys in your OpenStack environment in Hardening Red Hat OpenStack Platform.
2.3. Performing a minor update of a containerized undercloud Copy linkLink copied to clipboard!
Director provides commands to update the main packages on the undercloud node. Use director to perform a minor update within the current version of your RHOSP environment.
Procedure
-
Log in to the undercloud host as the
stackuser. Source the
stackrcundercloud credentials file:$ source ~/stackrcUpdate the director main packages with the
dnf updatecommand:$ sudo dnf update -y python3-tripleoclient ansible-*Update the undercloud environment:
$ openstack undercloud upgrade- Wait until the undercloud update process completes.
Reboot the undercloud to update the operating system’s kernel and other system packages:
$ sudo reboot- Wait until the node boots.
2.4. Updating the overcloud images Copy linkLink copied to clipboard!
You must replace your current overcloud images with new versions to ensure that director can introspect and provision your nodes with the latest version of the RHOSP software.
Prerequisites
- You have updated the undercloud node to the latest version. For more information, see Section 2.3, “Performing a minor update of a containerized undercloud”.
-
You have installed the
rhosp-director-images-uefi-x86_64package. For more information about installing packages, see Installing the overcloud images in the Installing and managing Red Hat OpenStack Platform with director.
Procedure
-
Log in to the undercloud host as the
stackuser. Source the
stackrcundercloud credentials file:$ source ~/stackrcRemove any existing images from the
imagesdirectory on thestackuser’s home (/home/stack/images):$ rm -rf ~/images/*Extract the archives:
cd ~/images for i in /usr/share/rhosp-director-images/ironic-python-agent-latest-17.1.tar /usr/share/rhosp-director-images/overcloud-hardened-uefi-full-latest-17.1.tar; do tar -xvf $i; done cd ~Import the latest images into the director:
$ openstack overcloud image upload --update-existing --image-path /home/stack/images/Configure your nodes to use the new images:
$ openstack overcloud node configure $(openstack baremetal node list -c UUID -f value)Verify the existence of the new images:
$ ls -l /var/lib/ironic/httpboot /var/lib/ironic/images
- When you deploy overcloud nodes, ensure that the overcloud image version corresponds to the respective heat template version. For example, use only the RHOSP 17.1 images with the RHOSP 17.1 heat templates.
-
If you deployed a connected environment that uses the Red Hat Customer Portal or Red Hat Satellite Server, the overcloud image and package repository versions might be out of sync. To ensure that the overcloud image and package repository versions match, you can use the
virt-customizetool. For more information, see the Red Hat Knowledgebase solution Modifying the Red Hat Linux OpenStack Platform Overcloud Image with virt-customize. -
The new
overcloud-fullimage replaces the oldovercloud-fullimage. If you made changes to the old image, you must repeat the changes in the new image, especially if you want to deploy new nodes in the future.
Chapter 3. Updating the overcloud Copy linkLink copied to clipboard!
After you update the undercloud, you can update the overcloud by running the overcloud and container image preparation commands, and updating your nodes. The control plane API is fully available during a minor update.
Prerequisites
- You have updated the undercloud node to the latest version. For more information, see Chapter 2, Updating the undercloud.
-
If you use a local set of core templates in your
stackuser home directory, ensure that you update the templates and use the recommended workflow in Understanding heat templates in the Customizing your Red Hat OpenStack Platform deployment guide. You must update the local copy before you upgrade the overcloud. Add the
GlanceApiInternalservice to your Controller role:OS::TripleO::Services::GlanceApiInternalThis is the service for the internal instance of the Image service (glance) API to provide location data to administrators and other services that require it, such as the Block Storage service (cinder) and the Compute service (nova).
-
Ensure that your environment includes the
/home/stack/overcloud-deploy/${STACK}/${STACK}-passwords.yamlfile. If this file does not exist, create the file by following the instructions in the Knowledgebase article Create a missing overcloud password file.
Procedure
To update the overcloud, you must complete the following procedures:
- Section 3.1, “Running the overcloud update preparation”
- Section 3.2, “Running the container image preparation”
- Section 3.3, “Updating the ovn-controller container on all overcloud servers”
- Section 3.4, “Updating the container image names of Pacemaker-controlled services”
- Section 3.5, “Updating all Controller nodes”
- Section 3.7, “Updating all Compute nodes”
- Section 3.8, “Updating all HCI Compute nodes”
- Section 3.9, “Updating all DistributedComputeHCI nodes”
- Section 3.10, “Updating all Ceph Storage nodes”
- Section 3.11, “Updating the Red Hat Ceph Storage cluster”
- Section 3.13, “Performing online database updates”
- Section 3.14, “Re-enabling fencing in the overcloud”
3.1. Running the overcloud update preparation Copy linkLink copied to clipboard!
To prepare the overcloud for the update process, you must run the openstack overcloud update prepare command, which updates the overcloud plan to Red Hat OpenStack Platform (RHOSP) 17.1 and prepares the nodes for the update.
Prerequisites
-
If you use a Ceph subscription and have configured director to use the
overcloud-minimalimage for Ceph storage nodes, you must ensure that in theroles_data.yamlrole definition file, therhsm_enforceparameter is set toFalse. -
If you rendered custom NIC templates, you must regenerate the templates with the updated version of the
openstack-tripleo-heat-templatescollection to avoid incompatibility with the overcloud version. For more information about custom NIC templates, see Defining custom network interface templates in the Customizing your Red Hat OpenStack Platform deployment guide.
For distributed compute node (edge) architectures with OVN deployments, you must complete this procedure for each stack with Compute, DistributedCompute, or DistributedComputeHCI nodes before proceeding with section Updating the ovn-controller container on all overcloud servers.
Procedure
-
Log in to the undercloud host as the
stackuser. Source the
stackrcundercloud credentials file:$ source ~/stackrcRun the update preparation command:
$ openstack overcloud update prepare \ --templates \ --stack <stack_name> \ -r <roles_data_file> \ -n <network_data_file> \ -e <environment_file> \ -e <environment_file> \ ...Include the following options relevant to your environment:
-
Include the
/usr/share/openstack-tripleo-heat-templates/environments/rhsm.yamlfile with the-eoption to apply the RHOSP 17.1 repositories. -
If the name of your overcloud stack is different to the default name
overcloud, include the--stackoption in the update preparation command and replace<stack_name>with the name of your stack. -
If you use your own custom roles, use the
-roption to include the custom roles (<roles_data_file>) file. -
If you use custom networks, use the
-noption to include your composable network in the (<network_data_file>) file. -
If you deploy a high availability cluster, include the
--ntp-serveroption in the update preparation command, or include theNtpServerparameter and value in your environment file. -
Include any custom configuration environment files with the
-eoption.
-
Include the
- Wait until the update preparation process completes.
3.2. Running the container image preparation Copy linkLink copied to clipboard!
Before you can update the overcloud, you must prepare all container image configurations that are required for your environment and pull the latest RHOSP 17.1 container images to your undercloud.
To complete the container image preparation, you must run the openstack overcloud external-update run command against tasks that have the container_image_prepare tag.
Procedure
-
Log in to the undercloud host as the
stackuser. Source the
stackrcundercloud credentials file:$ source ~/stackrcRun the
openstack overcloud external-update runcommand against tasks that have thecontainer_image_preparetag:$ openstack overcloud external-update run --stack <stack_name> --tags container_image_prepare-
If the name of your overcloud stack is different from the default stack name
overcloud, set your stack name with the--stackoption and replace<stack_name>with the name of your stack.
-
If the name of your overcloud stack is different from the default stack name
3.3. Updating the ovn-controller container on all overcloud servers Copy linkLink copied to clipboard!
If you deployed your overcloud with the Modular Layer 2 Open Virtual Network mechanism driver (ML2/OVN), update the ovn-controller container to the latest RHOSP 17.1 version. The update occurs on every overcloud server that runs the ovn-controller container.
-
The following procedure updates the
ovn-controllercontainers on servers that are assigned the Compute role before it updates the ovn-northd service on servers that are assigned the Controller role. - For distributed compute node (edge) architectures, you must complete this procedure for each stack with Compute, DistributedCompute, or DistributedComputeHCI nodes before proceeding with section Updating all Controller nodes.
If you accidentally updated the ovn-northd service before following this procedure, you might not be able to connect to your virtual machines or create new virtual machines or virtual networks. The following procedure restores connectivity.
Procedure
-
Log in to the undercloud host as the
stackuser. Source the
stackrcundercloud credentials file:$ source ~/stackrcRun the
openstack overcloud external-update runcommand against the tasks that have the ovn tag:$ openstack overcloud external-update run --stack <stack_name> --tags ovn-
If the name of your overcloud stack is different from the default stack name
overcloud, set your stack name with the--stackoption and replace<stack_name>with the name of your stack.
-
If the name of your overcloud stack is different from the default stack name
-
Wait until the
ovn-controllercontainer update completes.
3.4. Updating the container image names of Pacemaker-controlled services Copy linkLink copied to clipboard!
If you update your system from Red Hat Openstack Platform (RHOSP) 17 to RHOSP 17.1, you must update the container image names of the Pacemaker-controlled services. You must perform this update to migrate to the new image naming schema of the Pacemaker-controlled services.
If you update your system from a version of RHOSP 17.1 to the latest version of RHOSP 17.1, you do not need to update the container image names of the pacemaker-controlled services.
Procedure
- Log in to the undercloud host as the stack user.
Source the stackrc undercloud credentials file:
$ source ~/stackrcRun the
openstack overcloud external-update runcommand with theha_image_updatetag:$ openstack overcloud external-update run --stack <stack_name> --tags ha_image_update- If the name of your undercloud stack is different from the default stack name undercloud, set your stack name with the --stack option and replace <stack_name> with the name of your stack.
3.5. Updating all Controller nodes Copy linkLink copied to clipboard!
Update all the Controller nodes to the latest RHOSP 17.1 version. Run the openstack overcloud update run command and include the --limit Controller option to restrict operations to the Controller nodes only. The control plane API is fully available during the minor update.
Procedure
-
Log in to the undercloud host as the
stackuser. Source the
stackrcundercloud credentials file:$ source ~/stackrcRun the update command:
$ openstack overcloud update run --stack <stack_name> --limit Controller-
If the name of your overcloud stack is different from the default stack name
overcloud, set your stack name with the--stackoption and replace<stack_name>with the name of your stack.
-
If the name of your overcloud stack is different from the default stack name
- Wait until the Controller node update completes.
3.6. Updating composable roles with non-Pacemaker services Copy linkLink copied to clipboard!
Update composable roles with non-Pacemaker services to the latest RHOSP 17.1 version. Update the nodes in each composable role one at a time.
Procedure
-
Log in to the undercloud host as the
stackuser. Source the
stackrcundercloud credentials file:$ source ~/stackrcRun the update command:
$ openstack overcloud update run --stack <stack_name> --limit <non_pcs_role_0> $ openstack overcloud update run --stack <stack_name> --limit <non_pcs_role_1> $ openstack overcloud update run --stack <stack_name> --limit <non_pcs_role_2>-
If the name of your overcloud stack is different from the default stack name
overcloud, set your stack name with the--stackoption and replace<stack_name>with the name of your stack. -
Replace
<non_pcs_role_0>,<non_pcs_role_1>, and<non_pcs_role_2>with the names of your composable roles with non-Pacemaker services.
-
If the name of your overcloud stack is different from the default stack name
- Wait until the update completes.
3.7. Updating all Compute nodes Copy linkLink copied to clipboard!
Update all Compute nodes to the latest RHOSP 17.1 version. To update Compute nodes, run the openstack overcloud update run command and include the --limit Compute option to restrict operations to the Compute nodes only.
- Parallelization considerations
When you update a large number of Compute nodes, to improve performance, you can run multiple update tasks in the background and configure each task to update a separate group of 20 nodes. For example, if you have 80 Compute nodes in your deployment, you can run the following commands to update the Compute nodes in parallel:
$ openstack overcloud update run -y --limit 'Compute[0:19]' > update-compute-0-19.log 2>&1 & $ openstack overcloud update run -y --limit 'Compute[20:39]' > update-compute-20-39.log 2>&1 & $ openstack overcloud update run -y --limit 'Compute[40:59]' > update-compute-40-59.log 2>&1 & $ openstack overcloud update run -y --limit 'Compute[60:79]' > update-compute-60-79.log 2>&1 &This method of partitioning the nodes space is random and you do not have control over which nodes are updated. The selection of nodes is based on the inventory file that you generate when you run the
tripleo-ansible-inventorycommand.To update specific Compute nodes, list the nodes that you want to update in a batch separated by a comma:
$ openstack overcloud update run --limit <Compute0>,<Compute1>,<Compute2>,<Compute3>
Procedure
-
Log in to the undercloud host as the
stackuser. Source the
stackrcundercloud credentials file:$ source ~/stackrcRun the update command:
$ openstack overcloud update run --stack <stack_name> --limit Compute-
If the name of your overcloud stack is different from the default stack name
overcloud, set your stack name with the--stackoption and replace<stack_name>with the name of your stack.
-
If the name of your overcloud stack is different from the default stack name
- Wait until the Compute node update completes.
3.8. Updating all HCI Compute nodes Copy linkLink copied to clipboard!
Update the Hyperconverged Infrastructure (HCI) Compute nodes to the latest RHOSP 17.1 version.
Prerequisites
On a Ceph Monitor or Controller node that is running the
ceph-monservice, check that the Red Hat Ceph Storage cluster status is healthy and the pg status isactive+clean:$ sudo cephadm shell -- ceph statusIf the Ceph cluster is healthy, it returns a status of
HEALTH_OK.If the Ceph cluster status is unhealthy, it returns a status of
HEALTH_WARNorHEALTH_ERR. For troubleshooting guidance, see the Red Hat Ceph Storage 5 Troubleshooting Guide or the Red Hat Ceph Storage 6 Troubleshooting Guide
Procedure
-
Log in to the undercloud host as the
stackuser. Source the
stackrcundercloud credentials file:$ source ~/stackrcRun the update command:
$ openstack overcloud update run --stack <stack_name> --limit ComputeHCI-
If the name of your overcloud stack is different from the default stack name
overcloud, set your stack name with the--stackoption and replace<stack_name>with the name of your stack.
-
If the name of your overcloud stack is different from the default stack name
- Wait until the node update completes.
3.9. Updating all DistributedComputeHCI nodes Copy linkLink copied to clipboard!
Update roles specific to distributed compute node architecture. When you upgrade distributed compute nodes, update DistributedComputeHCI nodes first, and then update DistributedComputeHCIScaleOut nodes.
Prerequisites
On a Ceph Monitor or Controller node that is running the
ceph-monservice, check that the Red Hat Ceph Storage cluster status is healthy and the pg status isactive+clean:$ sudo cephadm shell -- ceph statusIf the Ceph cluster is healthy, it returns a status of
HEALTH_OK.If the Ceph cluster status is unhealthy, it returns a status of
HEALTH_WARNorHEALTH_ERR. For troubleshooting guidance, see the Red Hat Ceph Storage 5 Troubleshooting Guide or Red Hat Ceph Storage 6 Troubleshooting Guide.
Procedure
-
Log in to the undercloud host as the
stackuser. Source the
stackrcundercloud credentials file:$ source ~/stackrcRun the update command:
$ openstack overcloud update run --stack <stack_name> --limit DistributedComputeHCI-
If the name of your overcloud stack is different from the default stack name
overcloud, set your stack name with the--stackoption and replace<stack_name>with the name of your stack.
-
If the name of your overcloud stack is different from the default stack name
-
Wait until the
DistributedComputeHCInode update completes. -
Use the same process to update
DistributedComputeHCIScaleOutnodes.
3.10. Updating all Ceph Storage nodes Copy linkLink copied to clipboard!
Update the Red Hat Ceph Storage nodes to the latest RHOSP 17.1 version.
RHOSP 17.1 is supported on RHEL 9.2. However, hosts that are mapped to the Ceph Storage role update to the latest major RHEL release. For more information, see Red Hat Ceph Storage: Supported configurations.
Prerequisites
On a Ceph Monitor or Controller node that is running the
ceph-monservice, check that the Red Hat Ceph Storage cluster status is healthy and the pg status isactive+clean:$ sudo cephadm shell -- ceph statusIf the Ceph cluster is healthy, it returns a status of
HEALTH_OK.If the Ceph cluster status is unhealthy, it returns a status of
HEALTH_WARNorHEALTH_ERR. For troubleshooting guidance, see the Red Hat Ceph Storage 5 Troubleshooting Guide or the Red Hat Ceph Storage 6 Troubleshooting Guide.
Procedure
-
Log in to the undercloud host as the
stackuser. Source the
stackrcundercloud credentials file:$ source ~/stackrcRun the update command:
$ openstack overcloud update run --stack <stack_name> --limit CephStorage-
If the name of your overcloud stack is different from the default stack name
overcloud, set your stack name with the--stackoption and replace<stack_name>with the name of your stack.
-
If the name of your overcloud stack is different from the default stack name
- Wait until the node update completes.
3.11. Updating the Red Hat Ceph Storage cluster Copy linkLink copied to clipboard!
Update the director-deployed Red Hat Ceph Storage cluster to the latest version that is compatible with Red Hat OpenStack Platform (RHOSP) 17.1 by using the cephadm command.
Update your Red Hat Ceph Storage cluster if one of the following scenarios applies to your environment:
- If you upgraded from RHOSP 16.2 to RHOSP 17.1, you run Red Hat Ceph Storage 5, and you are updating to a newer version of Red Hat Ceph Storage 5.
- If you newly deployed RHOSP 17.1, you run Red Hat Ceph Storage 6, and you are updating to a newer version of Red Hat Ceph Storage 6.
Prerequisites
- Complete the container image preparation in Section 3.2, “Running the container image preparation”.
Procedure
- Log in to a Controller node.
Check the health of the cluster:
$ sudo cephadm shell -- ceph healthNoteIf the Ceph Storage cluster is healthy, the command returns a result of
HEALTH_OK. If the command returns a different result, review the status of the cluster and contact Red Hat support before continuing the update. For more information, see Upgrade a Red Hat Ceph Storage cluster using cephadm in the Red Hat Ceph Storage Upgrade Guide or Upgrade a Red Hat Ceph Storage cluster using cephadm in the Red Hat Ceph Storage 6 Upgrade Guide.Optional: Check which images should be included in the Ceph Storage cluster update:
$ openstack tripleo container image list -f value | awk -F '//' '/ceph/ {print $2}'Update the cluster to the latest Red Hat Ceph Storage version:
$ sudo cephadm shell -- ceph orch upgrade start --image <image_name>: <version>-
Replace
<image_name>with the name of the Ceph Storage cluster image. -
Replace
<version>with the target version to which you are updating the Ceph Storage cluster.
-
Replace
Wait until the Ceph Storage container update completes. To monitor the status of the update, run the following command:
sudo cephadm shell -- ceph orch upgrade status
3.12. Upgrading to Red Hat Ceph Storage 7 Copy linkLink copied to clipboard!
Red Hat Ceph Storage 6 is deployed by default with Red Hat OpenStack Platform 17.1. When deployment is complete, Red Hat Ceph Storage can be upgraded to release 7. For information on this process, and procedures to complete the upgrade, see the section Director-deployed Red Hat Ceph Storage environments in the Upgrading Red Hat Ceph Storage 6 to 7 chapter of Framework for upgrades (16.2 to 17.1).
3.13. Performing online database updates Copy linkLink copied to clipboard!
Some overcloud components require an online update or migration of their databases tables. To perform online database updates, run the openstack overcloud external-update run command against tasks that have the online_upgrade tag.
Online database updates apply to the following components:
- OpenStack Block Storage (cinder)
- OpenStack Compute (nova)
Procedure
-
Log in to the undercloud host as the
stackuser. Source the
stackrcundercloud credentials file:$ source ~/stackrcRun the
openstack overcloud external-update runcommand against tasks that use theonline_upgradetag:$ openstack overcloud external-update run --stack <stack_name> --tags online_upgrade
3.14. Re-enabling fencing in the overcloud Copy linkLink copied to clipboard!
Before you updated the overcloud, you disabled fencing in Disabling fencing in the overcloud. After you update the overcloud, re-enable fencing to protect your data if a node fails.
Procedure
-
Log in to the undercloud host as the
stackuser. Source the
stackrcundercloud credentials file:$ source ~/stackrcLog in to a Controller node and run the Pacemaker command to re-enable fencing:
$ ssh tripleo-admin@<controller_ip> "sudo pcs property set stonith-enabled=true"-
Replace
<controller_ip>with the IP address of a Controller node. You can find the IP addresses of your Controller nodes with theopenstack server listcommand.
-
Replace
If you use SBD fencing, reset the watchdog timer device interval to its original value before you disabled it:
# pcs property set stonith-watchdog-timeout=<interval>-
Replace
<interval>with the original value of the watchdog timer device, for example,10.
-
Replace
-
In the
fencing.yamlenvironment file, set theEnableFencingparameter totrue.
Additional Resources
Chapter 4. Rebooting the overcloud Copy linkLink copied to clipboard!
After you perform a minor Red Hat OpenStack Platform (RHOSP) update to the latest 17.1 version, check whether your packages require a system reboot by following the instructions in Identify packages that will require a system reboot after an update. If your packages do not require a reboot, continue to the "Validating RHOSP after the overcloud update" procedure at the end of this chapter.
The reboot refreshes the nodes with any associated kernel, system-level, and container component updates. These updates provide performance and security benefits. Plan a maintenance window to perform the reboot procedures.
Use the following guidance to understand how to reboot different node types:
- If you reboot all nodes in one role, reboot each node individually. If you reboot all nodes in a role simultaneously, service downtime can occur during the reboot operation.
Complete the reboot procedures on the nodes in the following order:
4.1. Rebooting Controller and composable nodes Copy linkLink copied to clipboard!
Reboot Controller nodes and standalone nodes based on composable roles, and exclude Compute nodes and Ceph Storage nodes.
Procedure
- Log in to the node that you want to reboot.
Optional: If the node uses Pacemaker resources, stop the cluster:
[tripleo-admin@overcloud-controller-0 ~]$ sudo pcs cluster stopReboot the node:
[tripleo-admin@overcloud-controller-0 ~]$ sudo reboot- Wait until the node boots.
Verification
Verify that the services are enabled.
If the node uses Pacemaker services, check that the node has rejoined the cluster:
[tripleo-admin@overcloud-controller-0 ~]$ sudo pcs statusIf the node uses Systemd services, check that all services are enabled:
[tripleo-admin@overcloud-controller-0 ~]$ sudo systemctl statusIf the node uses containerized services, check that all containers on the node are active:
[tripleo-admin@overcloud-controller-0 ~]$ sudo podman ps
4.2. Rebooting a Ceph Storage (OSD) cluster Copy linkLink copied to clipboard!
Complete the following steps to reboot a cluster of Ceph Storage (OSD) nodes.
Prerequisites
On a Ceph Monitor or Controller node that is running the
ceph-monservice, check that the Red Hat Ceph Storage cluster status is healthy and the pg status isactive+clean:$ sudo cephadm shell -- ceph statusIf the Ceph cluster is healthy, it returns a status of
HEALTH_OK.If the Ceph cluster status is unhealthy, it returns a status of
HEALTH_WARNorHEALTH_ERR. For troubleshooting guidance, see the Red Hat Ceph Storage 5 Troubleshooting Guide or the Red Hat Ceph Storage 6 Troubleshooting Guide.
Procedure
Log in to a Ceph Monitor or Controller node that is running the
ceph-monservice, and disable Ceph Storage cluster rebalancing temporarily:$ sudo cephadm shell -- ceph osd set noout $ sudo cephadm shell -- ceph osd set norebalanceNoteIf you have a multistack or distributed compute node (DCN) architecture, you must specify the Ceph cluster name when you set the
nooutandnorebalanceflags. For example:sudo cephadm shell -c /etc/ceph/<cluster>.conf -k /etc/ceph/<cluster>.client.keyring.- Select the first Ceph Storage node that you want to reboot and log in to the node.
Reboot the node:
$ sudo reboot- Wait until the node boots.
Log in to the node and check the Ceph cluster status:
$ sudo cephadm shell -- ceph statusCheck that the
pgmapreports allpgsas normal (active+clean).- Log out of the node, reboot the next node, and check its status. Repeat this process until you have rebooted all Ceph Storage nodes.
When complete, log in to a Ceph Monitor or Controller node that is running the
ceph-monservice and enable Ceph cluster rebalancing:$ sudo cephadm shell -- ceph osd unset noout $ sudo cephadm shell -- ceph osd unset norebalanceNoteIf you have a multistack or distributed compute node (DCN) architecture, you must specify the Ceph cluster name when you unset the
nooutandnorebalanceflags. For example:sudo cephadm shell -c /etc/ceph/<cluster>.conf -k /etc/ceph/<cluster>.client.keyringPerform a final status check to verify that the cluster reports
HEALTH_OK:$ sudo cephadm shell ceph status
4.3. Rebooting Compute nodes Copy linkLink copied to clipboard!
To ensure minimal downtime of instances in your Red Hat OpenStack Platform environment, the Migrating instances workflow outlines the steps you must complete to migrate instances from the Compute node that you want to reboot.
Migrating instances workflow
- Decide whether to migrate instances to another Compute node before rebooting the node.
- Select and disable the Compute node that you want to reboot so that it does not provision new instances.
- Migrate the instances to another Compute node.
- Reboot the empty Compute node.
- Enable the empty Compute node.
Prerequisites
Before you reboot the Compute node, you must decide whether to migrate instances to another Compute node while the node is rebooting.
Review the list of migration constraints that you might encounter when you migrate virtual machine instances between Compute nodes. For more information, see Migration constraints in Configuring the Compute service for instance creation.
NoteIf you have a Multi-RHEL environment, and you want to migrate virtual machines from a Compute node that is running RHEL 9.2 to a Compute node that is running RHEL 8.4, only cold migration is supported. For more information about cold migration, see Cold migrating an instance in Configuring the Compute service for instance creation.
If you cannot migrate the instances, you can set the following core template parameters to control the state of the instances after the Compute node reboots:
NovaResumeGuestsStateOnHostBoot-
Determines whether to return instances to the same state on the Compute node after reboot. When set to
False, the instances remain down and you must start them manually. The default value isFalse. NovaResumeGuestsShutdownTimeoutNumber of seconds to wait for an instance to shut down before rebooting. It is not recommended to set this value to
0. The default value is300.For more information about overcloud parameters and their usage, see Overcloud parameters.
Procedure
-
Log in to the undercloud as the
stackuser. Retrieve a list of your Compute nodes to identify the host name of the node that you want to reboot:
(undercloud)$ source ~/overcloudrc (overcloud)$ openstack compute service listIdentify the host name of the Compute node that you want to reboot.
Disable the Compute service on the Compute node that you want to reboot:
(overcloud)$ openstack compute service list (overcloud)$ openstack compute service set <hostname> nova-compute --disable-
Replace
<hostname>with the host name of your Compute node.
-
Replace
List all instances on the Compute node:
(overcloud)$ openstack server list --host <hostname> --all-projectsOptional: To migrate the instances to another Compute node, complete the following steps:
If you decide to migrate the instances to another Compute node, use one of the following commands:
To migrate the instance to a different host, run the following command:
(overcloud) $ openstack server migrate <instance_id> --live <target_host> --wait-
Replace
<instance_id>with your instance ID. -
Replace
<target_host>with the host that you are migrating the instance to.
-
Replace
Let
nova-schedulerautomatically select the target host:(overcloud) $ nova live-migration <instance_id>Live migrate all instances at once:
$ nova host-evacuate-live <hostname>NoteThe
novacommand might cause some deprecation warnings, which are safe to ignore.
- Wait until migration completes.
Confirm that the migration was successful:
(overcloud) $ openstack server list --host <hostname> --all-projects- Continue to migrate instances until none remain on the Compute node.
Log in to the Compute node and reboot the node:
[tripleo-admin@overcloud-compute-0 ~]$ sudo rebootNoteAfter a Compute node reboot, instances with VNC enabled will have a USB keyboard device automatically added to their configuration. If you experience console keyboard input issues for new instances after rebooting a Compute node, see Unable to access console of virtual machine after Compute node reboot. For existing instances, stop the instance, run the following command, and restart the instance:
nova-manage image_property set --property hw_input_bus=virtio <instance_uuid>- Wait until the node boots.
Re-enable the Compute node:
$ source ~/overcloudrc (overcloud) $ openstack compute service set <hostname> nova-compute --enableCheck that the Compute node is enabled:
(overcloud) $ openstack compute service list
4.4. Validating RHOSP after the overcloud update Copy linkLink copied to clipboard!
After you update your Red Hat OpenStack Platform (RHOSP) environment, validate your overcloud with the tripleo-validations playbooks.
For more information about validations, see Using the validation framework in Installing and managing Red Hat OpenStack Platform with director.
Procedure
-
Log in to the undercloud host as the
stackuser. Source the
stackrcundercloud credentials file:$ source ~/stackrcRun the validation:
$ validation run -i ~/overcloud-deploy/<stack>/config-download/<stack>/tripleo-ansible-inventory.yaml --group post-update- Replace <stack> with the name of the stack.
Verification
- To view the results of the validation report, see Viewing validation history in Installing and managing Red Hat OpenStack Platform with director.
If a host is not found when you run a validation, the command reports the status as SKIPPED. A status of SKIPPED means that the validation is not executed, which is expected. Additionally, if a validation’s pass criteria is not met, the command reports the status as FAILED. A FAILED validation does not prevent you from using your updated RHOSP environment. However, a FAILED validation can indicate an issue with your environment.