Questo contenuto non è disponibile nella lingua selezionata.
Chapter 4. Performing maintenance on Compute nodes and Controller nodes with Instance HA
To perform maintenance on a Compute node or a Controller node with Instance HA, stop the node by setting it in standby mode and disabling the Pacemaker resources on the node. After you complete the maintenance work, you start the node and check that the Pacemaker resources are healthy.
Prerequisites
- A running overcloud with Instance HA enabled
Procedure
Log in to a Controller node and stop the Compute or Controller node:
pcs node standby <node UUID>
# pcs node standby <node UUID>Copy to Clipboard Copied! Toggle word wrap Toggle overflow ImportantYou must log in to a different node from the node you want to stop.
Disable the Pacemaker resources on the node:
pcs resource disable <ocf::pacemaker:remote on the node>
# pcs resource disable <ocf::pacemaker:remote on the node>Copy to Clipboard Copied! Toggle word wrap Toggle overflow - Perform any maintenance work on the node.
- Restore the IPMI connection and start the node. Wait until the node is ready before proceeding.
Enable the Pacemaker resources on the node and start the node:
pcs resource enable <ocf::pacemaker:remote on the node> pcs node unstandby <node UUID>
# pcs resource enable <ocf::pacemaker:remote on the node> # pcs node unstandby <node UUID>Copy to Clipboard Copied! Toggle word wrap Toggle overflow If you set the node to maintenance mode, source the credential file for your overcloud and unset the node from maintenance mode:
source stackrc openstack baremetal node maintenance unset <baremetal node UUID>
# source stackrc # openstack baremetal node maintenance unset <baremetal node UUID>Copy to Clipboard Copied! Toggle word wrap Toggle overflow
Verification
Check that the Pacemaker resources are active and healthy:
pcs status
# pcs statusCopy to Clipboard Copied! Toggle word wrap Toggle overflow -
If any Pacemaker resources fail to start during the startup process, run the
pcs resource cleanupcommand to reset the status and the fail count of the resource. If you evacuated instances from a Compute node before you stopped the node, check that the instances are migrated to a different node:
openstack server list --long nova migration-list
# openstack server list --long # nova migration-listCopy to Clipboard Copied! Toggle word wrap Toggle overflow