17.3. Clearing failure status of cluster resources
If a resource has failed, a failure message appears when you display the cluster status with the pcs status command. After attempting to resolve the cause of the failure, you can check the updated status of the resource by running the pcs status command again, and you can check the failure count for the cluster resources with the pcs resource failcount show --full command.
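The two checks described above can be bundled into a small shell helper so that they are run the same way before and after a fix. This is a minimal sketch: the function name show_failures is a hypothetical convenience and not part of pcs; only the two pcs commands named in the text are used.

```shell
#!/bin/sh
# Sketch only: show_failures is a hypothetical helper name, not a pcs subcommand.
# It runs the two status checks named in the text, in order.
show_failures() {
    # Overall cluster status, including messages for any failed resources.
    pcs status
    # Failure counts for the cluster resources.
    pcs resource failcount show --full
}
```

Calling show_failures on a cluster node once before and once after addressing the cause of a failure gives two comparable snapshots.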
After you resolve the cause of a resource failure, you may want to remove the failure message from the status display by removing the failure operation history.
- Resetting the failure status and removing the failure operation history
You can clear the failure status of a resource with the pcs resource cleanup command. The pcs resource cleanup command resets the resource status and failcount value for the resource. This command also removes the operation history for the resource and re-detects its current state. The pcs resource cleanup command operates only on resources with failed actions as shown in the cluster status.
The following command resets the resource status and failcount value for the resource specified by resource_id.
    pcs resource cleanup resource_id
If you do not specify resource_id, the pcs resource cleanup command resets the resource status and failcount value for all resources with a failure count.
- Resetting the resource status and removing the full resource operation history
You can reset the resource status and clear the entire operation history of a resource with the pcs resource refresh resource_id command. Run the pcs resource refresh command with no options specified to reset the resource status and failcount value for all resources.
The pcs resource refresh command operates on resources regardless of their current state. This requires that Pacemaker reprobe the resources on all nodes, which increases the workload. To remove the operation history only of resources with failed actions, use the pcs resource cleanup command.
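The choice between the two commands can be sketched as a small shell wrapper that defaults to the lighter cleanup operation and uses refresh only on request. The wrapper name clear_failures and its --full-history switch are hypothetical and exist only in this sketch; the pcs invocations themselves are the ones described in the list above.

```shell
#!/bin/sh
# Sketch only: clear_failures and --full-history are hypothetical names
# introduced for illustration; they are not pcs options.
# Usage: clear_failures [--full-history] [resource_id]
clear_failures() {
    # Default: cleanup, which acts only on resources with failed actions.
    action=cleanup
    if [ "${1:-}" = "--full-history" ]; then
        # refresh acts on resources regardless of their current state and
        # makes Pacemaker reprobe them on all nodes, so it is opt-in here.
        action=refresh
        shift
    fi
    if [ "$#" -gt 0 ]; then
        # A resource_id was given: act on that resource only.
        pcs resource "$action" "$1"
    else
        # No resource_id: let pcs act on all applicable resources.
        pcs resource "$action"
    fi
}
```

For example, clear_failures my_resource would run pcs resource cleanup for a resource with the placeholder ID my_resource, while clear_failures --full-history my_resource would run pcs resource refresh for it. Keeping cleanup as the default reflects the note above that refresh increases the workload on the cluster.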