Ce contenu n'est pas disponible dans la langue sélectionnée.

Chapter 1. OpenShift Container Storage deployed using dynamic devices

1.1. OpenShift Container Storage deployed on AWS
Copier lien

To replace an operational node, see:
- Section 1.1.1, “Replacing an operational AWS node on user-provisioned infrastructure”
- Section 1.1.2, “Replacing an operational AWS node on installer-provisioned infrastructure”
To replace a failed node, see:
- Section 1.1.4, “Replacing a failed AWS node on installer-provisioned infrastructure”
- Section 1.1.3, “Replacing a failed AWS node on user-provisioned infrastructure”

1.1.1. Replacing an operational AWS node on user-provisioned infrastructure
Copier lien

Perform this procedure to replace an operational node on AWS user-provisioned infrastructure.

Prerequisites

Red Hat recommends that replacement nodes are configured with similar infrastructure and resources to the node being replaced.
You must be logged into the OpenShift Container Platform (RHOCP) cluster.

Procedure

Identify the node that needs to be replaced.
Mark the node as unschedulable using the following command:
```
oc adm cordon <node_name>
```
```
$ oc adm cordon <node_name>
```
Copy to Clipboard Toggle word wrap
Drain the node using the following command:
```
oc adm drain <node_name> --force --delete-local-data --ignore-daemonsets
```
```
$ oc adm drain <node_name> --force --delete-local-data --ignore-daemonsets
```
Copy to Clipboard Toggle word wrap
Important
This activity may take at least 5-10 minutes or more. Ceph errors generated during this period are temporary and are automatically resolved when the new node is labeled and functional.
Delete the node using the following command:
```
oc delete nodes <node_name>
```
```
$ oc delete nodes <node_name>
```
Copy to Clipboard Toggle word wrap
Create a new AWS machine instance with the required infrastructure. See Platform requirements.
Create a new OpenShift Container Platform node using the new AWS machine instance.
Check for certificate signing requests (CSRs) related to OpenShift Container Platform that are in Pending state:
```
oc get csr
```
```
$ oc get csr
```
Copy to Clipboard Toggle word wrap
Approve all required OpenShift Container Platform CSRs for the new node:
```
oc adm certificate approve <Certificate_Name>
```
```
$ oc adm certificate approve <Certificate_Name>
```
Copy to Clipboard Toggle word wrap
Click Compute Nodes, confirm if the new node is in Ready state.
Apply the OpenShift Container Storage label to the new node.
From the web user interface
For the new node, click Action Menu (⋮) Edit Labels
Add cluster.ocs.openshift.io/openshift-storage and click Save.
From the command line interface
Execute the following command to apply the OpenShift Container Storage label to the new node:

$ oc label node <new_node_name> cluster.ocs.openshift.io/openshift-storage=""

Copy to Clipboard Toggle word wrap

Verification steps

Execute the following command and verify that the new node is present in the output:

oc get nodes --show-labels | grep cluster.ocs.openshift.io/openshift-storage= |cut -d' ' -f1

$ oc get nodes --show-labels | grep cluster.ocs.openshift.io/openshift-storage= |cut -d' ' -f1

Copy to Clipboard

Toggle word wrap

Click Workloads Pods, confirm that at least the following pods on the new node are in Running state:
- csi-cephfsplugin-*
- csi-rbdplugin-*
Verify that all other required OpenShift Container Storage pods are in Running state.

Verify that new OSD pods are running on the replacement node.

oc get pods -o wide -n openshift-storage| egrep -i new-node-name | egrep osd

$ oc get pods -o wide -n openshift-storage| egrep -i new-node-name | egrep osd

Copy to Clipboard

Toggle word wrap

(Optional) If cluster-wide encryption is enabled on the cluster, verify that the new OSD devices are encrypted.
For each of the new nodes identified in previous step, do the following:
1. Create a debug pod and open a chroot environment for the selected host(s).
  $ oc debug node/<node name> $ chroot /host
  Copy to Clipboard Toggle word wrap
2. Run “lsblk” and check for the “crypt” keyword beside the ocs-deviceset name(s)
  $ lsblk
  Copy to Clipboard Toggle word wrap
If verification steps fail, contact Red Hat Support.

1.1.2. Replacing an operational AWS node on installer-provisioned infrastructure
Copier lien

Use this procedure to replace an operational node on AWS installer-provisioned infrastructure (IPI).

Procedure

Log in to OpenShift Web Console and click Compute Nodes.
Identify the node that needs to be replaced. Take a note of its Machine Name.
Mark the node as unschedulable using the following command:
```
oc adm cordon <node_name>
```
```
$ oc adm cordon <node_name>
```
Copy to Clipboard Toggle word wrap
Drain the node using the following command:
```
oc adm drain <node_name> --force --delete-local-data --ignore-daemonsets
```
```
$ oc adm drain <node_name> --force --delete-local-data --ignore-daemonsets
```
Copy to Clipboard Toggle word wrap
Important
This activity may take at least 5-10 minutes or more. Ceph errors generated during this period are temporary and are automatically resolved when the new node is labeled and functional.
Click Compute Machines. Search for the required machine.
Besides the required machine, click the Action menu (⋮) Delete Machine.
Click Delete to confirm the machine deletion. A new machine is automatically created.
Wait for new machine to start and transition into Running state.
Important
This activity may take at least 5-10 minutes or more.
Click Compute Nodes, confirm if the new node is in Ready state.
Apply the OpenShift Container Storage label to the new node using any one of the following:
From User interface
For the new node, click Action Menu (⋮) Edit Labels
Add cluster.ocs.openshift.io/openshift-storage and click Save.
From Command line interface
Execute the following command to apply the OpenShift Container Storage label to the new node:

$ oc label node <new_node_name> cluster.ocs.openshift.io/openshift-storage=""

Copy to Clipboard Toggle word wrap

Verification steps

Execute the following command and verify that the new node is present in the output:

oc get nodes --show-labels | grep cluster.ocs.openshift.io/openshift-storage= |cut -d' ' -f1

$ oc get nodes --show-labels | grep cluster.ocs.openshift.io/openshift-storage= |cut -d' ' -f1

Copy to Clipboard

Toggle word wrap

Click Workloads Pods, confirm that at least the following pods on the new node are in Running state:
- csi-cephfsplugin-*
- csi-rbdplugin-*
Verify that all other required OpenShift Container Storage pods are in Running state.

Verify that new OSD pods are running on the replacement node.

oc get pods -o wide -n openshift-storage| egrep -i new-node-name | egrep osd

$ oc get pods -o wide -n openshift-storage| egrep -i new-node-name | egrep osd

Copy to Clipboard

Toggle word wrap

(Optional) If cluster-wide encryption is enabled on the cluster, verify that the new OSD devices are encrypted.
For each of the new nodes identified in previous step, do the following:
1. Create a debug pod and open a chroot environment for the selected host(s).
  $ oc debug node/<node name> $ chroot /host
  Copy to Clipboard Toggle word wrap
2. Run “lsblk” and check for the “crypt” keyword beside the ocs-deviceset name(s)
  $ lsblk
  Copy to Clipboard Toggle word wrap
If verification steps fail, contact Red Hat Support.

1.1.3. Replacing a failed AWS node on user-provisioned infrastructure
Copier lien

Perform this procedure to replace a failed node which is not operational on AWS user-provisioned infrastructure (UPI) for OpenShift Container Storage.

Prerequisites

Red Hat recommends that replacement nodes are configured with similar infrastructure and resources to the node being replaced.
You must be logged into the OpenShift Container Platform (RHOCP) cluster.

Procedure

Identify the AWS machine instance of the node that needs to be replaced.
Log in to AWS and terminate the identified AWS machine instance.
Create a new AWS machine instance with the required infrastructure. See platform requirements.
Create a new OpenShift Container Platform node using the new AWS machine instance.
Check for certificate signing requests (CSRs) related to OpenShift Container Platform that are in Pending state:
```
oc get csr
```
```
$ oc get csr
```
Copy to Clipboard Toggle word wrap
Approve all required OpenShift Container Platform CSRs for the new node:
```
oc adm certificate approve <Certificate_Name>
```
```
$ oc adm certificate approve <Certificate_Name>
```
Copy to Clipboard Toggle word wrap
Click Compute Nodes, confirm if the new node is in Ready state.
Apply the OpenShift Container Storage label to the new node using any one of the following:
From User interface
For the new node, click Action Menu (⋮) Edit Labels
Add cluster.ocs.openshift.io/openshift-storage and click Save.
From Command line interface
Execute the following command to apply the OpenShift Container Storage label to the new node:

$ oc label node <new_node_name> cluster.ocs.openshift.io/openshift-storage=""

Copy to Clipboard Toggle word wrap

Verification steps

Execute the following command and verify that the new node is present in the output:

oc get nodes --show-labels | grep cluster.ocs.openshift.io/openshift-storage= |cut -d' ' -f1

$ oc get nodes --show-labels | grep cluster.ocs.openshift.io/openshift-storage= |cut -d' ' -f1

Copy to Clipboard

Toggle word wrap

Click Workloads Pods, confirm that at least the following pods on the new node are in Running state:
- csi-cephfsplugin-*
- csi-rbdplugin-*
Verify that all other required OpenShift Container Storage pods are in Running state.

Verify that new OSD pods are running on the replacement node.

oc get pods -o wide -n openshift-storage| egrep -i new-node-name | egrep osd

$ oc get pods -o wide -n openshift-storage| egrep -i new-node-name | egrep osd

Copy to Clipboard

Toggle word wrap

(Optional) If cluster-wide encryption is enabled on the cluster, verify that the new OSD devices are encrypted.
For each of the new nodes identified in previous step, do the following:
1. Create a debug pod and open a chroot environment for the selected host(s).
  $ oc debug node/<node name> $ chroot /host
  Copy to Clipboard Toggle word wrap
2. Run “lsblk” and check for the “crypt” keyword beside the ocs-deviceset name(s)
  $ lsblk
  Copy to Clipboard Toggle word wrap
If verification steps fail, contact Red Hat Support.

1.1.4. Replacing a failed AWS node on installer-provisioned infrastructure
Copier lien

Perform this procedure to replace a failed node which is not operational on AWS installer-provisioned infrastructure (IPI) for OpenShift Container Storage.

Procedure

Log in to OpenShift Web Console and click Compute Nodes.
Identify the faulty node and click on its Machine Name.
Click Actions Edit Annotations, and click Add More.
Add machine.openshift.io/exclude-node-draining and click Save.
Click Actions Delete Machine, and click Delete.
A new machine is automatically created, wait for new machine to start.
Important
This activity may take at least 5-10 minutes or more. Ceph errors generated during this period are temporary and are automatically resolved when the new node is labeled and functional.
Click Compute Nodes, confirm if the new node is in Ready state.
Apply the OpenShift Container Storage label to the new node using any one of the following:
From User interface
For the new node, click Action Menu (⋮) Edit Labels
Add cluster.ocs.openshift.io/openshift-storage and click Save.
From Command line interface
Execute the following command to apply the OpenShift Container Storage label to the new node:

$ oc label node <new_node_name> cluster.ocs.openshift.io/openshift-storage=""

Copy to Clipboard Toggle word wrap
[Optional]: If the failed AWS instance is not removed automatically, terminate the instance from AWS console.

Verification steps

Execute the following command and verify that the new node is present in the output:

oc get nodes --show-labels | grep cluster.ocs.openshift.io/openshift-storage= |cut -d' ' -f1

$ oc get nodes --show-labels | grep cluster.ocs.openshift.io/openshift-storage= |cut -d' ' -f1

Copy to Clipboard

Toggle word wrap

Click Workloads Pods, confirm that at least the following pods on the new node are in Running state:
- csi-cephfsplugin-*
- csi-rbdplugin-*
Verify that all other required OpenShift Container Storage pods are in Running state.

Verify that new OSD pods are running on the replacement node.

oc get pods -o wide -n openshift-storage| egrep -i new-node-name | egrep osd

$ oc get pods -o wide -n openshift-storage| egrep -i new-node-name | egrep osd

Copy to Clipboard

Toggle word wrap

(Optional) If cluster-wide encryption is enabled on the cluster, verify that the new OSD devices are encrypted.
For each of the new nodes identified in previous step, do the following:
1. Create a debug pod and open a chroot environment for the selected host(s).
  $ oc debug node/<node name> $ chroot /host
  Copy to Clipboard Toggle word wrap
2. Run “lsblk” and check for the “crypt” keyword beside the ocs-deviceset name(s)
  $ lsblk
  Copy to Clipboard Toggle word wrap
If verification steps fail, contact Red Hat Support.

1.2. OpenShift Container Storage deployed on VMware
Copier lien

To replace an operational node, see:
- Section 1.2.1, “Replacing an operational VMware node on user-provisioned infrastructure”
- Section 1.2.2, “Replacing an operational VMware node on installer-provisioned infrastructure”
To replace a failed node, see:
- Section 1.2.3, “Replacing a failed VMware node on user-provisioned infrastructure”
- Section 1.2.4, “Replacing a failed VMware node on installer-provisioned infrastructure”

1.2.1. Replacing an operational VMware node on user-provisioned infrastructure
Copier lien

Perform this procedure to replace an operational node on VMware user-provisioned infrastructure (UPI).

Prerequisites

Red Hat recommends that replacement nodes are configured with similar infrastructure, resources, and disks to the node being replaced.
You must be logged into the OpenShift Container Platform (RHOCP) cluster.

Procedure

Identify the node and its VM that needs to be replaced.
Mark the node as unschedulable using the following command:
```
oc adm cordon <node_name>
```
```
$ oc adm cordon <node_name>
```
Copy to Clipboard Toggle word wrap
Drain the node using the following command:
```
oc adm drain <node_name> --force --delete-local-data --ignore-daemonsets
```
```
$ oc adm drain <node_name> --force --delete-local-data --ignore-daemonsets
```
Copy to Clipboard Toggle word wrap
Important
This activity may take at least 5-10 minutes or more. Ceph errors generated during this period are temporary and are automatically resolved when the new node is labeled and functional.
Delete the node using the following command:
```
oc delete nodes <node_name>
```
```
$ oc delete nodes <node_name>
```
Copy to Clipboard Toggle word wrap
Log in to vSphere and terminate the identified VM.
Important
VM should be deleted only from the inventory and not from the disk.
Create a new VM on vSphere with the required infrastructure. See Platform requirements.
Create a new OpenShift Container Platform worker node using the new VM.
Check for certificate signing requests (CSRs) related to OpenShift Container Platform that are in Pending state:
```
oc get csr
```
```
$ oc get csr
```
Copy to Clipboard Toggle word wrap
Approve all required OpenShift Container Platform CSRs for the new node:
```
oc adm certificate approve <Certificate_Name>
```
```
$ oc adm certificate approve <Certificate_Name>
```
Copy to Clipboard Toggle word wrap
Click Compute Nodes, confirm if the new node is in Ready state.
Apply the OpenShift Container Storage label to the new node using any one of the following:
From User interface
For the new node, click Action Menu (⋮) Edit Labels
Add cluster.ocs.openshift.io/openshift-storage and click Save.
From Command line interface
Execute the following command to apply the OpenShift Container Storage label to the new node:

$ oc label node <new_node_name> cluster.ocs.openshift.io/openshift-storage=""

Copy to Clipboard Toggle word wrap

Verification steps

Execute the following command and verify that the new node is present in the output:

oc get nodes --show-labels | grep cluster.ocs.openshift.io/openshift-storage= |cut -d' ' -f1

$ oc get nodes --show-labels | grep cluster.ocs.openshift.io/openshift-storage= |cut -d' ' -f1

Copy to Clipboard

Toggle word wrap

Click Workloads Pods, confirm that at least the following pods on the new node are in Running state:
- csi-cephfsplugin-*
- csi-rbdplugin-*
Verify that all other required OpenShift Container Storage pods are in Running state.

Verify that new OSD pods are running on the replacement node.

oc get pods -o wide -n openshift-storage| egrep -i new-node-name | egrep osd

$ oc get pods -o wide -n openshift-storage| egrep -i new-node-name | egrep osd

Copy to Clipboard

Toggle word wrap

(Optional) If cluster-wide encryption is enabled on the cluster, verify that the new OSD devices are encrypted.
For each of the new nodes identified in previous step, do the following:
1. Create a debug pod and open a chroot environment for the selected host(s).
  $ oc debug node/<node name> $ chroot /host
  Copy to Clipboard Toggle word wrap
2. Run “lsblk” and check for the “crypt” keyword beside the ocs-deviceset name(s)
  $ lsblk
  Copy to Clipboard Toggle word wrap
If verification steps fail, contact Red Hat Support.

1.2.2. Replacing an operational VMware node on installer-provisioned infrastructure
Copier lien

Use this procedure to replace an operational node on VMware installer-provisioned infrastructure (IPI).

Procedure

Log in to OpenShift Web Console and click Compute Nodes.
Identify the node that needs to be replaced. Take a note of its Machine Name.
Mark the node as unschedulable using the following command:
```
oc adm cordon <node_name>
```
```
$ oc adm cordon <node_name>
```
Copy to Clipboard Toggle word wrap
Drain the node using the following command:
```
oc adm drain <node_name> --force --delete-local-data --ignore-daemonsets
```
```
$ oc adm drain <node_name> --force --delete-local-data --ignore-daemonsets
```
Copy to Clipboard Toggle word wrap
Important
This activity may take at least 5-10 minutes or more. Ceph errors generated during this period are temporary and are automatically resolved when the new node is labeled and functional.
Click Compute Machines. Search for the required machine.
Besides the required machine, click the Action menu (⋮) Delete Machine.
Click Delete to confirm the machine deletion. A new machine is automatically created.
Wait for new machine to start and transition into Running state.
Important
This activity may take at least 5-10 minutes or more.
Click Compute Nodes, confirm if the new node is in Ready state.
Apply the OpenShift Container Storage label to the new node using any one of the following:
From User interface
For the new node, click Action Menu (⋮) Edit Labels
Add cluster.ocs.openshift.io/openshift-storage and click Save.
From Command line interface
Execute the following command to apply the OpenShift Container Storage label to the new node:

$ oc label node <new_node_name> cluster.ocs.openshift.io/openshift-storage=""

Copy to Clipboard Toggle word wrap

Verification steps

Execute the following command and verify that the new node is present in the output:

oc get nodes --show-labels | grep cluster.ocs.openshift.io/openshift-storage= |cut -d' ' -f1

$ oc get nodes --show-labels | grep cluster.ocs.openshift.io/openshift-storage= |cut -d' ' -f1

Copy to Clipboard

Toggle word wrap

Click Workloads Pods, confirm that at least the following pods on the new node are in Running state:
- csi-cephfsplugin-*
- csi-rbdplugin-*
Verify that all other required OpenShift Container Storage pods are in Running state.

Verify that new OSD pods are running on the replacement node.

oc get pods -o wide -n openshift-storage| egrep -i new-node-name | egrep osd

$ oc get pods -o wide -n openshift-storage| egrep -i new-node-name | egrep osd

Copy to Clipboard

Toggle word wrap

(Optional) If cluster-wide encryption is enabled on the cluster, verify that the new OSD devices are encrypted.
For each of the new nodes identified in previous step, do the following:
1. Create a debug pod and open a chroot environment for the selected host(s).
  $ oc debug node/<node name> $ chroot /host
  Copy to Clipboard Toggle word wrap
2. Run “lsblk” and check for the “crypt” keyword beside the ocs-deviceset name(s)
  $ lsblk
  Copy to Clipboard Toggle word wrap
If verification steps fail, contact Red Hat Support.

1.2.3. Replacing a failed VMware node on user-provisioned infrastructure
Copier lien

Perform this procedure to replace a failed node on VMware user-provisioned infrastructure (UPI).

Prerequisites

Red Hat recommends that replacement nodes are configured with similar infrastructure, resources, and disks to the node being replaced.
You must be logged into the OpenShift Container Platform (RHOCP) cluster.

Procedure

Identify the node and its VM that needs to be replaced.
Delete the node using the following command:
```
oc delete nodes <node_name>
```
```
$ oc delete nodes <node_name>
```
Copy to Clipboard Toggle word wrap
Log in to vSphere and terminate the identified VM.
Important
VM should be deleted only from the inventory and not from the disk.
Create a new VM on vSphere with the required infrastructure. See Platform requirements.
Create a new OpenShift Container Platform worker node using the new VM.
Check for certificate signing requests (CSRs) related to OpenShift Container Platform that are in Pending state:
```
oc get csr
```
```
$ oc get csr
```
Copy to Clipboard Toggle word wrap
Approve all required OpenShift Container Platform CSRs for the new node:
```
oc adm certificate approve <Certificate_Name>
```
```
$ oc adm certificate approve <Certificate_Name>
```
Copy to Clipboard Toggle word wrap
Click Compute Nodes, confirm if the new node is in Ready state.
Apply the OpenShift Container Storage label to the new node using any one of the following:
From User interface
For the new node, click Action Menu (⋮) Edit Labels
Add cluster.ocs.openshift.io/openshift-storage and click Save.
From Command line interface
Execute the following command to apply the OpenShift Container Storage label to the new node:

$ oc label node <new_node_name> cluster.ocs.openshift.io/openshift-storage=""

Copy to Clipboard Toggle word wrap

Verification steps

Execute the following command and verify that the new node is present in the output:

oc get nodes --show-labels | grep cluster.ocs.openshift.io/openshift-storage= |cut -d' ' -f1

$ oc get nodes --show-labels | grep cluster.ocs.openshift.io/openshift-storage= |cut -d' ' -f1

Copy to Clipboard

Toggle word wrap

Click Workloads Pods, confirm that at least the following pods on the new node are in Running state:
- csi-cephfsplugin-*
- csi-rbdplugin-*
Verify that all other required OpenShift Container Storage pods are in Running state.

Verify that new OSD pods are running on the replacement node.

oc get pods -o wide -n openshift-storage| egrep -i new-node-name | egrep osd

$ oc get pods -o wide -n openshift-storage| egrep -i new-node-name | egrep osd

Copy to Clipboard

Toggle word wrap

(Optional) If cluster-wide encryption is enabled on the cluster, verify that the new OSD devices are encrypted.
For each of the new nodes identified in previous step, do the following:
1. Create a debug pod and open a chroot environment for the selected host(s).
  $ oc debug node/<node name> $ chroot /host
  Copy to Clipboard Toggle word wrap
2. Run “lsblk” and check for the “crypt” keyword beside the ocs-deviceset name(s)
  $ lsblk
  Copy to Clipboard Toggle word wrap
If verification steps fail, contact Red Hat Support.

1.2.4. Replacing a failed VMware node on installer-provisioned infrastructure
Copier lien

Perform this procedure to replace a failed node which is not operational on VMware installer-provisioned infrastructure (IPI) for OpenShift Container Storage.

Procedure

Log in to OpenShift Web Console and click Compute Nodes.
Identify the faulty node and click on its Machine Name.
Click Actions Edit Annotations, and click Add More.
Add machine.openshift.io/exclude-node-draining and click Save.
Click Actions Delete Machine, and click Delete.
A new machine is automatically created, wait for new machine to start.
Important
This activity may take at least 5-10 minutes or more. Ceph errors generated during this period are temporary and are automatically resolved when the new node is labeled and functional.
Click Compute Nodes, confirm if the new node is in Ready state.
Apply the OpenShift Container Storage label to the new node using any one of the following:
From User interface
For the new node, click Action Menu (⋮) Edit Labels
Add cluster.ocs.openshift.io/openshift-storage and click Save.
From Command line interface
Execute the following command to apply the OpenShift Container Storage label to the new node:

$ oc label node <new_node_name> cluster.ocs.openshift.io/openshift-storage=""

Copy to Clipboard Toggle word wrap
[Optional]: If the failed VM is not removed automatically, terminate the VM from vSphere.

Verification steps

Execute the following command and verify that the new node is present in the output:

oc get nodes --show-labels | grep cluster.ocs.openshift.io/openshift-storage= |cut -d' ' -f1

$ oc get nodes --show-labels | grep cluster.ocs.openshift.io/openshift-storage= |cut -d' ' -f1

Copy to Clipboard

Toggle word wrap

Click Workloads Pods, confirm that at least the following pods on the new node are in Running state:
- csi-cephfsplugin-*
- csi-rbdplugin-*
Verify that all other required OpenShift Container Storage pods are in Running state.

Verify that new OSD pods are running on the replacement node.

oc get pods -o wide -n openshift-storage| egrep -i new-node-name | egrep osd

$ oc get pods -o wide -n openshift-storage| egrep -i new-node-name | egrep osd

Copy to Clipboard

Toggle word wrap

(Optional) If cluster-wide encryption is enabled on the cluster, verify that the new OSD devices are encrypted.
For each of the new nodes identified in previous step, do the following:
1. Create a debug pod and open a chroot environment for the selected host(s).
  $ oc debug node/<node name> $ chroot /host
  Copy to Clipboard Toggle word wrap
2. Run “lsblk” and check for the “crypt” keyword beside the ocs-deviceset name(s)
  $ lsblk
  Copy to Clipboard Toggle word wrap
If verification steps fail, contact Red Hat Support.

1.3. OpenShift Container Storage deployed on Red Hat Virtualization
Copier lien

To replace an operational node, see Section 1.3.1, “Replacing an operational Red Hat Virtualization node on installer-provisioned infrastructure”
To replace a failed node, see Section 2.4.2, “Replacing a failed node on Red Hat Virtualization installer-provisioned infrastructure”

1.3.1. Replacing an operational Red Hat Virtualization node on installer-provisioned infrastructure
Copier lien

Use this procedure to replace an operational node on Red Hat Virtualization installer-provisioned infrastructure (IPI).

Procedure

Log in to OpenShift Web Console and click Compute Nodes.
Identify the node that needs to be replaced. Take a note of its Machine Name.
Mark the node as unschedulable using the following command:
```
oc adm cordon <node_name>
```
```
$ oc adm cordon <node_name>
```
Copy to Clipboard Toggle word wrap
Drain the node using the following command:
```
oc adm drain <node_name> --force --delete-local-data --ignore-daemonsets
```
```
$ oc adm drain <node_name> --force --delete-local-data --ignore-daemonsets
```
Copy to Clipboard Toggle word wrap
Important
This activity may take at least 5-10 minutes or more. Ceph errors generated during this period are temporary and are automatically resolved when the new node is labeled and functional.
Click Compute Machines. Search for the required machine.
Besides the required machine, click the Action menu (⋮) Delete Machine.
Click Delete to confirm the machine deletion. A new machine is automatically created. Wait for new machine to start and transition into Running state.
Important
This activity may take at least 5-10 minutes or more.
Click Compute Nodes, confirm if the new node is in Ready state.
Apply the OpenShift Container Storage label to the new node using any one of the following:
From User interface
For the new node, click Action Menu (⋮) Edit Labels.
Add cluster.ocs.openshift.io/openshift-storage and click Save.
From Command line interface
Execute the following command to apply the OpenShift Container Storage label to the new node:
$ oc label node <new_node_name> cluster.ocs.openshift.io/openshift-storage=""

Copy to Clipboard Toggle word wrap

Verification steps

Execute the following command and verify that the new node is present in the output:

oc get nodes --show-labels | grep cluster.ocs.openshift.io/openshift-storage= |cut -d' ' -f1

$ oc get nodes --show-labels | grep cluster.ocs.openshift.io/openshift-storage= |cut -d' ' -f1

Copy to Clipboard

Toggle word wrap

Click Workloads Pods, confirm that at least the following pods on the new node are in Running state:
- csi-cephfsplugin-*
- csi-rbdplugin-*
Verify that all other required OpenShift Container Storage pods are in Running state.

Verify that new OSD pods are running on the replacement node.

oc get pods -o wide -n openshift-storage| egrep -i new-node-name | egrep osd

$ oc get pods -o wide -n openshift-storage| egrep -i new-node-name | egrep osd

Copy to Clipboard

Toggle word wrap

(Optional) If cluster-wide encryption is enabled on the cluster, verify that the new OSD devices are encrypted.
For each of the new nodes identified in previous step, do the following:
1. Create a debug pod and open a chroot environment for the selected host(s).
  $ oc debug node/<node name> $ chroot /host
  Copy to Clipboard Toggle word wrap
2. Run “lsblk” and check for the “crypt” keyword beside the ocs-deviceset name(s)
  $ lsblk
  Copy to Clipboard Toggle word wrap
If verification steps fail, contact Red Hat Support.

1.3.2. Replacing a failed Red Hat Virtualization node on installer-provisioned infrastructure
Copier lien

Perform this procedure to replace a failed node which is not operational on Red Hat Virtualization installer-provisioned infrastructure (IPI) for OpenShift Container Storage.

Procedure

Log in to OpenShift Web Console and click Compute Nodes.
Identify the faulty node. Take a note of its Machine Name.
Log in to Red Hat Virtualization Administration Portal and remove the virtual disks associated with mon and OSDs from the failed Virtual Machine.
This step is required so that the disks are not deleted when the VM instance is deleted as part of the Delete machine step.
Important
Do not select the Remove Permanently option when removing the disk(s).
In the OpenShift Web Console, click Compute Machines. Search for the required machine.
Click Actions Edit Annotations, and click Add More.
Add machine.openshift.io/exclude-node-draining and click Save.
Click Actions Delete Machine, and click Delete.
A new machine is automatically created, wait for new machine to start.
Important
This activity may take at least 5-10 minutes or more. Ceph errors generated during this period are temporary and are automatically resolved when the new node is labeled and functional.
Click Compute Nodes, confirm if the new node is in Ready state.
Apply the OpenShift Container Storage label to the new node using any one of the following:
From User interface
For the new node, click Action Menu (⋮) Edit Labels.
Add cluster.ocs.openshift.io/openshift-storage and click Save.
From Command line interface
Execute the following command to apply the OpenShift Container Storage label to the new node:

$ oc label node <new_node_name> cluster.ocs.openshift.io/openshift-storage=""

Copy to Clipboard Toggle word wrap
(Optional) If the failed VM is not removed automatically, remove the VM from Red Hat Virtualization Administration Portal.

Verification steps

Execute the following command and verify that the new node is present in the output:

oc get nodes --show-labels | grep cluster.ocs.openshift.io/openshift-storage= |cut -d' ' -f1

$ oc get nodes --show-labels | grep cluster.ocs.openshift.io/openshift-storage= |cut -d' ' -f1

Copy to Clipboard

Toggle word wrap

Click Workloads Pods, confirm that at least the following pods on the new node are in Running state:
- csi-cephfsplugin-*
- csi-rbdplugin-*
Verify that all other required OpenShift Container Storage pods are in Running state.

Verify that new OSD pods are running on the replacement node.

oc get pods -o wide -n openshift-storage| egrep -i new-node-name | egrep osd

$ oc get pods -o wide -n openshift-storage| egrep -i new-node-name | egrep osd

Copy to Clipboard

Toggle word wrap

(Optional) If cluster-wide encryption is enabled on the cluster, verify that the new OSD devices are encrypted.
For each of the new nodes identified in previous step, do the following:
1. Create a debug pod and open a chroot environment for the selected host(s).
  $ oc debug node/<node name> $ chroot /host
  Copy to Clipboard Toggle word wrap
2. Run “lsblk” and check for the “crypt” keyword beside the ocs-deviceset name(s)
  $ lsblk
  Copy to Clipboard Toggle word wrap
If verification steps fail, contact Red Hat Support.

1.4. OpenShift Container Storage deployed on Microsoft Azure
Copier lien

To replace an operational node, see Section 1.4.1, “Replacing operational nodes on Azure installer-provisioned infrastructure” To replace a failed node, see Section 1.4.2, “Replacing failed nodes on Azure installer-provisioned infrastructure”

1.4.1. Replacing operational nodes on Azure installer-provisioned infrastructure
Copier lien

Use this procedure to replace an operational node on Azure installer-provisioned infrastructure (IPI).

Procedure

Log in to OpenShift Web Console and click Compute Nodes.
Identify the node that needs to be replaced. Take a note of its Machine Name.
Mark the node as unschedulable using the following command:
```
oc adm cordon <node_name>
```
```
$ oc adm cordon <node_name>
```
Copy to Clipboard Toggle word wrap
Drain the node using the following command:
```
oc adm drain <node_name> --force --delete-local-data --ignore-daemonsets
```
```
$ oc adm drain <node_name> --force --delete-local-data --ignore-daemonsets
```
Copy to Clipboard Toggle word wrap
Important
This activity may take at least 5-10 minutes or more. Ceph errors generated during this period are temporary and are automatically resolved when the new node is labeled and functional.
Click Compute Machines. Search for the required machine.
Besides the required machine, click the Action menu (⋮) Delete Machine.
Click Delete to confirm the machine deletion. A new machine is automatically created.
Wait for new machine to start and transition into Running state.
Important
This activity may take at least 5-10 minutes or more.
Click Compute Nodes, confirm if the new node is in Ready state.
Apply the OpenShift Container Storage label to the new node using any one of the following:
From User interface
For the new node, click Action Menu (⋮) Edit Labels
Add cluster.ocs.openshift.io/openshift-storage and click Save.
From Command line interface
Execute the following command to apply the OpenShift Container Storage label to the new node:

$ oc label node <new_node_name> cluster.ocs.openshift.io/openshift-storage=""

Copy to Clipboard Toggle word wrap

Verification steps

Execute the following command and verify that the new node is present in the output:

oc get nodes --show-labels | grep cluster.ocs.openshift.io/openshift-storage= |cut -d' ' -f1

$ oc get nodes --show-labels | grep cluster.ocs.openshift.io/openshift-storage= |cut -d' ' -f1

Copy to Clipboard

Toggle word wrap

Click Workloads Pods, confirm that at least the following pods on the new node are in Running state:
- csi-cephfsplugin-*
- csi-rbdplugin-*
Verify that all other required OpenShift Container Storage pods are in Running state.

Verify that new OSD pods are running on the replacement node.

oc get pods -o wide -n openshift-storage| egrep -i new-node-name | egrep osd

$ oc get pods -o wide -n openshift-storage| egrep -i new-node-name | egrep osd

Copy to Clipboard

Toggle word wrap

(Optional) If cluster-wide encryption is enabled on the cluster, verify that the new OSD devices are encrypted.
For each of the new nodes identified in previous step, do the following:
1. Create a debug pod and open a chroot environment for the selected host(s).
  $ oc debug node/<node name> $ chroot /host
  Copy to Clipboard Toggle word wrap
2. Run “lsblk” and check for the “crypt” keyword beside the ocs-deviceset name(s)
  $ lsblk
  Copy to Clipboard Toggle word wrap
If verification steps fail, contact Red Hat Support.

1.4.2. Replacing failed nodes on Azure installer-provisioned infrastructure
Copier lien

Perform this procedure to replace a failed node which is not operational on Azure installer-provisioned infrastructure (IPI) for OpenShift Container Storage.

Procedure

Log in to OpenShift Web Console and click Compute Nodes.
Identify the faulty node and click on its Machine Name.
Click Actions Edit Annotations, and click Add More.
Add machine.openshift.io/exclude-node-draining and click Save.
Click Actions Delete Machine, and click Delete.
A new machine is automatically created, wait for new machine to start.
Important
This activity may take at least 5-10 minutes or more. Ceph errors generated during this period are temporary and are automatically resolved when the new node is labeled and functional.
Click Compute Nodes, confirm if the new node is in Ready state.
Apply the OpenShift Container Storage label to the new node using any one of the following:
From User interface
For the new node, click Action Menu (⋮) Edit Labels
Add cluster.ocs.openshift.io/openshift-storage and click Save.
From Command line interface
Execute the following command to apply the OpenShift Container Storage label to the new node:

$ oc label node <new_node_name> cluster.ocs.openshift.io/openshift-storage=""

Copy to Clipboard Toggle word wrap
[Optional]: If the failed Azure instance is not removed automatically, terminate the instance from Azure console.

Verification steps

Execute the following command and verify that the new node is present in the output:

oc get nodes --show-labels | grep cluster.ocs.openshift.io/openshift-storage= |cut -d' ' -f1

$ oc get nodes --show-labels | grep cluster.ocs.openshift.io/openshift-storage= |cut -d' ' -f1

Copy to Clipboard

Toggle word wrap

Click Workloads Pods, confirm that at least the following pods on the new node are in Running state:
- csi-cephfsplugin-*
- csi-rbdplugin-*
Verify that all other required OpenShift Container Storage pods are in Running state.

Verify that new OSD pods are running on the replacement node.

oc get pods -o wide -n openshift-storage| egrep -i new-node-name | egrep osd

$ oc get pods -o wide -n openshift-storage| egrep -i new-node-name | egrep osd

Copy to Clipboard

Toggle word wrap

(Optional) If cluster-wide encryption is enabled on the cluster, verify that the new OSD devices are encrypted.
For each of the new nodes identified in previous step, do the following:
1. Create a debug pod and open a chroot environment for the selected host(s).
  $ oc debug node/<node name> $ chroot /host
  Copy to Clipboard Toggle word wrap
2. Run “lsblk” and check for the “crypt” keyword beside the ocs-deviceset name(s)
  $ lsblk
  Copy to Clipboard Toggle word wrap
If verification steps fail, contact Red Hat Support.

Ce contenu n'est pas disponible dans la langue sélectionnée.

Chapter 1. OpenShift Container Storage deployed using dynamic devices

1.1. OpenShift Container Storage deployed on AWS
Copier lien

1.1.1. Replacing an operational AWS node on user-provisioned infrastructure
Copier lien

1.1.2. Replacing an operational AWS node on installer-provisioned infrastructure
Copier lien

1.1.3. Replacing a failed AWS node on user-provisioned infrastructure
Copier lien

1.1.4. Replacing a failed AWS node on installer-provisioned infrastructure
Copier lien

1.2. OpenShift Container Storage deployed on VMware
Copier lien

1.2.1. Replacing an operational VMware node on user-provisioned infrastructure
Copier lien

1.2.2. Replacing an operational VMware node on installer-provisioned infrastructure
Copier lien

1.2.3. Replacing a failed VMware node on user-provisioned infrastructure
Copier lien

1.2.4. Replacing a failed VMware node on installer-provisioned infrastructure
Copier lien

1.3. OpenShift Container Storage deployed on Red Hat Virtualization
Copier lien

1.3.1. Replacing an operational Red Hat Virtualization node on installer-provisioned infrastructure
Copier lien

1.3.2. Replacing a failed Red Hat Virtualization node on installer-provisioned infrastructure
Copier lien

1.4. OpenShift Container Storage deployed on Microsoft Azure
Copier lien

1.4.1. Replacing operational nodes on Azure installer-provisioned infrastructure
Copier lien

1.4.2. Replacing failed nodes on Azure installer-provisioned infrastructure
Copier lien

Apprendre

Essayez, achetez et vendez

Communautés

À propos de la documentation Red Hat

Rendre l’open source plus inclusif

À propos de Red Hat

Theme

Red Hat legal and privacy links

Red Hat legal and privacy links

Ce contenu n'est pas disponible dans la langue sélectionnée.

Chapter 1. OpenShift Container Storage deployed using dynamic devices

1.1. OpenShift Container Storage deployed on AWSCopier lienLien copié sur presse-papiers!

1.1.1. Replacing an operational AWS node on user-provisioned infrastructureCopier lienLien copié sur presse-papiers!

1.1.2. Replacing an operational AWS node on installer-provisioned infrastructureCopier lienLien copié sur presse-papiers!

1.1.3. Replacing a failed AWS node on user-provisioned infrastructureCopier lienLien copié sur presse-papiers!

1.1.4. Replacing a failed AWS node on installer-provisioned infrastructureCopier lienLien copié sur presse-papiers!

1.2. OpenShift Container Storage deployed on VMwareCopier lienLien copié sur presse-papiers!

1.2.1. Replacing an operational VMware node on user-provisioned infrastructureCopier lienLien copié sur presse-papiers!

1.2.2. Replacing an operational VMware node on installer-provisioned infrastructureCopier lienLien copié sur presse-papiers!

1.2.3. Replacing a failed VMware node on user-provisioned infrastructureCopier lienLien copié sur presse-papiers!

1.2.4. Replacing a failed VMware node on installer-provisioned infrastructureCopier lienLien copié sur presse-papiers!

1.3. OpenShift Container Storage deployed on Red Hat VirtualizationCopier lienLien copié sur presse-papiers!

1.3.1. Replacing an operational Red Hat Virtualization node on installer-provisioned infrastructureCopier lienLien copié sur presse-papiers!

1.3.2. Replacing a failed Red Hat Virtualization node on installer-provisioned infrastructureCopier lienLien copié sur presse-papiers!

1.4. OpenShift Container Storage deployed on Microsoft AzureCopier lienLien copié sur presse-papiers!

1.4.1. Replacing operational nodes on Azure installer-provisioned infrastructureCopier lienLien copié sur presse-papiers!

1.4.2. Replacing failed nodes on Azure installer-provisioned infrastructureCopier lienLien copié sur presse-papiers!

Apprendre

Essayez, achetez et vendez

Communautés

À propos de la documentation Red Hat

Rendre l’open source plus inclusif

À propos de Red Hat

Theme

Red Hat legal and privacy links

Red Hat legal and privacy links

1.1. OpenShift Container Storage deployed on AWS
Copier lien

1.1.1. Replacing an operational AWS node on user-provisioned infrastructure
Copier lien

1.1.2. Replacing an operational AWS node on installer-provisioned infrastructure
Copier lien

1.1.3. Replacing a failed AWS node on user-provisioned infrastructure
Copier lien

1.1.4. Replacing a failed AWS node on installer-provisioned infrastructure
Copier lien

1.2. OpenShift Container Storage deployed on VMware
Copier lien

1.2.1. Replacing an operational VMware node on user-provisioned infrastructure
Copier lien

1.2.2. Replacing an operational VMware node on installer-provisioned infrastructure
Copier lien

1.2.3. Replacing a failed VMware node on user-provisioned infrastructure
Copier lien

1.2.4. Replacing a failed VMware node on installer-provisioned infrastructure
Copier lien

1.3. OpenShift Container Storage deployed on Red Hat Virtualization
Copier lien

1.3.1. Replacing an operational Red Hat Virtualization node on installer-provisioned infrastructure
Copier lien

1.3.2. Replacing a failed Red Hat Virtualization node on installer-provisioned infrastructure
Copier lien

1.4. OpenShift Container Storage deployed on Microsoft Azure
Copier lien

1.4.1. Replacing operational nodes on Azure installer-provisioned infrastructure
Copier lien

1.4.2. Replacing failed nodes on Azure installer-provisioned infrastructure
Copier lien