Chapter 11. Scaling the Ceph Storage cluster
11.1. Scaling up the Ceph Storage cluster
You can scale up the number of Ceph Storage nodes in your overcloud by re-running the deployment with the number of Ceph Storage nodes you need.
Before doing so, ensure that you have enough nodes for the updated deployment. These nodes must be registered with the director and tagged accordingly.
Registering new Ceph Storage nodes
To register new Ceph Storage nodes with director, complete the following steps.
Procedure
Log in to the undercloud as the stack user and initialize your director configuration:
$ source ~/stackrc
Define the hardware and power management details for the new nodes in a new node definition template, for example, instackenv-scale.json. Import this file into director:
$ openstack overcloud node import ~/instackenv-scale.json
Importing the node definition template registers each node that it defines with director.
Assign the kernel and ramdisk images to all nodes:
$ openstack overcloud node configure
For more information about registering new nodes, see Section 2.2, “Registering nodes”.
Manually tagging new nodes
After you register each node, you must inspect the hardware and tag the node into a specific profile. Use profile tags to match your nodes to flavors, and then assign flavors to deployment roles.
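For reference, a flavor matches a profile through its capabilities:profile property, and the flavor is then assigned to a deployment role through a parameter such as OvercloudCephStorageFlavor. A minimal sketch, assuming a ceph-storage flavor already exists on the undercloud (the flavor and profile names here are illustrative):
# assumes a ceph-storage flavor already exists; adjust the flavor and profile names to your deployment
$ openstack flavor set --property "capabilities:profile"="ceph-storage" \
  --property "capabilities:boot_option"="local" ceph-storage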
Procedure
Trigger hardware introspection to retrieve the hardware attributes of each node:
$ openstack overcloud node introspect --all-manageable --provide
The --all-manageable option introspects only the nodes that are in a manageable state. In this example, all nodes are in a manageable state. The --provide option resets all nodes to an active state after introspection.
Important: Ensure that this process completes successfully. This process usually takes 15 minutes for bare metal nodes.
Retrieve a list of your nodes to identify their UUIDs:
$ openstack baremetal node list
Add a profile option to the properties/capabilities parameter of each node to manually tag the node to a specific profile. The addition of the profile option tags the nodes into each respective profile.
Note: As an alternative to manual tagging, use the Automated Health Check (AHC) Tools to automatically tag larger numbers of nodes based on benchmarking data.
For example, the following commands tag three additional nodes with the ceph-storage profile:
$ openstack baremetal node set --property capabilities='profile:ceph-storage,boot_option:local' 551d81f5-4df2-4e0f-93da-6c5de0b868f7
$ openstack baremetal node set --property capabilities='profile:ceph-storage,boot_option:local' 5e735154-bd6b-42dd-9cc2-b6195c4196d7
$ openstack baremetal node set --property capabilities='profile:ceph-storage,boot_option:local' 1a2b090c-299d-4c20-a25d-57dd21a7085b
If the nodes you tagged and registered use multiple disks, you can set director to use a specific root disk on each node. For more information, see Section 2.5, “Defining the root disk for multi-disk clusters”.
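As a hedged illustration of that approach, director reads a root_device hint from the bare metal node properties; the serial number and node UUID below are placeholders, and Section 2.5 describes the full procedure and the other supported hints:
# the root_device hint value shown is a placeholder; see Section 2.5 for the supported hint keys
$ openstack baremetal node set --property root_device='{"serial": "<serial_number>"}' <node_UUID>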
Redeploying the overcloud with additional Ceph Storage nodes
After you register and tag the new nodes, you can scale up the number of Ceph Storage nodes by redeploying the overcloud.
Procedure
Before you redeploy the overcloud, set the CephStorageCount parameter in the parameter_defaults section of your environment file, in this case, ~/templates/storage-config.yaml. In Section 7.1, “Assigning nodes and flavors to roles”, the overcloud is configured to deploy with three Ceph Storage nodes. The following example scales the overcloud to six nodes:
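A minimal sketch of that entry, assuming CephStorageCount is the only value that changes in the file:
parameter_defaults:
  # total number of Ceph Storage nodes in the scaled-out overcloud
  CephStorageCount: 6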
Redeploy the overcloud. The overcloud now has six Ceph Storage nodes instead of three.
11.2. Scaling down and replacing Ceph Storage nodes
In some cases, you might need to scale down your Ceph cluster, or even replace a Ceph Storage node, for example, if a Ceph Storage node is faulty. In either situation, you must disable and rebalance any Ceph Storage node that you want to remove from the overcloud to avoid data loss.
This procedure uses steps from the Red Hat Ceph Storage Administration Guide to manually remove Ceph Storage nodes. For more in-depth information about manual removal of Ceph Storage nodes, see Starting, stopping, and restarting Ceph daemons that run in containers and Removing a Ceph OSD using the command-line interface.
Procedure
Log in to a Controller node as the heat-admin user. The director stack user has an SSH key to access the heat-admin user.
List the OSD tree and find the OSDs for your node. For example, the node you want to remove might contain the following OSDs:
-2 0.09998 host overcloud-cephstorage-0
 0 0.04999         osd.0  up  1.00000  1.00000
 1 0.04999         osd.1  up  1.00000  1.00000
Disable the OSDs on the Ceph Storage node. In this case, the OSD IDs are 0 and 1.
[heat-admin@overcloud-controller-0 ~]$ sudo podman exec ceph-mon-<HOSTNAME> ceph osd out 0
[heat-admin@overcloud-controller-0 ~]$ sudo podman exec ceph-mon-<HOSTNAME> ceph osd out 1
The Ceph Storage cluster begins rebalancing. Wait for this process to complete. Follow the status by using the following command:
[heat-admin@overcloud-controller-0 ~]$ sudo podman exec ceph-mon-<HOSTNAME> ceph -w
After the Ceph cluster completes rebalancing, log in to the Ceph Storage node you are removing, in this case, overcloud-cephstorage-0, as the heat-admin user, and stop and disable the OSD services on that node:
[heat-admin@overcloud-cephstorage-0 ~]$ sudo systemctl stop ceph-osd@0
[heat-admin@overcloud-cephstorage-0 ~]$ sudo systemctl stop ceph-osd@1
[heat-admin@overcloud-cephstorage-0 ~]$ sudo systemctl disable ceph-osd@0
[heat-admin@overcloud-cephstorage-0 ~]$ sudo systemctl disable ceph-osd@1
Log back in to the Controller node and remove the OSDs from the CRUSH map so that they no longer receive data:
[heat-admin@overcloud-controller-0 ~]$ sudo podman exec ceph-mon-<HOSTNAME> ceph osd crush remove osd.0
[heat-admin@overcloud-controller-0 ~]$ sudo podman exec ceph-mon-<HOSTNAME> ceph osd crush remove osd.1
Remove the OSD authentication keys:
[heat-admin@overcloud-controller-0 ~]$ sudo podman exec ceph-mon-<HOSTNAME> ceph auth del osd.0
[heat-admin@overcloud-controller-0 ~]$ sudo podman exec ceph-mon-<HOSTNAME> ceph auth del osd.1
Remove the OSDs from the cluster:
[heat-admin@overcloud-controller-0 ~]$ sudo podman exec ceph-mon-<HOSTNAME> ceph osd rm 0
[heat-admin@overcloud-controller-0 ~]$ sudo podman exec ceph-mon-<HOSTNAME> ceph osd rm 1
Remove the Storage node from the CRUSH map:
[heat-admin@overcloud-controller-0 ~]$ sudo podman exec ceph-mon-<HOSTNAME> ceph osd crush rm <NODE>
You can confirm the <NODE> name as defined in the CRUSH map by searching the CRUSH tree:
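A minimal sketch of that check, reusing the ceph-mon container from the previous steps; the JSON output format is requested here so that each bucket's items list is visible:
# -f json-pretty is one way to expose the per-bucket items lists; adjust the output format as you prefer
[heat-admin@overcloud-controller-0 ~]$ sudo podman exec ceph-mon-<HOSTNAME> ceph osd crush tree -f json-pretty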
In the CRUSH tree, ensure that the items list for the removed node is empty. If the list is not empty, revisit the step in which you removed the OSDs from the CRUSH map.
Leave the node and return to the undercloud as the stack user:
[heat-admin@overcloud-controller-0 ~]$ exit
[stack@director ~]$
Disable the Ceph Storage node so that director does not reprovision it:
[stack@director ~]$ openstack baremetal node list
[stack@director ~]$ openstack baremetal node maintenance set UUID
Removing a Ceph Storage node requires an update to the overcloud stack in director with the local template files. First identify the UUID of the overcloud stack:
$ openstack stack list
Identify the UUID of the Ceph Storage node you want to delete:
$ openstack server list
Delete the node from the stack and update the plan accordingly:
Important: If you passed any extra environment files when you created the overcloud, pass them again here using the -e option to avoid making undesired changes to the overcloud. For more information, see Modifying the overcloud environment in the Director Installation and Usage guide.
$ openstack overcloud node delete \
  --stack <stack-name> \
  --templates \
  -e <other-environment-files> \
  <node_UUID>
Wait until the stack completes its update. Use the heat stack-list --show-nested command to monitor the stack update.
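For example, from the undercloud; the second command is the unified-client equivalent and assumes the orchestration plugin for the openstack client is installed:
$ heat stack-list --show-nested
# equivalent with the unified client, assuming the orchestration plugin is available
$ openstack stack list --nested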
Add new nodes to the director node pool and deploy them as Ceph Storage nodes. Use the CephStorageCount parameter in the parameter_defaults section of your environment file, in this case, ~/templates/storage-config.yaml, to define the total number of Ceph Storage nodes in the overcloud.
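For example, to keep the overcloud at three Ceph Storage nodes after the replacement; the count shown is illustrative, so set it to the total that your deployment needs:
parameter_defaults:
  # illustrative total; use the number of Ceph Storage nodes your overcloud requires
  CephStorageCount: 3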
Note: For more information about how to define the number of nodes per role, see Section 7.1, “Assigning nodes and flavors to roles”.
After you update your environment file, redeploy the overcloud:
$ openstack overcloud deploy --templates -e <ENVIRONMENT_FILE>
Director provisions the new node and updates the entire stack with the details of the new node.
Log in to a Controller node as the heat-admin user and check the status of the Ceph Storage node:
[heat-admin@overcloud-controller-0 ~]$ sudo ceph status
Confirm that the value in the osdmap section matches the number of nodes that you want in your cluster. The Ceph Storage node that you removed is replaced with a new node.
11.3. Adding an OSD to a Ceph Storage node
This procedure demonstrates how to add an OSD to a node. For more information about Ceph OSDs, see Ceph OSDs in the Red Hat Ceph Storage Operations Guide.
Procedure
Notice the following heat template deploys Ceph Storage with three OSD devices:
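A minimal sketch of such a template, assuming the CephAnsibleDisksConfig format described in Section 5.3; the osd_scenario and osd_objectstore values shown are the usual lvm and bluestore defaults and might differ in your environment:
parameter_defaults:
  CephAnsibleDisksConfig:
    devices:
      - /dev/sdb
      - /dev/sdc
      - /dev/sdd
    # assumed defaults; keep whatever values your existing template already uses
    osd_scenario: lvm
    osd_objectstore: bluestore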
To add an OSD, update the node disk layout as described in Section 5.3, “Mapping the Ceph Storage node disk layout”. In this example, add /dev/sde to the template:
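Continuing the sketch above, the devices list gains the new disk:
parameter_defaults:
  CephAnsibleDisksConfig:
    devices:
      - /dev/sdb
      - /dev/sdc
      - /dev/sdd
      - /dev/sde
    # assumed defaults; keep whatever values your existing template already uses
    osd_scenario: lvm
    osd_objectstore: bluestore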
Run openstack overcloud deploy to update the overcloud.
In this example, all hosts with OSDs have a new device called /dev/sde. If you do not want all nodes to have the new device, update the heat template. For information about how to define hosts with a differing devices list, see Section 5.5, “Overriding parameters for dissimilar Ceph Storage nodes” and Section 5.5.1.2, “Altering the disk layout in Ceph Storage nodes”.
11.4. Removing an OSD from a Ceph Storage node
This procedure demonstrates how to remove an OSD from a node. It assumes the following about the environment:
- A server (ceph-storage0) has an OSD (ceph-osd@4) running on /dev/sde.
- The Ceph monitor service (ceph-mon) is running on controller0.
- There are enough available OSDs to ensure the storage cluster is not at its near-full ratio.
For more information about Ceph OSDs, see Ceph OSDs in the Red Hat Ceph Storage Operations Guide.
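One way to check the near-full assumption before you begin is to query utilization through the Ceph monitor container; the container name here matches the one used later in this procedure:
# assumes the ceph-mon-controller0 container name used in the steps below
[root@controller0 ~]# podman exec ceph-mon-controller0 ceph df
[root@controller0 ~]# podman exec ceph-mon-controller0 ceph osd df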
Procedure
SSH into ceph-storage0 and log in as root.
Disable and stop the OSD service:
[root@ceph-storage0 ~]# systemctl disable ceph-osd@4
[root@ceph-storage0 ~]# systemctl stop ceph-osd@4
Disconnect from ceph-storage0.
SSH into controller0 and log in as root.
Identify the name of the Ceph monitor container:
[root@controller0 ~]# podman ps | grep ceph-mon
ceph-mon-controller0
Use the Ceph monitor container to mark the undesired OSD as out:
[root@controller0 ~]# podman exec ceph-mon-controller0 ceph osd out 4
Note: This command causes Ceph to rebalance the storage cluster and copy data to other OSDs in the cluster. The cluster temporarily leaves the active+clean state until rebalancing is complete.
Run the following command and wait for the storage cluster state to become active+clean:
[root@controller0 ~]# podman exec ceph-mon-controller0 ceph -w
Remove the OSD from the CRUSH map so that it no longer receives data:
[root@controller0 ~]# podman exec ceph-mon-controller0 ceph osd crush remove osd.4
Remove the OSD authentication key:
[root@controller0 ~]# podman exec ceph-mon-controller0 ceph auth del osd.4
Remove the OSD:
[root@controller0 ~]# podman exec ceph-mon-controller0 ceph osd rm 4
Disconnect from controller0.
SSH into the undercloud as the stack user and locate the heat environment file in which you defined the CephAnsibleDisksConfig parameter.
Notice the heat template contains four OSDs:
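A minimal sketch of that template, following the same assumed CephAnsibleDisksConfig format as in Section 11.3:
parameter_defaults:
  CephAnsibleDisksConfig:
    devices:
      - /dev/sdb
      - /dev/sdc
      - /dev/sdd
      - /dev/sde
    # assumed defaults; keep whatever values your existing template already uses
    osd_scenario: lvm
    osd_objectstore: bluestore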
Modify the template to remove /dev/sde:
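Continuing the sketch, the devices list now omits the removed disk:
parameter_defaults:
  CephAnsibleDisksConfig:
    devices:
      - /dev/sdb
      - /dev/sdc
      - /dev/sdd
    # assumed defaults; keep whatever values your existing template already uses
    osd_scenario: lvm
    osd_objectstore: bluestore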
Run openstack overcloud deploy to update the overcloud.
Note: In this example, you remove the /dev/sde device from all hosts with OSDs. If you do not remove the same device from all nodes, update the heat template. For information about how to define hosts with a differing devices list, see Section 5.5, “Overriding parameters for dissimilar Ceph Storage nodes”.