Dieser Inhalt ist in der von Ihnen ausgewählten Sprache nicht verfügbar.
AI workloads
Running AI workloads on OpenShift Container Platform
Abstract
Chapter 1. Overview of AI workloads on OpenShift Container Platform Link kopierenLink in die Zwischenablage kopiert!
OpenShift Container Platform provides a secure, scalable foundation for running artificial intelligence (AI) workloads across training, inference, and data science workflows.
1.1. Operators for running AI workloads Link kopierenLink in die Zwischenablage kopiert!
You can use Operators to run artificial intelligence (AI) and machine learning (ML) workloads on OpenShift Container Platform. With Operators, you can build a customized environment that meets your specific AI/ML requirements while continuing to use OpenShift Container Platform as the core platform for your applications.
OpenShift Container Platform provides several Operators that can help you run AI workloads:
- Red Hat build of Kueue
You can use Red Hat build of Kueue to provide structured queues and prioritization so that workloads are handled fairly and efficiently. Without proper prioritization, important jobs might be delayed while less critical jobs occupy resources.
For more information, see "Introduction to Red Hat build of Kueue".
- Leader Worker Set Operator
You can use the Leader Worker Set Operator to enable large-scale AI inference workloads to run reliably across nodes with synchronization between leader and worker processes. Without proper coordination, large training runs might fail or stall.
For more information, see "Leader Worker Set Operator overview".
Chapter 2. Red Hat build of Kueue Link kopierenLink in die Zwischenablage kopiert!
2.1. Introduction to Red Hat build of Kueue Link kopierenLink in die Zwischenablage kopiert!
Red Hat build of Kueue is a Kubernetes-native system that manages access to resources for jobs. Red Hat build of Kueue can determine when a job waits, is admitted to start by creating pods, or should be preempted, meaning that active pods for that job are deleted.
In the context of Red Hat build of Kueue, a job can be defined as a one-time or on-demand task that runs to completion.
Red Hat build of Kueue is based on the Kueue open source project.
Red Hat build of Kueue is compatible with environments that use heterogeneous, elastic resources. This means that the environment has many different resource types, and those resources are capable of dynamic scaling.
Red Hat build of Kueue does not replace any existing components in a Kubernetes cluster, but instead integrates with the existing Kubernetes API server, scheduler, and cluster autoscaler components.
Red Hat build of Kueue supports all-or-nothing semantics. This means that either an entire job with all of its components is admitted to the cluster, or the entire job is rejected if it does not fit on the cluster.
2.1.1. Personas Link kopierenLink in die Zwischenablage kopiert!
Different personas exist in a Red Hat build of Kueue workflow.
- Batch administrators
- Batch administrators manage the cluster infrastructure and establish quotas and queues.
- Batch users
- Batch users run jobs on the cluster. Examples of batch users might be researchers, AI/ML engineers, or data scientists.
- Serving users
- Serving users run jobs on the cluster. For example, to expose a trained AI/ML model for inference.
- Platform developers
- Platform developers integrate Red Hat build of Kueue with other software. They might also contribute to the Kueue open source project.
2.1.2. Workflow overview Link kopierenLink in die Zwischenablage kopiert!
The Red Hat build of Kueue workflow can be described at a high level as follows:
-
Batch administrators create and configure
ResourceFlavor
,LocalQueue
, andClusterQueue
resources. - User personas create jobs on the cluster.
- The Kubernetes API server validates and accepts job data.
-
Red Hat build of Kueue admits jobs based on configured options, such as order or quota. It injects affinity into the job by using resource flavors, and creates a
Workload
object that corresponds to each job. - The applicable controller for the job type creates pods.
- The Kubernetes scheduler assigns pods to a node in the cluster.
- The Kubernetes cluster autoscaler provisions more nodes as required.
2.2. Release notes Link kopierenLink in die Zwischenablage kopiert!
Red Hat build of Kueue is released as an Operator that is supported on OpenShift Container Platform.
2.2.1. Compatible environments Link kopierenLink in die Zwischenablage kopiert!
Before you install Red Hat build of Kueue, review this section to ensure that your cluster meets the requirements.
2.2.1.1. Supported architectures Link kopierenLink in die Zwischenablage kopiert!
Red Hat build of Kueue version 1.1 and later is supported on the following architectures:
- ARM64
- 64-bit x86
- ppc64le (IBM Power®)
- s390x (IBM Z®)
2.2.1.2. Supported platforms Link kopierenLink in die Zwischenablage kopiert!
Red Hat build of Kueue version 1.1 and later is supported on the following platforms:
- OpenShift Container Platform
- Hosted control planes for OpenShift Container Platform
Currently, Red Hat build of Kueue is not supported on Red Hat build of MicroShift (MicroShift).
2.2.2. Release notes for Red Hat build of Kueue version 1.1 Link kopierenLink in die Zwischenablage kopiert!
Red Hat build of Kueue version 1.1 is a generally available release that is supported on OpenShift Container Platform versions 4.18 and later. Red Hat build of Kueue version 1.1 uses Kueue version 0.12.
If you have a previously installed version of Red Hat build of Kueue on your cluster, you must uninstall the Operator and manually install version 1.1. For information see Upgrading Red Hat build of Kueue.
2.2.2.1. New features and enhancements Link kopierenLink in die Zwischenablage kopiert!
- Configure a default local queue
A default local queue serves as the local queue for newly created jobs that do not have the
kueue.x-k8s.io/queue-name
label. After you create a default local queue, any new jobs created in the namespace without akueue.x-k8s.io/queue-name
label automatically update to have thekueue.x-k8s.io/queue-name: default
label.(RFE-7615)
- Multi-architecture and Hosted control planes support
With this release, Red Hat build of Kueue is supported on multiple different architectures, including ARM64, 64-bit x86, ppc64le (IBM Power®), and s390x (IBM Z®), as well as on Hosted control planes for OpenShift Container Platform.
2.2.2.2. Fixed issues Link kopierenLink in die Zwischenablage kopiert!
- You can create a
Kueue
custom resource by using the OpenShift Container Platform web console Before this update, if you tried to use the OpenShift Container Platform web console to create a
Kueue
custom resource (CR) by using the form view, the web console showed an error and the resource could not be created. With this release, the default namespace was removed from theKueue
CR template. As a result, you can use the OpenShift Container Platform web console to create aKueue
CR by using the form view.
2.2.2.3. Known issues Link kopierenLink in die Zwischenablage kopiert!
Kueue
CR description reads as "Not available" in the OpenShift Container Platform web consoleAfter you install Red Hat build of Kueue, in the Operator details view, the description for the
Kueue
CR reads as "Not available". This issue does not affect or degrade the Red Hat build of Kueue Operator functionality.- Custom resources are not deleted properly when you uninstall Red Hat build of Kueue
After you uninstall the Red Hat Build of Kueue Operator using the Delete all operand instances for this operator option in the OpenShift Container Platform web console, some Red Hat build of Kueue custom resources are not fully deleted. These resources can be viewed in the Installed Operators view with the status Resource is being deleted. As a workaround, you can manually delete the resource finalizers to remove them fully.
2.2.3. Release notes for Red Hat build of Kueue version 1.0.1 Link kopierenLink in die Zwischenablage kopiert!
Red Hat build of Kueue version 1.0.1 is a patch release that is supported on OpenShift Container Platform versions 4.18 and 4.19 on the 64-bit x86 architecture.
Red Hat build of Kueue version 1.0.1 uses Kueue version 0.11.
2.2.3.1. Bug fixes in Red Hat build of Kueue version 1.0.1 Link kopierenLink in die Zwischenablage kopiert!
- Previously, leader election for Red Hat build of Kueue was not configured to tolerate disruption, which resulted in frequent crashing. With this release, the leader election values for Red Hat build of Kueue have been updated to match the durations recommended for OpenShift Container Platform. (OCPBUGS-58496)
-
Previously, the
ReadyReplicas
count was not set in the reconciler, which meant that the Red Hat build of Kueue Operator status would report that there were no replicas ready. With this release, theReadyReplicas
count is based on the number of ready replicas for the deployment, which ensures that the Operator shows as ready in the OpenShift Container Platform console when thekueue-controller-manager
pods are ready. (OCPBUGS-59261) -
Previously, when the
Kueue
custom resource (CR) was deleted from theopenshift-kueue-operator
namespace, thekueue-manager-config
config map was not deleted automatically and could remain in the namespace. With this release, thekueue-manager-config
config map,kueue-webhook-server-cert
secret, andmetrics-server-cert
secret are deleted automatically when theKueue
CR is deleted. (OCPBUGS-57960)
2.2.4. Release notes for Red Hat build of Kueue version 1.0 Link kopierenLink in die Zwischenablage kopiert!
Red Hat build of Kueue version 1.0 is a generally available release that is supported on OpenShift Container Platform versions 4.18 and 4.19 on the 64-bit x86 architecture. Red Hat build of Kueue version 1.0 uses Kueue version 0.11.
2.2.4.1. New features and enhancements Link kopierenLink in die Zwischenablage kopiert!
- Role-based access control (RBAC)
- Role-based access control (RBAC) enables you to control which types of users can create which types of Red Hat build of Kueue resources.
- Configure resource quotas
- Configuring resource quotas by creating cluster queues, resource flavors, and local queues enables you to control the amount of resources used by user-submitted jobs and workloads.
- Control job and workload management
- Labeling namespaces and configuring label policies enable you to control which jobs and workloads are managed by Red Hat build of Kueue.
- Share borrowable resources between queues
- Configuring cohorts, fair sharing, and gang scheduling settings enable you to share unused, borrowable resources between queues.
2.2.4.2. Known issues Link kopierenLink in die Zwischenablage kopiert!
- Jobs in all namespaces are reconciled if they have the
kueue.x-k8s.io/queue-name
label Red Hat build of Kueue uses the
managedJobsNamespaceSelector
configuration field, so that administrators can configure which namespaces opt in to be managed by Red Hat build of Kueue. Because namespaces must be manually configured to opt in to being managed by Red Hat build of Kueue, resources in system or third-party namespaces are not impacted or managed by Red Hat build of Kueue.The behavior in Red Hat build of Kueue 1.0 allows reconciliation of
Job
resources that have thekueue.x-k8s.io/queue-name
label, even if these resources are in namespaces that are not configured to opt in to being managed by Red Hat build of Kueue. This is inconsistent with the behavior for other core integrations like pods, deployments, and stateful sets, which are only reconciled if they are in namespaces that have been configured to opt in to being managed by Red Hat build of Kueue.- You cannot create a
Kueue
custom resource by using the OpenShift Container Platform web console If you try to use the OpenShift Container Platform web console to create a
Kueue
custom resource (CR) by using the form view, the web console shows an error and the resource cannot be created. As a workaround, use the YAML view to create aKueue
CR instead.
2.3. Installing Red Hat build of Kueue Link kopierenLink in die Zwischenablage kopiert!
You can install Red Hat build of Kueue by using the Red Hat Build of Kueue Operator in OperatorHub.
2.3.1. Compatible environments Link kopierenLink in die Zwischenablage kopiert!
Before you install Red Hat build of Kueue, review this section to ensure that your cluster meets the requirements.
2.3.1.1. Supported architectures Link kopierenLink in die Zwischenablage kopiert!
Red Hat build of Kueue version 1.1 and later is supported on the following architectures:
- ARM64
- 64-bit x86
- ppc64le (IBM Power®)
- s390x (IBM Z®)
2.3.1.2. Supported platforms Link kopierenLink in die Zwischenablage kopiert!
Red Hat build of Kueue version 1.1 and later is supported on the following platforms:
- OpenShift Container Platform
- Hosted control planes for OpenShift Container Platform
Currently, Red Hat build of Kueue is not supported on Red Hat build of MicroShift (MicroShift).
2.3.2. Installing the Red Hat Build of Kueue Operator Link kopierenLink in die Zwischenablage kopiert!
You can install the Red Hat Build of Kueue Operator on a OpenShift Container Platform cluster by using the OperatorHub in the web console.
Prerequisites
- You have administrator permissions on a OpenShift Container Platform cluster.
- You have access to the OpenShift Container Platform web console.
- You have installed and configured the cert-manager Operator for Red Hat OpenShift for your cluster.
Procedure
- In the OpenShift Container Platform web console, click Operators → OperatorHub.
- Choose Red Hat Build of Kueue Operator from the list of available Operators, and click Install.
Verification
- Go to Operators → Installed Operators and confirm that the Red Hat Build of Kueue Operator is listed with Status as Succeeded.
2.3.3. Upgrading Red Hat build of Kueue Link kopierenLink in die Zwischenablage kopiert!
If you have previously installed Red Hat build of Kueue, you must manually upgrade your deployment to the latest version to use the latest bug fixes and feature enhancements.
Prerequisites
- You have installed a previous version of Red Hat build of Kueue.
- You are logged in to the OpenShift Container Platform web console with cluster administrator permissions.
Procedure
- In the OpenShift Container Platform web console, click Operators → Installed Operators, then select Red Hat build of Kueue from the list.
- From the Actions drop-down menu, select Uninstall Operator.
The Uninstall Operator? dialog box opens. Click Uninstall.
ImportantSelecting the Delete all operand instances for this operator checkbox before clicking Uninstall deletes all existing resources from the cluster, including:
-
The
Kueue
CR - Any cluster queues, local queues, or resource flavors that you have created
Leave this box unchecked when upgrading your cluster to retain your created resources.
-
The
- In the OpenShift Container Platform web console, click Operators → OperatorHub.
- Choose Red Hat Build of Kueue Operator from the list of available Operators, and click Install.
Verification
- Go to Operators → Installed Operators.
- Confirm that the Red Hat Build of Kueue Operator is listed with Status as Succeeded.
- Confirm that the version shown under the Operator name in the list is the latest version.
2.3.4. Creating a Kueue custom resource Link kopierenLink in die Zwischenablage kopiert!
After you have installed the Red Hat Build of Kueue Operator, you must create a Kueue
custom resource (CR) to configure your installation.
Prerequisites
Ensure that you have completed the following prerequisites:
- The Red Hat build of Kueue Operator is installed on your cluster.
-
You have cluster administrator permissions and the
kueue-batch-admin-role
role. - You have access to the OpenShift Container Platform web console.
Procedure
- In the OpenShift Container Platform web console, click Operators → Installed Operators.
- In the Provided APIs table column, click Kueue. This takes you to the Kueue tab of the Operator details page.
- Click Create Kueue. This takes you to the Create Kueue YAML view.
Enter the details for your
Kueue
CR.Example
Kueue
CRCopy to Clipboard Copied! Toggle word wrap Toggle overflow - 1
- The name of the
Kueue
CR must becluster
. - 2
- If you want to configure Red Hat build of Kueue for use with other workload types, add those types here. For the default configuration, only the
BatchJob
type is recommended and supported. - 3
- Optional: If you want to configure fair sharing for Red Hat build of Kueue, set the
preemptionPolicy
value toFairSharing
. The default setting in theKueue
CR isClassical
preemption.
- Click Create.
Verification
-
After you create the
Kueue
CR, the web console brings you to the Operator details page, where you can see the CR in the list of Kueues. Optional: If you have the OpenShift CLI (
oc
) installed, you can run the following command and observe the output to confirm that yourKueue
CR has been created successfully:oc get kueue
$ oc get kueue
Copy to Clipboard Copied! Toggle word wrap Toggle overflow Example output
NAME AGE cluster 4m
NAME AGE cluster 4m
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
2.3.5. Labeling namespaces to allow Red Hat build of Kueue to manage jobs Link kopierenLink in die Zwischenablage kopiert!
The Red Hat build of Kueue Operator uses an opt-in webhook mechanism to ensure that policies are only enforced for the jobs and namespaces that it is expected to target.
You must label the namespaces where you want Red Hat build of Kueue to manage jobs with the kueue.openshift.io/managed=true
label.
Prerequisites
- You have cluster administrator permissions.
-
The Red Hat build of Kueue Operator is installed on your cluster, and you have created a
Kueue
custom resource (CR). -
You have installed the OpenShift CLI (
oc
).
Procedure
Add the
kueue.openshift.io/managed=true
label to a namespace by running the following command:oc label namespace <namespace> kueue.openshift.io/managed=true
$ oc label namespace <namespace> kueue.openshift.io/managed=true
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
When you add this label, you instruct the Red Hat build of Kueue Operator that the namespace is managed by its webhook admission controllers. As a result, any Red Hat build of Kueue resources within that namespace are properly validated and mutated.
2.4. Installing Red Hat build of Kueue in a disconnected environment Link kopierenLink in die Zwischenablage kopiert!
Before you can install Red Hat build of Kueue on a disconnected OpenShift Container Platform cluster, you must enable Operator Lifecycle Manager (OLM) in disconnected environments by completing the following steps:
- Disable the default remote OperatorHub sources for OLM.
- Use a workstation with full internet access to create and push local mirrors of the OperatorHub content to a mirror registry.
- Configure OLM to install and manage Operators from local sources on the mirror registry instead of the default remote sources.
After enabling OLM in a disconnected environment, you can continue to use your unrestricted workstation to keep your local OperatorHub sources updated as newer versions of Operators are released.
For full documentation on completing these steps, see the OpenShift Container Platform documentation on Using Operator Lifecycle Manager in disconnected environments.
2.4.1. Compatible environments Link kopierenLink in die Zwischenablage kopiert!
Before you install Red Hat build of Kueue, review this section to ensure that your cluster meets the requirements.
2.4.1.1. Supported architectures Link kopierenLink in die Zwischenablage kopiert!
Red Hat build of Kueue version 1.1 and later is supported on the following architectures:
- ARM64
- 64-bit x86
- ppc64le (IBM Power®)
- s390x (IBM Z®)
2.4.1.2. Supported platforms Link kopierenLink in die Zwischenablage kopiert!
Red Hat build of Kueue version 1.1 and later is supported on the following platforms:
- OpenShift Container Platform
- Hosted control planes for OpenShift Container Platform
Currently, Red Hat build of Kueue is not supported on Red Hat build of MicroShift (MicroShift).
2.4.2. Installing the Red Hat Build of Kueue Operator Link kopierenLink in die Zwischenablage kopiert!
You can install the Red Hat Build of Kueue Operator on a OpenShift Container Platform cluster by using the OperatorHub in the web console.
Prerequisites
- You have administrator permissions on a OpenShift Container Platform cluster.
- You have access to the OpenShift Container Platform web console.
- You have installed and configured the cert-manager Operator for Red Hat OpenShift for your cluster.
Procedure
- In the OpenShift Container Platform web console, click Operators → OperatorHub.
- Choose Red Hat Build of Kueue Operator from the list of available Operators, and click Install.
Verification
- Go to Operators → Installed Operators and confirm that the Red Hat Build of Kueue Operator is listed with Status as Succeeded.
2.4.3. Upgrading Red Hat build of Kueue Link kopierenLink in die Zwischenablage kopiert!
If you have previously installed Red Hat build of Kueue, you must manually upgrade your deployment to the latest version to use the latest bug fixes and feature enhancements.
Prerequisites
- You have installed a previous version of Red Hat build of Kueue.
- You are logged in to the OpenShift Container Platform web console with cluster administrator permissions.
Procedure
- In the OpenShift Container Platform web console, click Operators → Installed Operators, then select Red Hat build of Kueue from the list.
- From the Actions drop-down menu, select Uninstall Operator.
The Uninstall Operator? dialog box opens. Click Uninstall.
ImportantSelecting the Delete all operand instances for this operator checkbox before clicking Uninstall deletes all existing resources from the cluster, including:
-
The
Kueue
CR - Any cluster queues, local queues, or resource flavors that you have created
Leave this box unchecked when upgrading your cluster to retain your created resources.
-
The
- In the OpenShift Container Platform web console, click Operators → OperatorHub.
- Choose Red Hat Build of Kueue Operator from the list of available Operators, and click Install.
Verification
- Go to Operators → Installed Operators.
- Confirm that the Red Hat Build of Kueue Operator is listed with Status as Succeeded.
- Confirm that the version shown under the Operator name in the list is the latest version.
2.4.4. Creating a Kueue custom resource Link kopierenLink in die Zwischenablage kopiert!
After you have installed the Red Hat Build of Kueue Operator, you must create a Kueue
custom resource (CR) to configure your installation.
Prerequisites
Ensure that you have completed the following prerequisites:
- The Red Hat build of Kueue Operator is installed on your cluster.
-
You have cluster administrator permissions and the
kueue-batch-admin-role
role. - You have access to the OpenShift Container Platform web console.
Procedure
- In the OpenShift Container Platform web console, click Operators → Installed Operators.
- In the Provided APIs table column, click Kueue. This takes you to the Kueue tab of the Operator details page.
- Click Create Kueue. This takes you to the Create Kueue YAML view.
Enter the details for your
Kueue
CR.Example
Kueue
CRCopy to Clipboard Copied! Toggle word wrap Toggle overflow - 1
- The name of the
Kueue
CR must becluster
. - 2
- If you want to configure Red Hat build of Kueue for use with other workload types, add those types here. For the default configuration, only the
BatchJob
type is recommended and supported. - 3
- Optional: If you want to configure fair sharing for Red Hat build of Kueue, set the
preemptionPolicy
value toFairSharing
. The default setting in theKueue
CR isClassical
preemption.
- Click Create.
Verification
-
After you create the
Kueue
CR, the web console brings you to the Operator details page, where you can see the CR in the list of Kueues. Optional: If you have the OpenShift CLI (
oc
) installed, you can run the following command and observe the output to confirm that yourKueue
CR has been created successfully:oc get kueue
$ oc get kueue
Copy to Clipboard Copied! Toggle word wrap Toggle overflow Example output
NAME AGE cluster 4m
NAME AGE cluster 4m
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
2.4.5. Labeling namespaces to allow Red Hat build of Kueue to manage jobs Link kopierenLink in die Zwischenablage kopiert!
The Red Hat build of Kueue Operator uses an opt-in webhook mechanism to ensure that policies are only enforced for the jobs and namespaces that it is expected to target.
You must label the namespaces where you want Red Hat build of Kueue to manage jobs with the kueue.openshift.io/managed=true
label.
Prerequisites
- You have cluster administrator permissions.
-
The Red Hat build of Kueue Operator is installed on your cluster, and you have created a
Kueue
custom resource (CR). -
You have installed the OpenShift CLI (
oc
).
Procedure
Add the
kueue.openshift.io/managed=true
label to a namespace by running the following command:oc label namespace <namespace> kueue.openshift.io/managed=true
$ oc label namespace <namespace> kueue.openshift.io/managed=true
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
When you add this label, you instruct the Red Hat build of Kueue Operator that the namespace is managed by its webhook admission controllers. As a result, any Red Hat build of Kueue resources within that namespace are properly validated and mutated.
2.5. Configuring role-based permissions Link kopierenLink in die Zwischenablage kopiert!
The following procedures provide information about how you can configure role-based access control (RBAC) for your Red Hat build of Kueue deployment. These RBAC permissions determine which types of users can create which types of Red Hat build of Kueue objects.
2.5.1. Cluster roles Link kopierenLink in die Zwischenablage kopiert!
The Red Hat build of Kueue Operator deploys kueue-batch-admin-role
and kueue-batch-user-role
cluster roles by default.
- kueue-batch-admin-role
- This cluster role includes the permissions to manage cluster queues, local queues, workloads, and resource flavors.
- kueue-batch-user-role
- This cluster role includes the permissions to manage jobs and to view local queues and workloads.
2.5.2. Configuring permissions for batch administrators Link kopierenLink in die Zwischenablage kopiert!
You can configure permissions for batch administrators by binding the kueue-batch-admin-role
cluster role to a user or group of users.
Prerequisites
- The Red Hat build of Kueue Operator is installed on your cluster.
- You have cluster administrator permissions.
-
You have installed the OpenShift CLI (
oc
).
Procedure
Create a
ClusterRoleBinding
object as a YAML file:Example
ClusterRoleBinding
objectCopy to Clipboard Copied! Toggle word wrap Toggle overflow Apply the
ClusterRoleBinding
object:oc apply -f <filename>.yaml
$ oc apply -f <filename>.yaml
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
Verification
You can verify that the
ClusterRoleBinding
object was applied correctly by running the following command and verifying that the output contains the correct information for thekueue-batch-admin-role
cluster role:$ oc describe clusterrolebinding.rbac
$ oc describe clusterrolebinding.rbac
Copy to Clipboard Copied! Toggle word wrap Toggle overflow Example output
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
2.5.3. Configuring permissions for users Link kopierenLink in die Zwischenablage kopiert!
You can configure permissions for Red Hat build of Kueue users by binding the kueue-batch-user-role
cluster role to a user or group of users.
Prerequisites
- The Red Hat build of Kueue Operator is installed on your cluster.
- You have cluster administrator permissions.
-
You have installed the OpenShift CLI (
oc
).
Procedure
Create a
RoleBinding
object as a YAML file:Example
ClusterRoleBinding
objectCopy to Clipboard Copied! Toggle word wrap Toggle overflow Apply the
RoleBinding
object:oc apply -f <filename>.yaml
$ oc apply -f <filename>.yaml
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
Verification
You can verify that the
RoleBinding
object was applied correctly by running the following command and verifying that the output contains the correct information for thekueue-batch-user-role
cluster role:$ oc describe rolebinding.rbac
$ oc describe rolebinding.rbac
Copy to Clipboard Copied! Toggle word wrap Toggle overflow Example output
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
2.6. Configuring quotas Link kopierenLink in die Zwischenablage kopiert!
As an administrator, you can use Red Hat build of Kueue to configure quotas to optimize resource allocation and system throughput for user workloads. You can configure quotas for compute resources such as CPU, memory, pods, and GPU.
You can configure quotas in Red Hat build of Kueue by completing the following steps:
- Configure a cluster queue.
- Configure a resource flavor.
- Configure a local queue.
Users can then submit their workloads to the local queue.
2.6.1. Configuring a cluster queue Link kopierenLink in die Zwischenablage kopiert!
A cluster queue is a cluster-scoped resource, represented by a ClusterQueue
object, that governs a pool of resources such as CPU, memory, and pods. Cluster queues can be used to define usage limits, quotas for resource flavors, order of consumption, and fair sharing rules.
The cluster queue is not ready for use until a ResourceFlavor
object has also been configured.
Prerequisites
- The Red Hat build of Kueue Operator is installed on your cluster.
-
You have cluster administrator permissions or the
kueue-batch-admin-role
role. -
You have installed the OpenShift CLI (
oc
).
Procedure
Create a
ClusterQueue
object as a YAML file:Example of a basic
ClusterQueue
object using a single resource flavorCopy to Clipboard Copied! Toggle word wrap Toggle overflow - 1
- Defines which namespaces can use the resources governed by this cluster queue. An empty
namespaceSelector
as shown in the example means that all namespaces can use these resources. - 2
- Defines the resource types governed by the cluster queue. This example
ClusterQueue
object governs CPU, memory, pod, and GPU resources. - 3
- Defines the resource flavor that is applied to the resource types listed. In this example, the
default-flavor
resource flavor is applied to CPU, memory, pod, and GPU resources. - 4
- Defines the resource requirements for admitting jobs. This example cluster queue only admits jobs if the following conditions are met:
- The sum of the CPU requests is less than or equal to 9.
- The sum of the memory requests is less than or equal to 36Gi.
- The total number of pods is less than or equal to 5.
- The sum of the GPU requests is less than or equal to 100.
Apply the
ClusterQueue
object by running the following command:oc apply -f <filename>.yaml
$ oc apply -f <filename>.yaml
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
2.6.2. Configuring a resource flavor Link kopierenLink in die Zwischenablage kopiert!
After you have configured a ClusterQueue
object, you can configure a ResourceFlavor
object.
Resources in a cluster are typically not homogeneous. If the resources in your cluster are homogeneous, you can use an empty ResourceFlavor
instead of adding labels to custom resource flavors.
You can use a custom ResourceFlavor
object to represent different resource variations that are associated with cluster nodes through labels, taints, and tolerations. You can then associate workloads with specific node types to enable fine-grained resource management.
Prerequisites
- The Red Hat build of Kueue Operator is installed on your cluster.
-
You have cluster administrator permissions or the
kueue-batch-admin-role
role. -
You have installed the OpenShift CLI (
oc
).
Procedure
Create a
ResourceFlavor
object as a YAML file:Example of an empty
ResourceFlavor
objectapiVersion: kueue.x-k8s.io/v1beta1 kind: ResourceFlavor metadata: name: default-flavor
apiVersion: kueue.x-k8s.io/v1beta1 kind: ResourceFlavor metadata: name: default-flavor
Copy to Clipboard Copied! Toggle word wrap Toggle overflow Example of a custom
ResourceFlavor
objectCopy to Clipboard Copied! Toggle word wrap Toggle overflow Apply the
ResourceFlavor
object by running the following command:oc apply -f <filename>.yaml
$ oc apply -f <filename>.yaml
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
2.6.3. Configuring a local queue Link kopierenLink in die Zwischenablage kopiert!
A local queue is a namespaced object, represented by a LocalQueue
object, that groups closely related workloads that belong to a single namespace.
As an administrator, you can configure a LocalQueue
object to point to a cluster queue. This allocates resources from the cluster queue to workloads in the namespace specified in the LocalQueue
object.
Prerequisites
- The Red Hat build of Kueue Operator is installed on your cluster.
-
You have cluster administrator permissions or the
kueue-batch-admin-role
role. -
You have installed the OpenShift CLI (
oc
). -
You have created a
ClusterQueue
object.
Procedure
Create a
LocalQueue
object as a YAML file:Example of a basic
LocalQueue
objectCopy to Clipboard Copied! Toggle word wrap Toggle overflow Apply the
LocalQueue
object by running the following command:oc apply -f <filename>.yaml
$ oc apply -f <filename>.yaml
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
2.6.4. Configuring a default local queue Link kopierenLink in die Zwischenablage kopiert!
As a cluster administrator, you can improve quota enforcement in your cluster by managing all jobs in selected namespaces without needing to explicitly label each job. You can do this by creating a default local queue.
A default local queue serves as the local queue for newly created jobs that do not have the kueue.x-k8s.io/queue-name
label. After you create a default local queue, any new jobs created in the namespace without a kueue.x-k8s.io/queue-name
label automatically update to have the kueue.x-k8s.io/queue-name: default
label.
Preexisting jobs in a namespace are not affected when you create a default local queue. If jobs already exist in the namespace before you create the default local queue, you must label those jobs explicitly to assign them to a queue.
Prerequisites
- You have installed Red Hat build of Kueue version 1.1 on your cluster.
-
You have cluster administrator permissions or the
kueue-batch-admin-role
role. -
You have installed the OpenShift CLI (
oc
). -
You have created a
ClusterQueue
object.
Procedure
Create a
LocalQueue
object nameddefault
as a YAML file:Example of a default
LocalQueue
objectCopy to Clipboard Copied! Toggle word wrap Toggle overflow Apply the
LocalQueue
object by running the following command:oc apply -f <filename>.yaml
$ oc apply -f <filename>.yaml
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
Verification
- Create a job in the same namespace as the default local queue.
-
Observe that the job updates with the
kueue.x-k8s.io/queue-name: default
label.
2.7. Managing jobs and workloads Link kopierenLink in die Zwischenablage kopiert!
Red Hat build of Kueue does not directly manipulate jobs that are created by users. Instead, Kueue manages Workload
objects that represent the resource requirements of a job. Red Hat build of Kueue automatically creates a workload for each job, and syncs any decisions and statuses between the two objects.
2.7.1. Labeling namespaces to allow Red Hat build of Kueue to manage jobs Link kopierenLink in die Zwischenablage kopiert!
The Red Hat build of Kueue Operator uses an opt-in webhook mechanism to ensure that policies are only enforced for the jobs and namespaces that it is expected to target.
You must label the namespaces where you want Red Hat build of Kueue to manage jobs with the kueue.openshift.io/managed=true
label.
Prerequisites
- You have cluster administrator permissions.
-
The Red Hat build of Kueue Operator is installed on your cluster, and you have created a
Kueue
custom resource (CR). -
You have installed the OpenShift CLI (
oc
).
Procedure
Add the
kueue.openshift.io/managed=true
label to a namespace by running the following command:oc label namespace <namespace> kueue.openshift.io/managed=true
$ oc label namespace <namespace> kueue.openshift.io/managed=true
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
When you add this label, you instruct the Red Hat build of Kueue Operator that the namespace is managed by its webhook admission controllers. As a result, any Red Hat build of Kueue resources within that namespace are properly validated and mutated.
2.7.2. Configuring label policies for jobs Link kopierenLink in die Zwischenablage kopiert!
The spec.config.workloadManagement.labelPolicy
spec in the Kueue
custom resource (CR) is an optional field that controls how Red Hat build of Kueue decides whether to manage or ignore different jobs. The allowed values are QueueName
, None
and empty (""
).
If the labelPolicy
setting is omitted or empty (""
), the default policy is that Red Hat build of Kueue manages jobs that have a kueue.x-k8s.io/queue-name
label, and ignores jobs that do not have the kueue.x-k8s.io/queue-name
label. This is the same workflow as if the labelPolicy
is set to QueueName
.
If the labelPolicy
setting is set to None
, jobs are managed by Red Hat build of Kueue even if they do not have the kueue.x-k8s.io/queue-name
label.
Example workloadManagement
spec configuration
Example user-created Job
object containing the kueue.x-k8s.io/queue-name
label
2.8. Using cohorts Link kopierenLink in die Zwischenablage kopiert!
You can use cohorts to group cluster queues and determine which cluster queues are able to share borrowable resources with each other. Borrowable resources are defined as the unused nominal quota of all the cluster queues in a cohort.
Using cohorts can help to optimize resource utilization by preventing under-utilization and enabling fair sharing configurations. Cohorts can also help to simplify resource management and allocation between teams, because you can group cluster queues for related workloads or for each team. You can also use cohorts to set resource quotas at a group level to define the limits for resources that a group of cluster queues can consume.
2.8.1. Configuring cohorts within a cluster queue spec Link kopierenLink in die Zwischenablage kopiert!
You can add a cluster queue to a cohort by specifying the name of the cohort in the .spec.cohort
field of the ClusterQueue
object, as shown in the following example:
All cluster queues that have a matching spec.cohort
are part of the same cohort.
If the spec.cohort
field is omitted, the cluster queue does not belong to any cohort and cannot access borrowable resources.
2.9. Configuring fair sharing Link kopierenLink in die Zwischenablage kopiert!
Fair sharing is a preemption strategy that is used to achieve an equal or weighted share of borrowable resources between the tenants of a cohort. Borrowable resources are the unused nominal quota of all the cluster queues in a cohort.
You can configure fair sharing by setting the preemptionPolicy
value in the Kueue
custom resource (CR) to FairSharing
.
2.10. Gang scheduling Link kopierenLink in die Zwischenablage kopiert!
Gang scheduling ensures that a group or gang of related jobs only start when all required resources are available. Red Hat build of Kueue enables gang scheduling by suspending jobs until the OpenShift Container Platform cluster can guarantee the capacity to start and execute all of the related jobs in the gang together. This is also known as all-or-nothing scheduling.
Gang scheduling is important if you are working with expensive, limited resources, such as GPUs. Gang scheduling can prevent jobs from claiming but not using GPUs, which can improve GPU utilization and can reduce running costs. Gang scheduling can also help to prevent issues like resource segmentation and deadlocking.
2.10.1. Configuring gang scheduling Link kopierenLink in die Zwischenablage kopiert!
As a cluster administrator, you can configure gang scheduling by modifying the gangScheduling
spec in the Kueue
custom resource (CR).
Example Kueue
CR with gang scheduling configured
- 1
- You can set the
policy
value to enable or disable gang scheduling. The possible values areByWorkload
,None
, or empty (""
).ByWorkload
-
When the
policy
value is set toByWorkload
, each job is processed and considered for admission as a single unit. If the job does not become ready within the specified time, the entire job is evicted and retried at a later time. None
-
When the
policy
value is set toNone
, gang scheduling is disabled. - Empty (
""
) -
When the
policy
value is empty or set to""
, the Red Hat build of Kueue Operator determines settings for gang scheduling. Currently, gang scheduling is disabled by default.
- 2
- If the
policy
value is set toByWorkload
, you must configure job admission settings. The possible values for theadmission
spec areParallel
,Sequential
, or empty (""
).Parallel
-
When the
admission
value is set toParallel
, pods from any job can be admitted at any time. This can cause a deadlock, where jobs are in contention for cluster capacity. When a deadlock occurs, the successful scheduling of pods from another job can prevent the scheduling of pods from the current job. Sequential
-
When the
admission
value is set toSequential
, only pods from the currently processing job are admitted. After all of the pods from the current job have been admitted and are ready, Red Hat build of Kueue processes the next job. Sequential processing can slow down admission when the cluster has sufficient capacity for multiple jobs, but provides a higher likelihood that all of the pods for a job are scheduled together successfully. - Empty (
""
) -
When the
admission
value is empty or set to""
, the Red Hat build of Kueue Operator determines job admission settings. Currently, theadmission
value is set toParallel
by default.
2.11. Running jobs with quota limits Link kopierenLink in die Zwischenablage kopiert!
You can run Kubernetes jobs with Red Hat build of Kueue enabled to manage resource allocation within defined quota limits. This can help to ensure predictable resource availability, cluster stability, and optimized performance.
2.11.1. Identifying available local queues Link kopierenLink in die Zwischenablage kopiert!
Before you can submit a job to a queue, you must find the name of the local queue.
Prerequisites
- A cluster administrator has installed and configured Red Hat build of Kueue on your OpenShift Container Platform cluster.
-
A cluster administrator has assigned you the
kueue-batch-user-role
cluster role. -
You have installed the OpenShift CLI (
oc
).
Procedure
Run the following command to list available local queues in your namespace:
oc -n <namespace> get localqueues
$ oc -n <namespace> get localqueues
Copy to Clipboard Copied! Toggle word wrap Toggle overflow Example output
NAME CLUSTERQUEUE PENDING WORKLOADS user-queue cluster-queue 3
NAME CLUSTERQUEUE PENDING WORKLOADS user-queue cluster-queue 3
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
2.11.2. Defining a job to run with Red Hat build of Kueue Link kopierenLink in die Zwischenablage kopiert!
When you are defining a job to run with Red Hat build of Kueue, ensure that it meets the following criteria:
-
Specify the local queue to submit the job to, by using the
kueue.x-k8s.io/queue-name
label. - Include the resource requests for each job pod.
Red Hat build of Kueue suspends the job, and then starts it when resources are available. Red Hat build of Kueue creates a corresponding workload, represented as a Workload
object with a name that matches the job.
Prerequisites
- A cluster administrator has installed and configured Red Hat build of Kueue on your OpenShift Container Platform cluster.
-
A cluster administrator has assigned you the
kueue-batch-user-role
cluster role. -
You have installed the OpenShift CLI (
oc
). - You have identified the name of the local queue that you want to submit jobs to.
Procedure
Create a
Job
object.Example job
Copy to Clipboard Copied! Toggle word wrap Toggle overflow Run the job by running the following command:
oc create -f <filename>.yaml
$ oc create -f <filename>.yaml
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
Verification
Verify that pods are running for the job you have created, by running the following command and observing the output:
oc get job <job-name>
$ oc get job <job-name>
Copy to Clipboard Copied! Toggle word wrap Toggle overflow Example output
NAME STATUS COMPLETIONS DURATION AGE sample-job-sk42x Suspended 0/1 2m12s
NAME STATUS COMPLETIONS DURATION AGE sample-job-sk42x Suspended 0/1 2m12s
Copy to Clipboard Copied! Toggle word wrap Toggle overflow Verify that a workload has been created in your namespace for the job, by running the following command and observing the output:
oc -n <namespace> get workloads
$ oc -n <namespace> get workloads
Copy to Clipboard Copied! Toggle word wrap Toggle overflow Example output
NAME QUEUE RESERVED IN ADMITTED FINISHED AGE job-sample-job-sk42x-77c03 user-queue 3m8s
NAME QUEUE RESERVED IN ADMITTED FINISHED AGE job-sample-job-sk42x-77c03 user-queue 3m8s
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
2.12. Getting support Link kopierenLink in die Zwischenablage kopiert!
If you experience difficulty with a procedure described in this documentation, or with Red Hat build of Kueue in general, visit the Red Hat Customer Portal.
From the Customer Portal, you can:
- Search or browse through the Red Hat Knowledgebase of articles and solutions relating to Red Hat products.
- Submit a support case to Red Hat Support.
- Access other product documentation.
2.12.1. About the Red Hat Knowledgebase Link kopierenLink in die Zwischenablage kopiert!
The Red Hat Knowledgebase provides rich content aimed at helping you make the most of Red Hat’s products and technologies. The Red Hat Knowledgebase consists of articles, product documentation, and videos outlining best practices on installing, configuring, and using Red Hat products. In addition, you can search for solutions to known issues, each providing concise root cause descriptions and remedial steps.
2.12.2. Collecting data for Red Hat Support Link kopierenLink in die Zwischenablage kopiert!
You can use the oc adm must-gather
CLI command to collect the information about your Red Hat build of Kueue instance that is most likely needed for debugging issues, including:
- Red Hat build of Kueue custom resources, such as workloads, cluster queues, local queues, resource flavors, admission checks, and their corresponding cluster resource definitions (CRDs)
- Services
- Endpoints
- Webhook configurations
-
Logs from the
openshift-kueue-operator
namespace andkueue-controller-manager
pods
Collected data is written into a new directory named must-gather/
in the current working directory by default.
Prerequisites
- The Red Hat build of Kueue Operator is installed on your cluster.
-
You have installed the OpenShift CLI (
oc
).
Procedure
-
Navigate to the directory where you want to store the
must-gather
data. Collect
must-gather
data by running the following command:oc adm must-gather \ --image=registry.redhat.io/kueue/kueue-must-gather-rhel9:<version>
$ oc adm must-gather \ --image=registry.redhat.io/kueue/kueue-must-gather-rhel9:<version>
Copy to Clipboard Copied! Toggle word wrap Toggle overflow Where
<version>
is your current version of Red Hat build of Kueue.-
Create a compressed file from the
must-gather
directory that was just created in your working directory. Make sure you provide the date and cluster ID for the uniquemust-gather
data. For more information about how to find the cluster ID, see How to find the cluster-id or name on OpenShift cluster. - Attach the compressed file to your support case on the the Customer Support page of the Red Hat Customer Portal.
Chapter 3. Leader Worker Set Operator Link kopierenLink in die Zwischenablage kopiert!
3.1. Leader Worker Set Operator overview Link kopierenLink in die Zwischenablage kopiert!
Using large language models (LLMs) for AI/ML inference often requires significant compute resources, and workloads typically must be sharded across multiple nodes. This can make deployments complex, creating challenges around scaling, recovery from failures, and efficient pod placement.
The Leader Worker Set Operator simplifies these multi-node deployments by treating a group of pods as a single, coordinated unit. It manages the lifecycle of each pod in the group, scales the entire group together, and performs updates and failure recovery at the group level to ensure consistency.
3.1.1. About the Leader Worker Set Operator Link kopierenLink in die Zwischenablage kopiert!
The Leader Worker Set Operator is based on the LeaderWorkerSet open source project. LeaderWorkerSet
is a custom Kubernetes API that can be used to deploy a group of pods as a unit. This is useful for artificial intelligence (AI) and machine learning (ML) inference workloads, where large language models (LLMs) are sharded across multiple nodes.
With the LeaderWorkerSet
API, pods are grouped into units consisting of one leader and multiple workers, all managed together as a single entity. Each pod in a group has a unique pod identity. Pods within a group are created in parallel and share identical lifecycle stages. Rollouts, rolling updates, and pod failure restarts are performed as a group.
In the LeaderWorkerSet
configuration, you define the size of the groups and the number of group replicas. If necessary, you can define separate templates for leader and worker pods, allowing for role-specific customization. You can also configure topology-aware placement, so that pods in the same group are co-located in the same topology.
Before you install the Leader Worker Set Operator, you must install the cert-manager Operator for Red Hat OpenShift because it is required to configure services and manage metrics collection.
Monitoring for the Leader Worker Set Operator is provided by default with OpenShift Container Platform through Prometheus.
3.1.1.1. LeaderWorkerSet architecture Link kopierenLink in die Zwischenablage kopiert!
The following diagram shows how the LeaderWorkerSet
API organizes groups of pods into a single unit, with one pod as the leader and the rest as the workers, to coordinate distributed workloads:
Figure 3.1. Leader worker set architecture
The LeaderWorkerSet
API uses a leader stateful set to manage the deployment and lifecycle of the groups of pods. For each replica defined, a leader-worker group is created.
Each leader-worker group contains a leader pod and a worker stateful set. The worker stateful set is owned by the leader pod and manages the set of worker pods associated with that leader pod. The specified size defines the total number of pods in each leader-worker group, with the leader pod included in that number.
3.2. Leader Worker Set Operator release notes Link kopierenLink in die Zwischenablage kopiert!
You can use the Leader Worker Set Operator to manage distributed inference workloads and process large-scale inference requests efficiently.
These release notes track the development of the Leader Worker Set Operator.
For more information, see About the Leader Worker Set Operator.
3.2.1. Release notes for Leader Worker Set Operator 1.0.0 Link kopierenLink in die Zwischenablage kopiert!
Issued: 18 September 2025
The following advisories are available for the Leader Worker Set Operator 1.0.0:
3.2.1.1. New features and enhancements Link kopierenLink in die Zwischenablage kopiert!
- This is the initial release of the Leader Worker Set Operator.
3.3. Managing distributed workloads with the Leader Worker Set Operator Link kopierenLink in die Zwischenablage kopiert!
You can use the Leader Worker Set Operator to manage distributed inference workloads and process large-scale inference requests efficiently.
3.3.1. Installing the Leader Worker Set Operator Link kopierenLink in die Zwischenablage kopiert!
You can use the web console to install the Leader Worker Set Operator.
Prerequisites
-
You have access to the cluster with
cluster-admin
privileges. - You have access to the OpenShift Container Platform web console.
- You have installed the cert-manager Operator for Red Hat OpenShift.
Procedure
- Log in to the OpenShift Container Platform web console.
- Verify that the cert-manager Operator for Red Hat OpenShift is installed.
Install the Leader Worker Set Operator.
- Navigate to Ecosystem → Software Catalog.
- Enter Leader Worker Set Operator into the filter box.
- Select the Leader Worker Set Operator and click Install.
On the Install Operator page:
- The Update channel is set to stable-v1.0, which installs the latest stable release of Leader Worker Set Operator 1.0.
- Under Installation mode, select A specific namespace on the cluster.
- Under Installed Namespace, select Operator recommended Namespace: openshift-lws-operator.
Under Update approval, select one of the following update strategies:
- The Automatic strategy allows Operator Lifecycle Manager (OLM) to automatically update the Operator when a new version is available.
- The Manual strategy requires a user with appropriate credentials to approve the Operator update.
- Click Install.
Create the custom resource (CR) for the Leader Worker Set Operator:
- Navigate to Installed Operators → Leader Worker Set Operator.
- Under Provided APIs, click Create instance in the LeaderWorkerSetOperator pane.
- Click Create.
3.3.2. Deploying a leader worker set Link kopierenLink in die Zwischenablage kopiert!
You can use the Leader Worker Set Operator to deploy a leader worker set to assist with managing distributed workloads across nodes.
Prerequisites
- You have installed the Leader Worker Set Operator.
Procedure
Create a new project by running the following command:
oc new-project my-namespace
$ oc new-project my-namespace
Copy to Clipboard Copied! Toggle word wrap Toggle overflow Create a file named
leader-worker-set.yaml
Copy to Clipboard Copied! Toggle word wrap Toggle overflow - 1
- Specify the name of the leader worker set resource.
- 2
- Specify the namespace for the leader worker set to run in.
- 3
- Specify the pod template for the leader pods.
- 4
- Specify the restart policy for when pod failures occur. Allowed values are
RecreateGroupOnPodRestart
to restart the whole group orNone
to not restart the group. - 5
- Specify the number of pods to create for each group, including the leader pod. For example, a value of
3
creates 1 leader pod and 2 worker pods. The default value is1
. - 6
- Specify the pod template for the worker pods.
- 7
- Specify the policy to use when creating the headless service. Allowed values are
UniquePerReplica
orShared
. The default value isShared
. - 8
- Specify the number of replicas, or leader-worker groups. The default value is
1
. - 9
- Specify the maximum number of replicas that can be scheduled above the
replicas
value during rolling updates. The value can be specified as an integer or a percentage.
For more information on all available fields to configure, see LeaderWorkerSet API upstream documentation.
Apply the leader worker set configuration by running the following command:
oc apply -f leader-worker-set.yaml
$ oc apply -f leader-worker-set.yaml
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
Verification
Verify that pods were created by running the following command:
oc get pods -n my-namespace
$ oc get pods -n my-namespace
Copy to Clipboard Copied! Toggle word wrap Toggle overflow Example output
Copy to Clipboard Copied! Toggle word wrap Toggle overflow Review the stateful sets by running the following command:
oc get statefulsets
$ oc get statefulsets
Copy to Clipboard Copied! Toggle word wrap Toggle overflow Example output
NAME READY AGE my-lws 4/4 111s my-lws-0 2/2 57s my-lws-1 2/2 60s
NAME READY AGE my-lws 4/4 111s
1 my-lws-0 2/2 57s
2 my-lws-1 2/2 60s
3 Copy to Clipboard Copied! Toggle word wrap Toggle overflow
3.4. Uninstalling the Leader Worker Set Operator Link kopierenLink in die Zwischenablage kopiert!
You can remove the Leader Worker Set Operator from OpenShift Container Platform by uninstalling the Operator and removing its related resources.
3.4.1. Uninstalling the Leader Worker Set Operator Link kopierenLink in die Zwischenablage kopiert!
You can use the web console to uninstall the Leader Worker Set Operator.
Prerequisites
-
You have access to the cluster with
cluster-admin
privileges. - You have access to the OpenShift Container Platform web console.
- You have installed the Leader Worker Set Operator.
Procedure
- Log in to the OpenShift Container Platform web console.
- Navigate to Operators → Installed Operators.
-
Select
openshift-lws-operator
from the Project dropdown list. Delete the
LeaderWorkerSetOperator
instance.- Click Leader Worker Set Operator and select the LeaderWorkerSetOperator tab.
-
Click the Options menu
next to the cluster entry and select Delete LeaderWorkerSetOperator.
- In the confirmation dialog, click Delete.
Uninstall the Leader Worker Set Operator.
- Navigate to Operators → Installed Operators.
-
Click the Options menu
next to the Leader Worker Set Operator entry and click Uninstall Operator.
- In the confirmation dialog, click Uninstall.
3.4.2. Uninstalling Leader Worker Set Operator resources Link kopierenLink in die Zwischenablage kopiert!
Optionally, after uninstalling the Leader Worker Set Operator, you can remove its related resources from your cluster.
Prerequisites
-
You have access to the cluster with
cluster-admin
privileges. - You have access to the OpenShift Container Platform web console.
- You have uninstalled the Leader Worker Set Operator.
Procedure
- Log in to the OpenShift Container Platform web console.
Remove CRDs that were created when the Leader Worker Set Operator was installed:
- Navigate to Administration → CustomResourceDefinitions.
-
Enter
LeaderWorkerSetOperator
in the Name field to filter the CRDs. -
Click the Options menu
next to the LeaderWorkerSetOperator CRD and select Delete CustomResourceDefinition.
- In the confirmation dialog, click Delete.
Delete the
openshift-lws-operator
namespace.- Navigate to Administration → Namespaces.
-
Enter
openshift-lws-operator
into the filter box. -
Click the Options menu
next to the openshift-lws-operator entry and select Delete Namespace.
-
In the confirmation dialog, enter
openshift-lws-operator
and click Delete.
Legal Notice
Link kopierenLink in die Zwischenablage kopiert!
Copyright © 2025 Red Hat
OpenShift documentation is licensed under the Apache License 2.0 (https://www.apache.org/licenses/LICENSE-2.0).
Modified versions must remove all Red Hat trademarks.
Portions adapted from https://github.com/kubernetes-incubator/service-catalog/ with modifications by Red Hat.
Red Hat, Red Hat Enterprise Linux, the Red Hat logo, the Shadowman logo, JBoss, OpenShift, Fedora, the Infinity logo, and RHCE are trademarks of Red Hat, Inc., registered in the United States and other countries.
Linux® is the registered trademark of Linus Torvalds in the United States and other countries.
Java® is a registered trademark of Oracle and/or its affiliates.
XFS® is a trademark of Silicon Graphics International Corp. or its subsidiaries in the United States and/or other countries.
MySQL® is a registered trademark of MySQL AB in the United States, the European Union and other countries.
Node.js® is an official trademark of Joyent. Red Hat Software Collections is not formally related to or endorsed by the official Joyent Node.js open source or commercial project.
The OpenStack® Word Mark and OpenStack logo are either registered trademarks/service marks or trademarks/service marks of the OpenStack Foundation, in the United States and other countries and are used with the OpenStack Foundation’s permission. We are not affiliated with, endorsed or sponsored by the OpenStack Foundation, or the OpenStack community.
All other trademarks are the property of their respective owners.