Cluster administration
Configuring Red Hat OpenShift Service on AWS clusters
Abstract
Chapter 1. Cluster notifications
Cluster notifications (sometimes referred to as service logs) are messages about the status, health, or performance of your cluster.
Cluster notifications are the primary way that Red Hat Site Reliability Engineering (SRE) communicates with you about the health of your managed cluster. Red Hat SRE may also use cluster notifications to prompt you to perform an action in order to resolve or prevent an issue with your cluster.
Cluster owners and administrators must regularly review and act on cluster notifications to ensure clusters remain healthy and supported.
You can view cluster notifications in the Red Hat Hybrid Cloud Console, in the Cluster history tab for your cluster. By default, only the cluster owner receives cluster notifications as emails. If other users need to receive cluster notification emails, add each user as a notification contact for your cluster.
1.2. What to expect from cluster notifications
As a cluster administrator, you need to be aware of when and why cluster notifications are sent, as well as their types and severity levels, in order to effectively understand the health and administration needs of your cluster.
1.2.1. Cluster notification policy
Cluster notifications are designed to keep you informed about the health of your cluster and high impact events that affect it.
Most cluster notifications are generated and sent automatically to ensure that you are immediately informed of problems or important changes to the state of your cluster.
In certain situations, Red Hat Site Reliability Engineering (SRE) creates and sends cluster notifications to provide additional context and guidance for a complex issue.
Cluster notifications are not sent for low-impact events, low-risk security updates, routine operations and maintenance, or minor, transient issues that are quickly resolved by Red Hat SRE.
Red Hat services automatically send notifications when:
- Remote health monitoring or environment verification checks detect an issue in your cluster, for example, when a worker node has low disk space.
- Significant cluster life cycle events occur, for example, when scheduled maintenance or upgrades begin, or cluster operations are impacted by an event, but do not require customer intervention.
- Significant cluster management changes occur, for example, when cluster ownership or administrative control is transferred from one user to another.
- Your cluster subscription is changed or updated, for example, when Red Hat makes updates to subscription terms or features available to your cluster.
SRE creates and sends notifications when:
- An incident results in a degradation or outage that impacts your cluster’s availability or performance, for example, your cloud provider has a regional outage. SRE sends subsequent notifications to inform you of incident resolution progress, and when the incident is resolved.
- A security vulnerability, security breach, or unusual activity is detected on your cluster.
- Red Hat detects that changes you have made are creating or may result in cluster instability.
- Red Hat detects that your workloads are causing performance degradation or instability in your cluster.
1.2.2. Cluster notification severity levels
Each cluster notification has an associated severity level to help you identify notifications with the greatest impact to your business. You can filter cluster notifications according to these severity levels in the Red Hat Hybrid Cloud Console, in the Cluster history tab for your cluster.
Red Hat uses the following severity levels for cluster notifications, from most to least severe:
- Critical
- Immediate action is required. One or more key functions of a service or cluster are not working or will stop working soon. A critical alert is important enough to page on-call staff and interrupt regular workflows.
- Major
- Immediate action is strongly recommended. One or more key functions of the cluster will soon stop working. A major issue may lead to a critical issue if it is not addressed in a timely manner.
- Warning
- Action is required as soon as possible. One or more key functions of the cluster are not working optimally and may degrade further, but do not pose an immediate danger to the functioning of the cluster.
- Info
- No action necessary. This severity does not describe problems that need to be addressed, only important information about meaningful or important life cycle, service, or cluster events.
- Debug
- No action necessary. Debug notifications provide low-level information about less important lifecycle, service, or cluster events to aid in debugging unexpected behavior.
1.2.3. Cluster notification types
Each cluster notification has an associated notification type to help you identify notifications that are relevant to your role and responsibilities. You can filter cluster notifications according to these types in the Red Hat Hybrid Cloud Console, in the Cluster history tab for your cluster.
Red Hat uses the following notification types to indicate notification relevance.
- Capacity management
- Notifications for events related to updating, creating, or deleting node pools, machine pools, compute replicas or quotas (load balancer, storage, etc.).
- Cluster access
- Notifications for events related to adding or deleting groups, roles or identity providers, for example, when SRE cannot access your cluster because STS credentials have expired, when there is a configuration problem with your AWS roles, or when you add or remove identity providers.
- Cluster add-ons
- Notifications for events related to add-on management or upgrade maintenance for add-ons, for example, when an add-on is installed, upgraded, or removed, or cannot be installed due to unmet requirements.
- Cluster configuration
- Notifications for cluster tuning events, workload monitoring, and inflight checks.
- Cluster lifecycle
- Notifications for cluster or cluster resource creation, deletion, and registration, or change in cluster or resource status (for example, ready or hibernating).
- Cluster networking
- Notifications related to cluster networking, including HTTP/S proxy, router, and ingress state.
- Cluster ownership
- Notifications related to cluster ownership transfer from one user to another.
- Cluster scaling
- Notifications related to updating, creating, or deleting node pools, machine pools, compute replicas or quota.
- Cluster security
- Events related to cluster security, for example, an increased number of failed access attempts, updates to trust bundles, or software updates with security impact.
- Cluster subscription
- Cluster expiration, trial cluster notifications, or switching from free to paid.
- Cluster updates
- Anything relating to upgrades, such as upgrade maintenance or enablement.
- Customer support
- Updates on support case status.
- General notification
- The default notification type. This is only used for notifications that do not have a more specific category.
1.3. Viewing cluster notifications using the Red Hat Hybrid Cloud Console
Cluster notifications provide important information about the health of your cluster. You can view notifications that have been sent to your cluster in the Cluster history tab on the Red Hat Hybrid Cloud Console.
Prerequisites
- You are logged in to the Hybrid Cloud Console.
Procedure
- Navigate to the Clusters page of the Hybrid Cloud Console.
- Click the name of your cluster to go to the cluster details page.
- Click the Cluster history tab.
Cluster notifications appear under the Cluster history heading.
- Optional: Filter for relevant cluster notifications.
Use the filter controls to hide cluster notifications that are not relevant to you, so that you can focus on your area of expertise or on resolving a critical issue. You can filter notifications based on text in the notification description, severity level, notification type, when the notification was received, and which system or person triggered the notification.
1.4. Cluster notification emails
By default, when a cluster notification is sent to the cluster, it is also sent as an email to the cluster owner. You can configure additional recipients for notification emails to ensure that all appropriate users remain informed about the state of the cluster.
1.4.1. Adding notification contacts to your cluster
Notification contacts receive emails when cluster notifications are sent to the cluster. By default, only the cluster owner receives cluster notification emails. You can configure other cluster users as additional notification contacts in your cluster support settings.
Prerequisites
- Your cluster is deployed and registered to the Red Hat Hybrid Cloud Console.
- You are logged in to the Hybrid Cloud Console as the cluster owner or as a user with the cluster editor role.
- The intended notification recipient has a Red Hat Customer Portal account associated with the same organization as the cluster owner.
Procedure
- Navigate to the Clusters page of the Hybrid Cloud Console.
- Click the name of your cluster to go to the cluster details page.
- Click the Support tab.
- On the Support tab, find the Notification contacts section.
- Click Add notification contact.
- In the Red Hat username or email field, enter the email address or the user name of the new recipient.
- Click Add contact.
Verification steps
- The "Notification contact added successfully" message displays.
Troubleshooting
- The Add notification contact button is disabled
- This button is disabled for users who do not have permission to add a notification contact. Log in to an account with the cluster owner, cluster editor, or cluster administrator role and try again.
- Error: Could not find any account identified by <username> or <email-address>
- This error occurs when the intended notification recipient is not part of the same Red Hat account organization as the cluster owner. Contact your organization administrator to add the intended recipient to the relevant organization and try again.
1.4.2. Removing notification contacts from your cluster
Notification contacts receive emails when cluster notifications are sent to the cluster.
You can remove notification contacts in your cluster support settings to prevent them from receiving notification emails.
Prerequisites
- Your cluster is deployed and registered to the Red Hat Hybrid Cloud Console.
- You are logged in to the Hybrid Cloud Console as the cluster owner or as a user with the cluster editor role.
Procedure
- Navigate to the Clusters page of the Hybrid Cloud Console.
- Click the name of your cluster to go to the cluster details page.
- Click the Support tab.
- On the Support tab, find the Notification contacts section.
- Click the options menu (⚙) beside the recipient you want to remove.
- Click Delete.
Verification steps
- The "Notification contact deleted successfully" message displays.
1.5. Troubleshooting
If you are not receiving cluster notification emails
- Ensure that emails sent from @redhat.com addresses are not filtered out of your email inbox.
- Ensure that your correct email address is listed as a notification contact for the cluster.
- Ask the cluster owner or administrator to add you as a notification contact: Cluster notification emails.
If your cluster does not receive notifications
- Ensure that your cluster can access resources at api.openshift.com.
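One quick way to confirm that the endpoint is reachable from inside the cluster is to run a temporary pod that queries it. This is a minimal sketch; the pod name and curl image are only examples:
# Prints the HTTP status code if api.openshift.com is reachable from the cluster network.
$ oc run api-check --image=curlimages/curl -i --rm --restart=Never \
    -- curl -sS -o /dev/null -w "%{http_code}\n" https://api.openshift.com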
Chapter 2. Configuring private connections
2.1. Configuring private connections
Private cluster access can be implemented to suit the needs of your Red Hat OpenShift Service on AWS environment.
Procedure
Access your Red Hat OpenShift Service on AWS AWS account and use one or more of the following methods to establish a private connection to your cluster:
- Configuring AWS VPC peering: Enable VPC peering to route network traffic between two private IP addresses.
- Configuring AWS VPN: Establish a Virtual Private Network to securely connect your private network to your Amazon Virtual Private Cloud.
- Configuring AWS Direct Connect: Configure AWS Direct Connect to establish a dedicated network connection between your private network and an AWS Direct Connect location.
2.2. Configuring AWS VPC peering
This sample process configures an Amazon Web Services (AWS) VPC containing a Red Hat OpenShift Service on AWS cluster to peer with another AWS VPC network. For more information about creating an AWS VPC Peering connection or for other possible configurations, see the AWS VPC Peering guide.
2.2.1. VPC peering terms
When setting up a VPC peering connection between two VPCs on two separate AWS accounts, the following terms are used:
Red Hat OpenShift Service on AWS AWS Account | The AWS account that contains the Red Hat OpenShift Service on AWS cluster. |
Red Hat OpenShift Service on AWS Cluster VPC | The VPC that contains the Red Hat OpenShift Service on AWS cluster. |
Customer AWS Account | Your non-Red Hat OpenShift Service on AWS AWS Account that you would like to peer with. |
Customer VPC | The VPC in your AWS Account that you would like to peer with. |
Customer VPC Region | The region where the customer’s VPC resides. |
As of July 2018, AWS supports inter-region VPC peering between all commercial regions excluding China.
2.2.2. Initiating the VPC peer request
You can send a VPC peering connection request from the Red Hat OpenShift Service on AWS AWS Account to the Customer AWS Account.
Prerequisites
Gather the following information about the Customer VPC required to initiate the peering request:
- Customer AWS account number
- Customer VPC ID
- Customer VPC Region
- Customer VPC CIDR
- Check the CIDR block used by the Red Hat OpenShift Service on AWS Cluster VPC. If it overlaps or matches the CIDR block for the Customer VPC, then peering between these two VPCs is not possible; see the Amazon VPC Unsupported VPC Peering Configurations documentation for details. If the CIDR blocks do not overlap, you can continue with the procedure.
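If you have the AWS CLI configured, you can also list both CIDR blocks to confirm that they do not overlap. The VPC IDs are placeholders, and you must run the command in each account if the two VPCs belong to different AWS accounts:
# Shows the VPC ID and primary CIDR block for each VPC; IDs are placeholders.
$ aws ec2 describe-vpcs --vpc-ids <cluster_vpc_id> <customer_vpc_id> \
    --query 'Vpcs[].[VpcId,CidrBlock]' --output table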
Procedure
- Log in to the Web Console for the Red Hat OpenShift Service on AWS AWS Account and navigate to the VPC Dashboard in the region where the cluster is being hosted.
- Go to the Peering Connections page and click the Create Peering Connection button.
Verify the details of the account you are logged in to and the details of the account and VPC you are connecting to:
- Peering connection name tag: Set a descriptive name for the VPC Peering Connection.
- VPC (Requester): Select the Red Hat OpenShift Service on AWS Cluster VPC ID from the list.
- Account: Select Another account and provide the Customer AWS Account number (without dashes).
- Region: If the Customer VPC Region differs from the current region, select Another Region and select the customer VPC Region from the list.
- VPC (Accepter): Set the Customer VPC ID.
- Click Create Peering Connection.
- Confirm that the request enters a Pending state. If it enters a Failed state, confirm the details and repeat the process.
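The peering request can also be initiated with the AWS CLI instead of the console. This is a sketch with placeholder values, run with credentials for the Red Hat OpenShift Service on AWS AWS Account:
# Requests peering from the cluster VPC (requester) to the Customer VPC (accepter).
$ aws ec2 create-vpc-peering-connection \
    --vpc-id <cluster_vpc_id> \
    --peer-vpc-id <customer_vpc_id> \
    --peer-owner-id <customer_aws_account_number> \
    --peer-region <customer_vpc_region>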
2.2.3. Accepting the VPC peer request
After you create the VPC peering connection, you must accept the request in the Customer AWS Account.
Prerequisites
- Initiate the VPC peer request.
Procedure
- Log in to the AWS Web Console.
- Navigate to VPC Service.
- Go to Peering Connections.
- Click on Pending peering connection.
- Confirm the AWS Account and VPC ID that the request originated from. This should be from the Red Hat OpenShift Service on AWS AWS Account and Red Hat OpenShift Service on AWS Cluster VPC.
- Click Accept Request.
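If you prefer the command line, the request can also be accepted with the AWS CLI using credentials for the Customer AWS Account; the peering connection ID is a placeholder:
$ aws ec2 accept-vpc-peering-connection \
    --vpc-peering-connection-id <peering_connection_id>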
2.2.4. Configuring the routing tables
After you accept the VPC peering request, both VPCs must configure their routes to communicate across the peering connection.
Prerequisites
- Initiate and accept the VPC peer request.
Procedure
- Log in to the AWS Web Console for the Red Hat OpenShift Service on AWS AWS Account.
- Navigate to the VPC Service, then Route tables.
- Select the Route Table for the Red Hat OpenShift Service on AWS Cluster VPC.
Note: On some clusters, there may be more than one route table for a particular VPC. Select the private one that has a number of explicitly associated subnets.
- Select the Routes tab, then Edit.
- Enter the Customer VPC CIDR block in the Destination text box.
- Enter the Peering Connection ID in the Target text box.
- Click Save.
You must complete the same process with the other VPC’s CIDR block:
- Log into the Customer AWS Web Console → VPC Service → Route Tables.
- Select the Route Table for your VPC.
- Select the Routes tab, then Edit.
- Enter the Red Hat OpenShift Service on AWS Cluster VPC CIDR block in the Destination text box.
- Enter the Peering Connection ID in the Target text box.
- Click Save changes.
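The equivalent route entries can also be added with the AWS CLI. The route table ID, destination CIDR block, and peering connection ID are placeholders, and the command must be run once against each VPC's route table:
$ aws ec2 create-route \
    --route-table-id <route_table_id> \
    --destination-cidr-block <peer_vpc_cidr> \
    --vpc-peering-connection-id <peering_connection_id>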
The VPC peering connection is now complete. Follow the verification procedure to ensure connectivity across the peering connection is working.
2.2.5. Verifying and troubleshooting VPC peering
After you set up a VPC peering connection, it is best to confirm it has been configured and is working correctly.
Prerequisites
- Initiate and accept the VPC peer request.
- Configure the routing tables.
Procedure
In the AWS console, look at the route table for the cluster VPC that is peered. Ensure that the steps for configuring the routing tables were followed and that there is a route table entry pointing the VPC CIDR range destination to the peering connection target.
If the routes look correct on both the Red Hat OpenShift Service on AWS Cluster VPC route table and Customer VPC route table, then the connection should be tested using the netcat method below. If the test calls are successful, then VPC peering is working correctly.
To test network connectivity to an endpoint device, nc (or netcat) is a helpful troubleshooting tool. It is included in the default image and provides quick and clear output if a connection can be established.
Create a temporary pod using the busybox image, which cleans up after itself:
$ oc run netcat-test \
    --image=busybox -i -t \
    --restart=Never --rm \
    -- /bin/sh
Check the connection using nc.
Example successful connection results:
/ nc -zvv 192.168.1.1 8080
10.181.3.180 (10.181.3.180:8080) open
sent 0, rcvd 0
Example failed connection results:
/ nc -zvv 192.168.1.2 8080
nc: 10.181.3.180 (10.181.3.180:8081): Connection refused
sent 0, rcvd 0
Exit the container, which automatically deletes the pod:
/ exit
2.3. Configuring AWS VPN
This sample process configures an Amazon Web Services (AWS) Red Hat OpenShift Service on AWS cluster to use a customer’s on-site hardware VPN device.
AWS VPN does not currently provide a managed option to apply NAT to VPN traffic. See the AWS Knowledge Center for more details.
Routing all traffic, for example 0.0.0.0/0, through a private connection is not supported. This requires deleting the internet gateway, which disables SRE management traffic.
For more information about connecting an AWS VPC to remote networks using a hardware VPN device, see the Amazon VPC VPN Connections documentation.
2.3.1. Creating a VPN connection
You can configure an Amazon Web Services (AWS) Red Hat OpenShift Service on AWS cluster to use a customer’s on-site hardware VPN device using the following procedures.
Prerequisites
- Hardware VPN gateway device model and software version, for example Cisco ASA running version 8.3. See the Amazon VPC Network Administrator Guide to confirm whether your gateway device is supported by AWS.
- Public, static IP address for the VPN gateway device.
- BGP or static routing: if BGP, the ASN is required. If static routing, you must configure at least one static route.
- Optional: IP and Port/Protocol of a reachable service to test the VPN connection.
2.3.1.1. Configuring the VPN connection
Procedure
- Log in to the Red Hat OpenShift Service on AWS AWS Account Dashboard, and navigate to the VPC Dashboard.
- Under Virtual private cloud click on Your VPCs and identify the name and VPC ID for the VPC containing the Red Hat OpenShift Service on AWS cluster.
- Under Virtual private network (VPN) click Customer gateways.
- Click Create customer gateway and give it a meaningful name.
- Enter the ASN of your customer gateway device in the BGP ASN field.
- Enter the IP address for your customer gateway device's external interface in the IP address field.
- Click Create customer gateway.
If you do not already have a Virtual Private Gateway attached to the intended VPC:
- From the VPC Dashboard, click on Virtual Private Gateways.
- Click Create virtual private gateway, give it a meaningful name.
- Click Create virtual private gateway, leaving the Amazon default ASN.
- Select the newly created gateway.
- Select Actions from the list and click Attach to VPC.
- Select the newly created gateway under Available VPC’s and click Attach to VPC to attach it to the cluster VPC you identified earlier.
2.3.1.2. Establishing the VPN Connection
Procedure
- From the VPC dashboard, under Virtual private network (VPN) click on Site-to-Site VPN connections.
Click Create VPN connection.
- Give it a meaningful name tag.
- Select the Virtual private gateway created previously.
- For Customer gateway, select Existing.
- Select the Customer gateway id by name.
- If the VPN will use BGP, select Dynamic, otherwise select Static and enter the Static IP CIDRs. If there are multiple CIDRs, add each CIDR as Another Rule.
- Click Create VPN connection.
- Under State wait for the VPN status to change from Pending to Available, approximately 5 to 10 minutes.
Select the VPN you just created and click Download configuration.
- From the list, select the vendor, platform, and version of the customer gateway device, then click Download.
- The Generic vendor configuration is also available for retrieving information in a plain text format.
After the VPN connection has been established, be sure to set up Route Propagation or the VPN may not function as expected.
Note the VPC subnet information, which you must add to your configuration as the remote network.
2.3.1.3. Enabling VPN route propagation
After you have set up the VPN connection, you must ensure that route propagation is enabled so that the necessary routes are added to the VPC’s route table.
Procedure
- From the VPC Dashboard, under Virtual private cloud, click on Route tables.
- Select the private Route table associated with the VPC that contains your Red Hat OpenShift Service on AWS cluster.
Note: On some clusters, there may be more than one route table for a particular VPC. Select the private one that has a number of explicitly associated subnets.
- Click on the Route Propagation tab.
In the table that appears, you should see the Virtual Private Gateway you created previously. Check the value in the Propagate column.
- If Propagation is set to No, click Edit route propagation, check the Enable checkbox in Propagation and click Save.
After you configure your VPN tunnel and AWS detects it as Up, your static or BGP routes are automatically added to the route table.
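Route propagation can also be enabled with the AWS CLI; the route table and virtual private gateway IDs are placeholders:
$ aws ec2 enable-vgw-route-propagation \
    --route-table-id <route_table_id> \
    --gateway-id <virtual_private_gateway_id>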
2.3.2. Verifying the VPN connection
After you have set up your side of the VPN tunnel, you can verify that the tunnel is up in the AWS console and that connectivity across the tunnel is working.
Prerequisites
- Created a VPN connection.
Procedure
Verify the tunnel is up in AWS.
- From the VPC Dashboard, under Virtual private network (VPN), click on Site-to-Site VPN connections.
- Select the VPN connection you created previously and click the Tunnel details tab.
- You should see that at least one of the VPN tunnels is in an Up status.
Verify the connection.
To test network connectivity to an endpoint device, nc (or netcat) is a helpful troubleshooting tool. It is included in the default image and provides quick and clear output if a connection can be established.
Create a temporary pod using the busybox image, which cleans up after itself:
$ oc run netcat-test \
    --image=busybox -i -t \
    --restart=Never --rm \
    -- /bin/sh
Check the connection using nc.
Example successful connection results:
/ nc -zvv 192.168.1.1 8080
10.181.3.180 (10.181.3.180:8080) open
sent 0, rcvd 0
Example failed connection results:
/ nc -zvv 192.168.1.2 8080
nc: 10.181.3.180 (10.181.3.180:8081): Connection refused
sent 0, rcvd 0
Exit the container, which automatically deletes the pod:
/ exit
2.3.3. Troubleshooting the VPN connection
Tunnel does not connect
If the tunnel connection is still Down, there are several things you can verify:
- The AWS tunnel will not initiate a VPN connection. The connection attempt must be initiated from the Customer Gateway.
- Ensure that your source traffic is coming from the same IP as the configured customer gateway. AWS will silently drop all traffic to the gateway whose source IP address does not match.
- Ensure that your configuration matches values supported by AWS. This includes IKE versions, DH groups, IKE lifetime, and more.
- Recheck the route table for the VPC. Ensure that propagation is enabled and that there are entries in the route table that have the virtual private gateway you created earlier as a target.
- Confirm that you do not have any firewall rules that could be causing an interruption.
- Check if you are using a policy-based VPN as this can cause complications depending on how it is configured.
- Further troubleshooting steps can be found at the AWS Knowledge Center.
Tunnel does not stay connected
If the tunnel connection has trouble staying Up consistently, know that all AWS tunnel connections must be initiated from your gateway. AWS tunnels do not initiate tunneling.
Red Hat recommends setting up an SLA Monitor (Cisco ASA) or some device on your side of the tunnel that constantly sends "interesting" traffic, for example ping, nc, or telnet, at any IP address configured within the VPC CIDR range. It does not matter whether the connection is successful, just that the traffic is being directed at the tunnel.
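If you do not have an SLA Monitor or similar feature available, a simple loop run from a host on your side of the VPN can serve the same purpose. This is a minimal sketch; the target address, port, and interval are placeholders:
# Probes an address inside the VPC CIDR range every 30 seconds to keep traffic flowing through the tunnel.
$ while true; do nc -zvv -w 3 <vpc_internal_ip> 443; sleep 30; done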
Secondary tunnel in Down state
When a VPN tunnel is created, AWS creates an additional failover tunnel. Depending upon the gateway device, sometimes the secondary tunnel will be seen as in the Down state.
The AWS Notification is as follows:
You have new non-redundant VPN connections
One or more of your vpn connections are not using both tunnels. This mode of
operation is not highly available and we strongly recommend you configure your
second tunnel. View your non-redundant VPN connections.
2.4. Configuring AWS Direct Connect
This process describes accepting an AWS Direct Connect virtual interface with Red Hat OpenShift Service on AWS. For more information about AWS Direct Connect types and configuration, see the AWS Direct Connect components documentation.
2.4.1. AWS Direct Connect methods
A Direct Connect connection requires a hosted Virtual Interface (VIF) connected to a Direct Connect Gateway (DXGateway), which is in turn associated to a Virtual Gateway (VGW) or a Transit Gateway in order to access a remote VPC in the same or another account.
If you do not have an existing DXGateway, the typical process involves creating the hosted VIF, with the DXGateway and VGW being created in the Red Hat OpenShift Service on AWS AWS Account.
If you have an existing DXGateway connected to one or more existing VGWs, the process involves the Red Hat OpenShift Service on AWS AWS Account sending an Association Proposal to the DXGateway owner. The DXGateway owner must ensure that the proposed CIDR will not conflict with any other VGWs they have associated.
See the following AWS documentation for more details:
When connecting to an existing DXGateway, you are responsible for the costs.
There are two configuration options available:
Method 1 | Create the hosted VIF and then the DXGateway and VGW. |
Method 2 | Request a connection via an existing Direct Connect Gateway that you own. |
2.4.2. Creating the hosted Virtual Interface
Prerequisites
- Gather Red Hat OpenShift Service on AWS AWS Account ID.
2.4.2.1. Determining the type of Direct Connect connection
View the Direct Connect Virtual Interface details to determine the type of connection.
Procedure
- Log in to the Red Hat OpenShift Service on AWS AWS Account Dashboard and select the correct region.
- Select Direct Connect from the Services menu.
- There will be one or more Virtual Interfaces waiting to be accepted. Select one of them to view the Summary.
- View the Virtual Interface type: private or public.
- Record the Amazon side ASN value.
If the Direct Connect Virtual Interface type is Private, a Virtual Private Gateway is created. If the Direct Connect Virtual Interface is Public, a Direct Connect Gateway is created.
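If you prefer the command line, you can also list the pending virtual interfaces and their types with the AWS CLI:
$ aws directconnect describe-virtual-interfaces \
    --query 'virtualInterfaces[].[virtualInterfaceId,virtualInterfaceType,virtualInterfaceState]' \
    --output table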
2.4.2.2. Creating a Private Direct Connect
A Private Direct Connect is created if the Direct Connect Virtual Interface type is Private.
Procedure
- Log in to the Red Hat OpenShift Service on AWS AWS Account Dashboard and select the correct region.
- From the AWS region, select VPC from the Services menu.
- From Virtual private network (VPN), select Virtual private gateways.
- Click Create virtual private gateway.
- Give the Virtual Private Gateway a suitable name.
- Select Custom ASN and, in the Enter custom ASN field, enter the Amazon side ASN value gathered previously.
- Click Create virtual private gateway.
- Click the newly created Virtual Private Gateway and choose Attach to VPC from the Actions tab.
- Select the Red Hat OpenShift Service on AWS Cluster VPC from the list, and click Attach VPC.
2.4.2.3. Creating a Public Direct Connect
A Public Direct Connect is created if the Direct Connect Virtual Interface type is Public.
Procedure
- Log in to the Red Hat OpenShift Service on AWS AWS Account Dashboard and select the correct region.
- From the Red Hat OpenShift Service on AWS AWS Account region, select Direct Connect from the Services menu.
- Select Direct Connect gateways and Create Direct Connect gateway.
- Give the Direct Connect gateway a suitable name.
- In the Amazon side ASN, enter the Amazon side ASN value gathered previously.
- Click Create Direct Connect gateway.
2.4.2.4. Verifying the Virtual Interfaces
After the Direct Connect Virtual Interfaces have been accepted, wait a short period and view the status of the Interfaces.
Procedure
- Log in to the Red Hat OpenShift Service on AWS AWS Account Dashboard and select the correct region.
- From the Red Hat OpenShift Service on AWS AWS Account region, select Direct Connect from the Services menu.
- Select one of the Direct Connect Virtual Interfaces from the list.
- Check the Interface State has become Available.
- Check the Interface BGP Status has become Up.
- Repeat this verification for any remaining Direct Connect Interfaces.
After the Direct Connect Virtual Interfaces are available, you can log in to the Red Hat OpenShift Service on AWS AWS Account Dashboard and download the Direct Connect configuration file for configuration on your side.
2.4.3. Connecting to an existing Direct Connect Gateway
Prerequisites
- Confirm the CIDR range of the Red Hat OpenShift Service on AWS VPC will not conflict with any other VGWs you have associated.
Gather the following information:
- The Direct Connect Gateway ID.
- The AWS Account ID associated with the virtual interface.
- The BGP ASN assigned for the DXGateway. Optional: the Amazon default ASN may also be used.
Procedure
- Log in to the Red Hat OpenShift Service on AWS AWS Account Dashboard and select the correct region.
- From the Red Hat OpenShift Service on AWS AWS Account region, select VPC from the Services menu.
- From Virtual private network (VPN), select Virtual private gateways.
- Select Create virtual private gateway.
- Give the virtual private gateway a suitable name in the Details field.
- Click Custom ASN and enter the Amazon side ASN value gathered previously or use the Amazon Provided ASN.
- Click Create virtual private gateway.
- From the Red Hat OpenShift Service on AWS AWS Account region, select Direct Connect from the Services menu.
- Click virtual private gateways and select the virtual private gateway.
- Click View details.
- Click the Direct Connect gateway associations tab.
- Click Associate Direct Connect gateway.
- Under Association account type, for Account owner, click Another account.
- Under Association settings, for Direct Connect gateway ID, enter the ID of the Direct Connect gateway.
- For Direct Connect gateway owner, enter the ID of the AWS account that owns the Direct Connect gateway.
- Optional: Add prefixes to Allowed prefixes, separating them using commas or put them on separate lines.
- Click Associate Direct Connect gateway.
- After the Association Proposal has been sent, it will be waiting for your acceptance. The final steps you must perform are available in the AWS Documentation.
2.4.4. Troubleshooting Direct Connect
Further troubleshooting can be found in the Troubleshooting AWS Direct Connect documentation.
Chapter 3. Cluster autoscaling for Red Hat OpenShift Service on AWS
Applying autoscaling to Red Hat OpenShift Service on AWS clusters involves configuring one or more machine pools with autoscaling. You can use the Cluster Autoscaler to further configure cluster-wide autoscaling that is applicable to all of the machine pools that are autoscaling.
3.1. About the cluster autoscaler
The cluster autoscaler adjusts the size of a Red Hat OpenShift Service on AWS cluster to meet its current deployment needs. It uses declarative, Kubernetes-style arguments to provide infrastructure management that does not rely on objects of a specific cloud provider. The cluster autoscaler has a cluster scope, and is not associated with a particular namespace. In Red Hat OpenShift Service on AWS, the Cluster Autoscaler is fully managed, which means it is hosted along with the control plane.
The cluster autoscaler increases the size of the cluster when there are pods that fail to schedule on any of the current worker nodes due to insufficient resources or when another node is necessary to meet deployment needs. The cluster autoscaler does not increase the cluster resources beyond the limits that you specify.
The cluster autoscaler computes the total memory, CPU, and GPU only on the nodes that belong to autoscaling machine pools. All of the machine pool nodes that are not autoscaling are excluded from this aggregation. For example, if you set the maxNodesTotal to 50 on a Red Hat OpenShift Service on AWS cluster with three machine pools in which a single machine pool is not autoscaling, the cluster autoscaler restricts the total nodes to 50 in only those two machine pools that are autoscaling. The single manually scaled machine pool can have additional nodes, making the overall cluster node total more than 50.
Automatic node removal
Every 10 seconds, the cluster autoscaler checks which nodes are unnecessary in the cluster and removes them. The cluster autoscaler considers a node for removal if the following conditions apply:
- The node utilization is less than the node utilization level threshold for the cluster. The node utilization level is the sum of the requested resources divided by the allocated resources for the node. If you do not specify a value in the ClusterAutoscaler custom resource, the cluster autoscaler uses a default value of 0.5, which corresponds to 50% utilization.
- The cluster autoscaler can move all pods running on the node to the other nodes. The Kubernetes scheduler is responsible for scheduling pods on the nodes.
- The node does not have the scale down disabled annotation.
If the following types of pods are present on a node, the cluster autoscaler will not remove the node:
- Pods with restrictive pod disruption budgets (PDBs).
- Kube-system pods that do not run on the node by default.
- Kube-system pods that do not have a PDB or have a PDB that is too restrictive.
- Pods that are not backed by a controller object such as a deployment, replica set, or stateful set.
- Pods with local storage.
- Pods that cannot be moved elsewhere because of a lack of resources, incompatible node selectors or affinity, matching anti-affinity, and so on.
- Pods that have a "cluster-autoscaler.kubernetes.io/safe-to-evict": "false" annotation, unless they also have a "cluster-autoscaler.kubernetes.io/safe-to-evict": "true" annotation.
For example, you set the maximum CPU limit to 64 cores and configure the cluster autoscaler to only create machines that have 8 cores each. If your cluster starts with 30 cores, the cluster autoscaler can add up to 4 more nodes with 32 cores, for a total of 62.
Limitations
If you configure the cluster autoscaler, additional usage restrictions apply:
- Do not modify the nodes that are in autoscaled node groups directly. All nodes within the same node group have the same capacity and labels and run the same system pods.
- Specify requests for your pods.
- If you have to prevent pods from being deleted too quickly, configure appropriate PDBs.
- Confirm that your cloud provider quota is large enough to support the maximum node pools that you configure.
- Do not run additional node group autoscalers, especially the ones offered by your cloud provider.
The cluster autoscaler only adds nodes in autoscaled node groups if doing so would result in a schedulable pod. If the available node types cannot meet the requirements for a pod request, or if the node groups that could meet these requirements are at their maximum size, the cluster autoscaler cannot scale up.
Interaction with other scheduling features
The horizontal pod autoscaler (HPA) and the cluster autoscaler modify cluster resources in different ways. The HPA changes the deployment’s or replica set’s number of replicas based on the current CPU load. If the load increases, the HPA creates new replicas, regardless of the amount of resources available to the cluster. If there are not enough resources, the cluster autoscaler adds resources so that the HPA-created pods can run. If the load decreases, the HPA stops some replicas. If this action causes some nodes to be underutilized or completely empty, the cluster autoscaler deletes the unnecessary nodes.
The cluster autoscaler takes pod priorities into account. The Pod Priority and Preemption feature enables scheduling pods based on priorities if the cluster does not have enough resources, but the cluster autoscaler ensures that the cluster has resources to run all pods. To honor the intention of both features, the cluster autoscaler includes a priority cutoff function. You can use this cutoff to schedule "best-effort" pods, which do not cause the cluster autoscaler to increase resources but instead run only when spare resources are available.
Pods with priority lower than the cutoff value do not cause the cluster to scale up or prevent the cluster from scaling down. No new nodes are added to run the pods, and nodes running these pods might be deleted to free resources.
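As a concrete illustration of the HPA side of this interaction, you can create a horizontal pod autoscaler for an existing deployment with the oc CLI. The deployment name and thresholds below are arbitrary examples:
# Scales the frontend deployment between 2 and 10 replicas based on CPU utilization.
$ oc autoscale deployment/frontend --min=2 --max=10 --cpu-percent=75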
3.2. Enable autoscaling during cluster creation by using the interactive mode with the ROSA CLI
You can use the ROSA command-line interface (CLI) (rosa) in interactive mode of your terminal, if available, to set cluster-wide autoscaling behavior during cluster creation.
Interactive mode provides more information about available configurable parameters. Interactive mode also does basic checks and preflight validations, meaning that if a provided value is invalid, the terminal outputs a prompt for a valid input.
Procedure
- During cluster creation, use the --enable-autoscaling and --interactive parameters to enable cluster autoscaling:
Example
$ rosa create cluster --cluster-name <cluster_name> --enable-autoscaling --interactive
If your cluster name is longer than 15 characters, it will contain an autogenerated domain prefix as a sub-domain for your provisioned cluster on *.openshiftapps.com.
To customize the subdomain, use the --domain-prefix flag. The domain prefix cannot be longer than 15 characters, must be unique, and cannot be changed after cluster creation.
When the following prompt appears, enter y to go through all available autoscaling options.
Example interactive prompt
? Configure cluster-autoscaler (optional): [? for help] (y/N) y <enter>
3.3. Enable autoscaling during cluster creation with the ROSA CLI
You can use the ROSA command-line interface (CLI) (rosa) to set cluster-wide autoscaling behavior during cluster creation. You can enable autoscaling on the entire cluster or on individual machine pools.
Procedure
- During cluster creation, type --enable-autoscaling after the cluster name to enable machine autoscaling:
If your cluster name is longer than 15 characters, it will contain an autogenerated domain prefix as a sub-domain for your provisioned cluster on *.openshiftapps.com.
To customize the subdomain, use the --domain-prefix flag. The domain prefix cannot be longer than 15 characters, must be unique, and cannot be changed after cluster creation.
Example
$ rosa create cluster --cluster-name <cluster_name> --enable-autoscaling
Set at least one parameter to enable cluster autoscaling by running the following command:
Example
$ rosa create cluster --cluster-name <cluster_name> --enable-autoscaling <parameter>
3.4. Cluster autoscaling parameters using the ROSA CLI
You can add the following parameters to the cluster creation command to configure autoscaler parameters when using the ROSA command-line interface (CLI) (rosa).
Setting | Description | Type or Range | Example/Instruction
---|---|---|---
--autoscaler-max-pod-grace-period | Gives pods graceful termination time before scaling down, measured in seconds. Replace int in the command with the number of seconds you want to use. | integer | --autoscaler-max-pod-grace-period=<seconds>
--autoscaler-pod-priority-threshold | The priority that a pod must exceed to cause the cluster autoscaler to deploy additional nodes. Replace int in the command with the number you want to use; the value can be negative. | integer | --autoscaler-pod-priority-threshold=<priority>
--autoscaler-max-node-provision-time | Maximum time that the cluster autoscaler waits for a node to be provisioned. Replace string in the command with an integer and time unit (ns, us, µs, ms, s, m, h). | string | --autoscaler-max-node-provision-time=<duration>
--autoscaler-max-nodes-total | Maximum amount of nodes in the cluster, including the autoscaled nodes. Replace int in the command with the number you want to use. | integer | --autoscaler-max-nodes-total=<node_count>
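As an illustration, a creation command that combines several of these options might look like the following. The flag names follow the parameter table above but can vary between ROSA CLI versions, so confirm them with rosa create cluster --help before use:
# Flag names and values are assumptions shown for illustration only.
$ rosa create cluster --cluster-name <cluster_name> --enable-autoscaling \
    --autoscaler-max-pod-grace-period=600 \
    --autoscaler-pod-priority-threshold=-10 \
    --autoscaler-max-node-provision-time=15m \
    --autoscaler-max-nodes-total=50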
3.5. Edit autoscaling after cluster creation with the ROSA CLI
You can edit any specific parameters of the cluster autoscaler after creating the autoscaler when using the ROSA command-line interface (CLI) (rosa).
Procedure
To edit the cluster autoscaler, run the following command:
Example
$ rosa edit autoscaler --cluster=<mycluster>
To edit a specific parameter, run the following command:
Example
$ rosa edit autoscaler --cluster=<mycluster> <parameter>
3.6. View autoscaler configurations with the ROSA CLI
You can view your cluster autoscaler configurations using the rosa describe autoscaler command when using the ROSA command-line interface (CLI) (rosa).
Procedure
To view cluster autoscaler configurations, run the following command:
Example
$ rosa describe autoscaler --cluster=<mycluster>
Chapter 4. Managing compute nodes using machine pools
4.1. About machine pools
Red Hat OpenShift Service on AWS uses machine pools as an elastic, dynamic provisioning method on top of your cloud infrastructure.
The primary resources are machines, compute machine sets, and machine pools.
4.1.1. Machines
A machine is a fundamental unit that describes the host for a worker node.
4.1.2. Machine sets
MachineSet resources are groups of compute machines. If you need more machines or must scale them down, change the number of replicas in the machine pool to which the compute machine sets belong.
Machine sets are not directly modifiable in Red Hat OpenShift Service on AWS.
4.1.3. Machine pools
Machine pools are a higher-level construct than compute machine sets.
A machine pool creates compute machine sets that are all clones of the same configuration across availability zones. Machine pools perform all of the host node provisioning management actions on a worker node. If you need more machines or must scale them down, change the number of replicas in the machine pool to meet your compute needs. You can manually configure scaling or set autoscaling.
In Red Hat OpenShift Service on AWS clusters, the hosted control plane spans multiple availability zones (AZ) in the installed cloud region. Each machine pool in a Red Hat OpenShift Service on AWS cluster deploys in a single subnet within a single AZ.
Worker nodes are not guaranteed longevity, and may be replaced at any time as part of the normal operation and management of OpenShift. For more details about the node lifecycle, refer to additional resources.
Multiple machine pools can exist on a single cluster, and each machine pool can contain a unique node type and node size (AWS EC2 instance type and size) configuration.
4.1.3.1. Machine pools during cluster installation
By default, a cluster has one machine pool. During cluster installation, you can define instance type or size and add labels to this machine pool as well as define the size of the root disk.
4.1.3.2. Configuring machine pools after cluster installation
After a cluster’s installation:
- You can remove or add labels to any machine pool.
- You can add additional machine pools to an existing cluster.
- You can add taints to any machine pool if there is one machine pool without any taints.
- You can create or delete a machine pool if there is one machine pool without any taints and at least two replicas.
Note: You cannot change the machine pool node type or size. The machine pool node type or size is specified during creation only. If you need a different node type or size, you must re-create a machine pool and specify the required node type or size values.
- You can add a label to each added machine pool.
Worker nodes are not guaranteed longevity, and may be replaced at any time as part of the normal operation and management of OpenShift. For more details about the node lifecycle, refer to additional resources.
Procedure
Optional: Add a label to the default machine pool after configuration by using the default machine pool labels and running the following command:
$ rosa edit machinepool -c <cluster_name> <machinepool_name> -i
Example input
$ rosa edit machinepool -c mycluster worker -i
? Enable autoscaling: No
? Replicas: 3
? Labels: mylabel=true
I: Updated machine pool 'worker' on cluster 'mycluster'
4.1.3.3. Machine pool upgrade requirements
Each machine pool in a Red Hat OpenShift Service on AWS cluster upgrades independently. Because the machine pools upgrade independently, they must remain within 2 minor (Y-stream) versions of the hosted control plane. For example, if your hosted control plane is 4.16.z, your machine pools must be at least 4.14.z.
The following image depicts how machine pools work within Red Hat OpenShift Service on AWS clusters:
Machine pools in Red Hat OpenShift Service on AWS clusters each upgrade independently and the machine pool versions must remain within two minor (Y-stream) versions of the control plane.
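One way to confirm that your machine pools are within the supported version range is to compare the cluster version reported by rosa describe cluster with the version listed for each machine pool:
$ rosa describe cluster -c <cluster_name>
$ rosa list machinepools -c <cluster_name>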
4.1.4. Additional resources
4.2. Managing compute nodes
This document describes how to manage compute (also known as worker) nodes with Red Hat OpenShift Service on AWS.
The majority of changes for compute nodes are configured on machine pools. A machine pool is a group of compute nodes in a cluster that have the same configuration, providing ease of management.
You can edit machine pool configuration options such as scaling, adding node labels, and adding taints.
4.2.1. Creating a machine pool
A machine pool is created when you install a Red Hat OpenShift Service on AWS cluster. After installation, you can create additional machine pools for your cluster by using OpenShift Cluster Manager or the ROSA command-line interface (CLI) (rosa).
For users of rosa version 1.2.25 and earlier versions, the machine pool created along with the cluster is identified as Default. For users of rosa version 1.2.26 and later, the machine pool created along with the cluster is identified as worker.
4.2.1.1. Creating a machine pool using OpenShift Cluster Manager
You can create additional machine pools for your Red Hat OpenShift Service on AWS cluster by using OpenShift Cluster Manager.
Prerequisites
- You created a Red Hat OpenShift Service on AWS cluster.
Procedure
- Navigate to OpenShift Cluster Manager and select your cluster.
- Under the Machine pools tab, click Add machine pool.
- Add a Machine pool name.
- Select a Compute node instance type from the list. The instance type defines the vCPU and memory allocation for each compute node in the machine pool.
Note: You cannot change the instance type for a machine pool after the pool is created.
Optional: Configure autoscaling for the machine pool:
- Select Enable autoscaling to automatically scale the number of machines in your machine pool to meet the deployment needs.
- Set the minimum and maximum node count limits for autoscaling. The cluster autoscaler does not reduce or increase the machine pool node count beyond the limits that you specify.
Note: Alternatively, you can set your autoscaling preferences for the machine pool after the machine pool is created.
- If you did not enable autoscaling, select a Compute node count from the drop-down menu. This defines the number of compute nodes to provision to the machine pool for the availability zone.
- Optional: Configure Root disk size.
Optional: Add node labels and taints for your machine pool:
- Expand the Edit node labels and taints menu.
- Under Node labels, add Key and Value entries for your node labels.
- Under Taints, add Key and Value entries for your taints.
Note: Creating a machine pool with taints is only possible if the cluster already has at least one machine pool without a taint.
- For each taint, select an Effect from the drop-down menu. Available options include NoSchedule, PreferNoSchedule, and NoExecute.
Note: Alternatively, you can add the node labels and taints after you create the machine pool.
Optional: Select additional custom security groups to use for nodes in this machine pool. You must have already created the security groups and associated them with the VPC that you selected for this cluster. You cannot add or edit security groups after you create the machine pool.
Important: You can use up to ten additional security groups for machine pools on Red Hat OpenShift Service on AWS clusters.
- Click Add machine pool to create the machine pool.
Verification
- Verify that the machine pool is visible on the Machine pools page and the configuration is as expected.
4.2.1.2. Creating a machine pool using the ROSA CLI
You can create additional machine pools for your Red Hat OpenShift Service on AWS cluster by using the ROSA command-line interface (CLI) (rosa).
Prerequisites
- You installed and configured the latest ROSA CLI on your workstation.
- You logged in to your Red Hat account using the ROSA CLI.
- You created a Red Hat OpenShift Service on AWS cluster.
Procedure
To add a machine pool that does not use autoscaling, create the machine pool and define the instance type, compute (also known as worker) node count, and node labels:
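A general form of the command, using placeholders for the parameters described below, looks like the following; a concrete example appears later in this section:
$ rosa create machinepool --cluster=<cluster_name> \
    --name=<machine_pool_id> \
    --replicas=<replica_count> \
    --instance-type=<instance_type> \
    --labels=<key>=<value>,<key>=<value>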
where:
--name=<machine_pool_id>
- Specifies the name of the machine pool.
--replicas=<replica_count>
- Specifies the number of compute nodes to provision. If you deployed Red Hat OpenShift Service on AWS using a single availability zone, this defines the number of compute nodes to provision to the machine pool for the zone. If you deployed your cluster using multiple availability zones, this defines the number of compute nodes to provision in total across all zones and the count must be a multiple of 3. The --replicas argument is required when autoscaling is not configured.
--instance-type=<instance_type>
- Optional: Sets the instance type for the compute nodes in your machine pool. The instance type defines the vCPU and memory allocation for each compute node in the pool. Replace <instance_type> with an instance type. The default is m5.xlarge. You cannot change the instance type for a machine pool after the pool is created.
--labels=<key>=<value>,<key>=<value>
- Optional: Defines the labels for the machine pool. Replace <key>=<value>,<key>=<value> with a comma-delimited list of key-value pairs, for example --labels=key1=value1,key2=value2.
--taints=<key>=<value>:<effect>,<key>=<value>:<effect>
- Optional: Defines the taints for the machine pool. Replace <key>=<value>:<effect>,<key>=<value>:<effect> with a key, value, and effect for each taint, for example --taints=key1=value1:NoSchedule,key2=value2:NoExecute. Available effects include NoSchedule, PreferNoSchedule, and NoExecute.
--disk-size=<disk_size>
- Optional: Specifies the worker node disk size. The value can be in GB, GiB, TB, or TiB. Replace <disk_size> with a numeric value and unit, for example --disk-size=200GiB.
--availability-zone=<availability_zone_name>
- Optional: You can create a machine pool in an availability zone of your choice. Replace <availability_zone_name> with an availability zone name.
--additional-security-group-ids <sec_group_id>
- Optional: For machine pools in clusters that do not have Red Hat managed VPCs, you can select additional custom security groups to use in your machine pools. You must have already created the security groups and associated them with the VPC that you selected for this cluster. You cannot add or edit security groups after you create the machine pool.
Important: You can use up to ten additional security groups for machine pools on Red Hat OpenShift Service on AWS clusters.
--subnet <subnet_id>
- Optional: For BYO VPC clusters, you can select a subnet to create a Single-AZ machine pool. If the subnet is out of your cluster creation subnets, there must be a tag with a key kubernetes.io/cluster/<infra-id> and value shared. Customers can obtain the Infra ID by using the following command:
$ rosa describe cluster -c <cluster name>|grep "Infra ID:"
Example output
Infra ID: mycluster-xqvj7
Note: You cannot set both --subnet and --availability-zone at the same time; only 1 is allowed for a Single-AZ machine pool creation.
The following example creates a machine pool called mymachinepool that uses the m5.xlarge instance type and has 2 compute node replicas. The example also adds 2 workload-specific labels:

$ rosa create machinepool --cluster=mycluster --name=mymachinepool --replicas=2 --instance-type=m5.xlarge --labels=app=db,tier=backend
Example output

I: Machine pool 'mymachinepool' created successfully on cluster 'mycluster'
I: To view all machine pools, run 'rosa list machinepools -c mycluster'
To add a machine pool that uses autoscaling, create the machine pool and define the autoscaling configuration, instance type, and node labels:
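A representative invocation, assembled from the flags described under where: below (treat this as a sketch; the instance type, label, taint, and availability zone flags are optional):

$ rosa create machinepool --cluster=<cluster_name> \
    --name=<machine_pool_id> \
    --enable-autoscaling \
    --min-replicas=<minimum_replica_count> \
    --max-replicas=<maximum_replica_count> \
    --instance-type=<instance_type> \
    --labels=<key>=<value>,<key>=<value>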
where:
--name=<machine_pool_id>
- Specifies the name of the machine pool. Replace <machine_pool_id> with the name of your machine pool.
--enable-autoscaling
- Enables autoscaling in the machine pool to meet the deployment needs.
--min-replicas=<minimum_replica_count> and --max-replicas=<maximum_replica_count>
- Defines the minimum and maximum compute node limits. The cluster autoscaler does not reduce or increase the machine pool node count beyond the limits that you specify. The --min-replicas and --max-replicas arguments define the autoscaling limits in the machine pool for the availability zone.
--instance-type=<instance_type>
- Optional: Sets the instance type for the compute nodes in your machine pool. The instance type defines the vCPU and memory allocation for each compute node in the pool. Replace <instance_type> with an instance type. The default is m5.xlarge. You cannot change the instance type for a machine pool after the pool is created.
--labels=<key>=<value>,<key>=<value>
- Optional: Defines the labels for the machine pool. Replace <key>=<value>,<key>=<value> with a comma-delimited list of key-value pairs, for example --labels=key1=value1,key2=value2.
--taints=<key>=<value>:<effect>,<key>=<value>:<effect>
- Optional: Defines the taints for the machine pool. Replace <key>=<value>:<effect>,<key>=<value>:<effect> with a key, value, and effect for each taint, for example --taints=key1=value1:NoSchedule,key2=value2:NoExecute. Available effects include NoSchedule, PreferNoSchedule, and NoExecute.
--availability-zone=<availability_zone_name>
- Optional: Creates the machine pool in an availability zone of your choice. Replace <availability_zone_name> with an availability zone name.
The following example creates a machine pool called mymachinepool that uses the m5.xlarge instance type and has autoscaling enabled. The minimum compute node limit is 3 and the maximum is 6 overall. The example also adds 2 workload-specific labels:

$ rosa create machinepool --cluster=mycluster --name=mymachinepool --enable-autoscaling --min-replicas=3 --max-replicas=6 --instance-type=m5.xlarge --labels=app=db,tier=backend
Example output

I: Machine pool 'mymachinepool' created successfully on hosted cluster 'mycluster'
I: To view all machine pools, run 'rosa list machinepools -c mycluster'
To add a Windows License Included enabled machine pool to a Red Hat OpenShift Service on AWS cluster, see AWS Windows License Included for Red Hat OpenShift Service on AWS.
Windows License Included enabled machine pools can only be created when the following criteria are met:
- The host cluster is a Red Hat OpenShift Service on AWS cluster.
- The instance type is bare metal EC2.
Important: AWS Windows License Included for Red Hat OpenShift Service on AWS is a Technology Preview feature only. Technology Preview features are not supported with Red Hat production service level agreements (SLAs) and might not be functionally complete. Red Hat does not recommend using them in production. These features provide early access to upcoming product features, enabling customers to test functionality and provide feedback during the development process.
For more information about the support scope of Red Hat Technology Preview features, see Technology Preview Features Support Scope.
Verification
You can list all machine pools on your cluster or describe individual machine pools.
List the available machine pools on your cluster:
$ rosa list machinepools --cluster=<cluster_name>

Example output

ID             AUTOSCALING  REPLICAS  INSTANCE TYPE  LABELS                TAINTS  AVAILABILITY ZONE  SUBNET                    VERSION  AUTOREPAIR
Default        No           1/1       m5.xlarge                                    us-east-2c         subnet-00552ad67728a6ba3  4.14.34  Yes
mymachinepool  Yes          3/3-6     m5.xlarge      app=db, tier=backend          us-east-2a         subnet-0cb56f5f41880c413  4.14.34  Yes

Describe the information of a specific machine pool in your cluster:

$ rosa describe machinepool --cluster=<cluster_name> --machinepool=mymachinepool

- Verify that the machine pool is included in the output and the configuration is as expected.
4.2.2. Configuring machine pool disk volume
Machine pool disk volume size can be configured for additional flexibility. The default disk size is 300 GiB.
For Red Hat OpenShift Service on AWS clusters, the disk size can be configured from a minimum of 75 GiB to a maximum of 16,384 GiB.
You can configure the machine pool disk size for your cluster by using OpenShift Cluster Manager or the ROSA command-line interface (CLI) (rosa).
Existing cluster and machine pool node volumes cannot be resized.
Prerequisite for cluster creation
- You have the option to select the node disk sizing for the default machine pool during cluster installation.
Procedure for cluster creation
- From the Red Hat OpenShift Service on AWS cluster wizard, navigate to Cluster settings.
- Navigate to the Machine pool step.
- Select the desired Root disk size.
- Select Next to continue creating your cluster.
Prerequisite for machine pool creation
- You have the option to select the node disk sizing for the new machine pool after the cluster has been installed.
Procedure for machine pool creation
- Navigate to OpenShift Cluster Manager and select your cluster.
- Navigate to the Machine pool tab.
- Click Add machine pool.
- Select the desired Root disk size.
- Select Add machine pool to create the machine pool.
4.2.2.1. Configuring machine pool disk volume using the ROSA CLI
Prerequisite for cluster creation
- You have the option to select the root disk sizing for the default machine pool during cluster installation.
Procedure for cluster creation
Run the following command when creating your OpenShift cluster for the desired root disk size:
$ rosa create cluster --worker-disk-size=<disk_size>

The value can be in GB, GiB, TB, or TiB. Replace <disk_size> with a numeric value and unit, for example --worker-disk-size=200GiB. You cannot separate the digit and the unit. No spaces are allowed.
Prerequisite for machine pool creation
- You have the option to select the root disk sizing for the new machine pool after the cluster has been installed.
Procedure for machine pool creation
Scale up the cluster by executing the following command:
$ rosa create machinepool --cluster=<cluster_id> \
    --disk-size=<disk_size>

- Confirm the new machine pool disk volume size by logging in to the AWS console and finding the EC2 virtual machine root volume size.
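As a concrete illustration of the preceding command, the following sketch creates a machine pool with a 200 GiB root disk (the cluster and pool names are illustrative):

$ rosa create machinepool --cluster=mycluster --name=mp-large-disk --replicas=2 --disk-size=200GiB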
4.2.3. Deleting a machine pool
You can delete a machine pool in the event that your workload requirements have changed and your current machine pools no longer meet your needs.
You can delete machine pools using Red Hat OpenShift Cluster Manager or the ROSA command-line interface (CLI) (rosa).
4.2.3.1. Deleting a machine pool using OpenShift Cluster Manager
You can delete a machine pool for your Red Hat OpenShift Service on AWS cluster by using Red Hat OpenShift Cluster Manager.
Prerequisites
- You created a Red Hat OpenShift Service on AWS cluster.
- The cluster is in the ready state.
- You have an existing machine pool without any taints and with at least two instances for a single-AZ cluster or three instances for a multi-AZ cluster.
Procedure
- From OpenShift Cluster Manager, navigate to the Cluster List page and select the cluster that contains the machine pool that you want to delete.
- On the selected cluster, select the Machine pools tab.
- Under the Machine pools tab, click the Options menu for the machine pool that you want to delete.
- Click Delete.
The selected machine pool is deleted.
4.2.3.2. Deleting a machine pool using the ROSA CLI
You can delete a machine pool for your Red Hat OpenShift Service on AWS cluster by using the ROSA command-line interface (CLI) (rosa).
For users of rosa version 1.2.25 and earlier versions, the machine pool (ID='Default') that is created along with the cluster cannot be deleted. For users of rosa version 1.2.26 and later, the machine pool (ID='worker') that is created along with the cluster can be deleted if there is one machine pool within the cluster that contains no taints, and at least two replicas for a Single-AZ cluster or three replicas for a Multi-AZ cluster.
Prerequisites
- You created a Red Hat OpenShift Service on AWS cluster.
- The cluster is in the ready state.
- You have an existing machine pool without any taints and with at least two instances for a Single-AZ cluster or three instances for a Multi-AZ cluster.
Procedure
From the ROSA CLI, run the following command:
$ rosa delete machinepool -c=<cluster_name> <machine_pool_ID>

Example output

? Are you sure you want to delete machine pool <machine_pool_ID> on cluster <cluster_name>? (y/N)

Enter y to delete the machine pool.
The selected machine pool is deleted.
4.2.4. Scaling compute nodes manually
If you have not enabled autoscaling for your machine pool, you can manually scale the number of compute (also known as worker) nodes in the pool to meet your deployment needs.
You must scale each machine pool separately.
Prerequisites
- You installed and configured the latest ROSA command-line interface (CLI) (rosa) on your workstation.
- You logged in to your Red Hat account using the ROSA CLI.
- You created a Red Hat OpenShift Service on AWS cluster.
- You have an existing machine pool.
Procedure
List the machine pools in the cluster:
$ rosa list machinepools --cluster=<cluster_name>

Example output

ID       AUTOSCALING  REPLICAS  INSTANCE TYPE  LABELS  TAINTS  AVAILABILITY ZONES  DISK SIZE  SG IDs
default  No           2         m5.xlarge                      us-east-1a          300GiB     sg-0e375ff0ec4a6cfa2
mp1      No           2         m5.xlarge                      us-east-1a          300GiB     sg-0e375ff0ec4a6cfa2

Increase or decrease the number of compute node replicas in a machine pool:
$ rosa edit machinepool --cluster=<cluster_name> \
    --replicas=<replica_count> \
    <machine_pool_id>
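For example, the following invocation scales the mp1 machine pool to 3 replicas, matching the verification output below (the cluster name is illustrative):

$ rosa edit machinepool --cluster=mycluster --replicas=3 mp1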
Verification
List the available machine pools in your cluster:
$ rosa list machinepools --cluster=<cluster_name>

Example output

ID       AUTOSCALING  REPLICAS  INSTANCE TYPE  LABELS  TAINTS  AVAILABILITY ZONES  DISK SIZE  SG IDs
default  No           2         m5.xlarge                      us-east-1a          300GiB     sg-0e375ff0ec4a6cfa2
mp1      No           3         m5.xlarge                      us-east-1a          300GiB     sg-0e375ff0ec4a6cfa2

- In the output of the preceding command, verify that the compute node replica count is as expected for your machine pool. In the example output, the compute node replica count for the mp1 machine pool is scaled to 3.
4.2.5. Node labels
A label is a key-value pair applied to a Node object. You can use labels to organize sets of objects and control the scheduling of pods.
You can add labels during cluster creation or after. Labels can be modified or updated at any time.
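For example, after labels are applied to the nodes in a machine pool, you can confirm which nodes carry them with a standard label selector query (the label key and value here are illustrative):

$ oc get nodes -l app=db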
4.2.5.1. Adding node labels to a machine pool
Add or edit labels for compute (also known as worker) nodes at any time to manage the nodes in a manner that is relevant to you. For example, you can assign types of workloads to specific nodes.
Labels are assigned as key-value pairs. Each key must be unique to the object it is assigned to.
Prerequisites
- You installed and configured the latest ROSA command-line interface (CLI) (rosa) on your workstation.
- You logged in to your Red Hat account using the ROSA CLI.
- You created a Red Hat OpenShift Service on AWS cluster.
- You have an existing machine pool.
Procedure
List the machine pools in the cluster:
$ rosa list machinepools --cluster=<cluster_name>

Example output

ID           AUTOSCALING  REPLICAS  INSTANCE TYPE  LABELS  TAINTS  AVAILABILITY ZONE  SUBNET                    VERSION  AUTOREPAIR
workers      No           2/2       m5.xlarge                      us-east-2a         subnet-0df2ec3377847164f  4.16.6   Yes
db-nodes-mp  No           2/2       m5.xlarge                      us-east-2a         subnet-0df2ec3377847164f  4.16.6   Yes

Add or update the node labels for a machine pool:
To add or update node labels for a machine pool that does not use autoscaling, run the following command:
$ rosa edit machinepool --cluster=<cluster_name> \
    --labels=<key>=<value>,<key>=<value> \
    <machine_pool_id>

Replace <key>=<value>,<key>=<value> with a comma-delimited list of key-value pairs, for example --labels=key1=value1,key2=value2. This list overwrites any modifications made to node labels on an ongoing basis.
The following example adds labels to the db-nodes-mp machine pool:

$ rosa edit machinepool --cluster=mycluster --replicas=2 --labels=app=db,tier=backend db-nodes-mp

Example output

I: Updated machine pool 'db-nodes-mp' on cluster 'mycluster'
Verification
Describe the details of the machine pool with the new labels:
$ rosa describe machinepool --cluster=<cluster_name> --machinepool=<machine_pool_name>

- Verify that the labels are included for your machine pool in the output.
4.2.6. Adding tags to a machine pool
You can add tags for compute nodes, also known as worker nodes, in a machine pool to introduce custom user tags for the AWS resources that are generated when you provision your machine pool. You cannot edit the tags after you create the machine pool.
4.2.6.1. Adding tags to a machine pool using the ROSA CLI
You can add tags to a machine pool for your Red Hat OpenShift Service on AWS cluster by using the ROSA command-line interface (CLI) (rosa). You cannot edit the tags after you create the machine pool.
You must ensure that your tag keys are not aws, red-hat-managed, red-hat-clustertype, or Name. In addition, you must not set a tag key that begins with kubernetes.io/cluster/. Your tag’s key cannot be longer than 128 characters, while your tag’s value cannot be longer than 256 characters. Red Hat reserves the right to add additional reserved tags in the future.
Prerequisites
- You installed and configured the latest AWS (aws), ROSA (rosa), and OpenShift (oc) CLIs on your workstation.
- You logged in to your Red Hat account by using the ROSA CLI.
- You created a Red Hat OpenShift Service on AWS cluster.
Procedure
Create a machine pool with a custom tag by running the following command:
$ rosa create machinepools --cluster=<name> --replicas=<replica_count> \
    --name <mp_name> --tags='<key> <value>,<key> <value>'

Replace <key> <value>,<key> <value> with a key and value for each tag.
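For instance, a sketch of a concrete invocation (the pool name and the tag keys and values are illustrative):

$ rosa create machinepools --cluster=mycluster --replicas=2 --name tagged-mp --tags='team devops,env production'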
Verification
Use the describe command to see the details of the machine pool with the tags, and verify that the tags are included for your machine pool in the output:

$ rosa describe machinepool --cluster=<cluster_name> --machinepool=<machinepool_name>
4.2.7. Adding taints to a machine pool
You can add taints for compute (also known as worker) nodes in a machine pool to control which pods are scheduled to them. When you apply a taint to a machine pool, the scheduler cannot place a pod on the nodes in the pool unless the pod specification includes a toleration for the taint. Taints can be added to a machine pool using Red Hat OpenShift Cluster Manager or the ROSA command-line interface (CLI) (rosa).
A cluster must have at least one machine pool that does not contain any taints.
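For reference, a minimal sketch of a pod specification that tolerates a taint of key1=value1:NoSchedule; the pod name and image are illustrative:

apiVersion: v1
kind: Pod
metadata:
  name: toleration-example
spec:
  tolerations:
  - key: "key1"          # allows scheduling onto nodes tainted with key1=value1:NoSchedule
    operator: "Equal"
    value: "value1"
    effect: "NoSchedule"
  containers:
  - name: app
    image: registry.access.redhat.com/ubi9/ubi-minimal
    command: ["sleep", "3600"]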
4.2.7.1. Adding taints to a machine pool using OpenShift Cluster Manager
You can add taints to a machine pool for your Red Hat OpenShift Service on AWS cluster by using Red Hat OpenShift Cluster Manager.
Prerequisites
- You created a Red Hat OpenShift Service on AWS cluster.
- You have an existing machine pool that does not contain any taints and contains at least two instances.
Procedure
- Navigate to OpenShift Cluster Manager and select your cluster.
- Under the Machine pools tab, click the Options menu for the machine pool that you want to add a taint to.
- Select Edit taints.
- Add Key and Value entries for your taint.
- Select an Effect for your taint from the list. Available options include NoSchedule, PreferNoSchedule, and NoExecute.
- Optional: Select Add taint if you want to add more taints to the machine pool.
- Click Save to apply the taints to the machine pool.
Verification
- Under the Machine pools tab, select > next to your machine pool to expand the view.
- Verify that your taints are listed under Taints in the expanded view.
4.2.7.2. Adding taints to a machine pool using the ROSA CLI
You can add taints to a machine pool for your Red Hat OpenShift Service on AWS cluster by using the ROSA command-line interface (CLI) (rosa).
For users of rosa version 1.2.25 and prior versions, the number of taints cannot be changed within the machine pool (ID=Default) created along with the cluster. For users of rosa version 1.2.26 and beyond, the number of taints can be changed within the machine pool (ID=worker) created along with the cluster. There must be at least one machine pool without any taints and with at least two replicas.
Prerequisites
- You installed and configured the latest AWS (aws), ROSA (rosa), and OpenShift (oc) CLIs on your workstation.
- You logged in to your Red Hat account by using the rosa CLI.
- You created a Red Hat OpenShift Service on AWS cluster.
- You have an existing machine pool that does not contain any taints and contains at least two instances.
Procedure
List the machine pools in the cluster by running the following command:
$ rosa list machinepools --cluster=<cluster_name>

Example output

ID           AUTOSCALING  REPLICAS  INSTANCE TYPE  LABELS  TAINTS  AVAILABILITY ZONE  SUBNET                    VERSION  AUTOREPAIR
workers      No           2/2       m5.xlarge                      us-east-2a         subnet-0df2ec3377847164f  4.16.6   Yes
db-nodes-mp  No           2/2       m5.xlarge                      us-east-2a         subnet-0df2ec3377847164f  4.16.6   Yes

Add or update the taints for a machine pool:
To add or update taints for a machine pool that does not use autoscaling, run the following command:
$ rosa edit machinepool --cluster=<cluster_name> \
    --taints=<key>=<value>:<effect>,<key>=<value>:<effect> \
    <machine_pool_id>

Replace <key>=<value>:<effect>,<key>=<value>:<effect> with a key, value, and effect for each taint, for example --taints=key1=value1:NoSchedule,key2=value2:NoExecute. Available effects include NoSchedule, PreferNoSchedule, and NoExecute. This list overwrites any modifications made to node taints on an ongoing basis.
The following example adds taints to the db-nodes-mp machine pool:

$ rosa edit machinepool --cluster=mycluster --replicas 2 --taints=key1=value1:NoSchedule,key2=value2:NoExecute db-nodes-mp

Example output

I: Updated machine pool 'db-nodes-mp' on cluster 'mycluster'
Verification
Describe the details of the machine pool with the new taints:
$ rosa describe machinepool --cluster=<cluster_name> --machinepool=<machinepool_name>

- Verify that the taints are included for your machine pool in the output.
4.2.8. Configuring machine pool AutoRepair
Red Hat OpenShift Service on AWS supports an automatic repair process for machine pools, called AutoRepair. AutoRepair is useful when you want the Red Hat OpenShift Service on AWS service to detect certain unhealthy nodes, drain the unhealthy nodes, and re-create the nodes. You can disable AutoRepair if the unhealthy nodes should not be replaced, such as in cases where the nodes should be preserved. AutoRepair is enabled by default on machine pools.
The AutoRepair process deems a node unhealthy when the node is in the NotReady state or an unknown state for a predefined amount of time (typically 8 minutes). Whenever two or more nodes become unhealthy simultaneously, the AutoRepair process stops repairing the nodes. Similarly, when a newly created node remains unhealthy after a predefined amount of time (typically 20 minutes), the service auto-repairs it.
Machine pool AutoRepair is only available for Red Hat OpenShift Service on AWS clusters.
4.2.8.1. Configuring AutoRepair on a machine pool using OpenShift Cluster Manager
You can configure machine pool AutoRepair for your Red Hat OpenShift Service on AWS cluster by using Red Hat OpenShift Cluster Manager.
Prerequisites
- You created a Red Hat OpenShift Service on AWS cluster.
- You have an existing machine pool.
Procedure
- Navigate to OpenShift Cluster Manager and select your cluster.
- Under the Machine pools tab, click the Options menu for the machine pool that you want to configure AutoRepair for.
- From the menu, select Edit.
- From the Edit Machine Pool dialog box that displays, find the AutoRepair option.
- Select or clear the box next to AutoRepair to enable or disable it.
- Click Save to apply the change to the machine pool.
Verification
- Under the Machine pools tab, select > next to your machine pool to expand the view.
- Verify that your machine pool has the correct AutoRepair setting in the expanded view.
4.2.8.2. Configuring machine pool AutoRepair using the ROSA CLI
You can configure machine pool AutoRepair for your Red Hat OpenShift Service on AWS cluster by using the ROSA command-line interface (CLI) (rosa).
Prerequisites
- You installed and configured the latest AWS (aws) and ROSA (rosa) CLIs on your workstation.
- You logged in to your Red Hat account by using the rosa CLI.
- You created a Red Hat OpenShift Service on AWS cluster.
- You have an existing machine pool.
Procedure
List the machine pools in the cluster by running the following command:
$ rosa list machinepools --cluster=<cluster_name>

Example output

ID           AUTOSCALING  REPLICAS  INSTANCE TYPE  LABELS  TAINTS  AVAILABILITY ZONE  SUBNET                    VERSION  AUTOREPAIR
workers      No           2/2       m5.xlarge                      us-east-2a         subnet-0df2ec3377847164f  4.16.6   Yes
db-nodes-mp  No           2/2       m5.xlarge                      us-east-2a         subnet-0df2ec3377847164f  4.16.6   Yes

Enable or disable AutoRepair on a machine pool:
To disable AutoRepair for a machine pool, run the following command:
$ rosa edit machinepool --cluster=mycluster --machinepool=<machinepool_name> --autorepair=false

To enable AutoRepair for a machine pool, run the following command:

$ rosa edit machinepool --cluster=mycluster --machinepool=<machinepool_name> --autorepair=true

Example output

I: Updated machine pool 'machinepool_name' on cluster 'mycluster'
Verification
Describe the details of the machine pool:
$ rosa describe machinepool --cluster=<cluster_name> --machinepool=<machinepool_name>

- Verify that the AutoRepair setting is correct for your machine pool in the output.
4.2.9. Adding node tuning to a machine pool
You can add tunings for compute, also called worker, nodes in a machine pool to control their configuration on Red Hat OpenShift Service on AWS clusters.
Prerequisites
- You installed and configured the latest ROSA command-line interface (CLI) (rosa) on your workstation.
- You logged in to your Red Hat account by using the rosa CLI.
- You created a Red Hat OpenShift Service on AWS cluster.
- You have an existing machine pool.
- You have an existing tuning configuration.
Procedure
List all of the machine pools in the cluster:
$ rosa list machinepools --cluster=<cluster_name>

Example output

ID           AUTOSCALING  REPLICAS  INSTANCE TYPE  LABELS  TAINTS  AVAILABILITY ZONE  SUBNET                    VERSION  AUTOREPAIR
db-nodes-mp  No           0/2       m5.xlarge                      us-east-2a         subnet-08d4d81def67847b6  4.14.34  Yes
workers      No           2/2       m5.xlarge                      us-east-2a         subnet-08d4d81def67847b6  4.14.34  Yes

You can add tuning configurations to an existing or new machine pool.
Add tunings when creating a machine pool:
$ rosa create machinepool -c <cluster-name> --name <machinepoolname> --tuning-configs <tuning_config_name>

Example output

? Tuning configs: sample-tuning
I: Machine pool 'db-nodes-mp' created successfully on hosted cluster 'sample-cluster'
I: To view all machine pools, run 'rosa list machinepools -c sample-cluster'

Add or update the tunings for a machine pool:

$ rosa edit machinepool -c <cluster-name> --machinepool <machinepoolname> --tuning-configs <tuning_config_name>

Example output

I: Updated machine pool 'db-nodes-mp' on cluster 'mycluster'
Verification
Describe the machine pool for which you added a tuning config:
$ rosa describe machinepool --cluster=<cluster_name> --machinepool=<machine_pool_name>

- Verify that the tuning config is included for your machine pool in the output.
4.2.10. Configuring node drain grace periods
You can configure the node drain grace period for machine pools in your cluster. The node drain grace period for a machine pool is how long the cluster respects the Pod Disruption Budget protected workloads when upgrading or replacing the machine pool. After this grace period, all remaining workloads are forcibly evicted. The value range for the node drain grace period is from 0 to 1 week. With the default value 0, or an empty value, the machine pool drains without any time limitation until complete.
Prerequisites
- You installed and configured the latest ROSA command-line interface (CLI) (rosa) on your workstation.
- You created a Red Hat OpenShift Service on AWS cluster.
- You have an existing machine pool.
Procedure
List all of the machine pools in the cluster by running the following command:
$ rosa list machinepools --cluster=<cluster_name>

Example output

ID           AUTOSCALING  REPLICAS  INSTANCE TYPE  LABELS  TAINTS  AVAILABILITY ZONE  SUBNET                    VERSION  AUTOREPAIR
db-nodes-mp  No           2/2       m5.xlarge                      us-east-2a         subnet-08d4d81def67847b6  4.14.34  Yes
workers      No           2/2       m5.xlarge                      us-east-2a         subnet-08d4d81def67847b6  4.14.34  Yes

Check the node drain grace period for a machine pool by running the following command:
$ rosa describe machinepool --cluster <cluster_name> --machinepool=<machinepool_name>

Example output

ID:                         workers
Cluster ID:                 2a90jdl0i4p9r9k9956v5ocv40se1kqs
...
Node drain grace period:
...

If the Node drain grace period value is empty, the machine pool drains without any time limitation until complete.
Optional: Update the node drain grace period for a machine pool by running the following command:
$ rosa edit machinepool --node-drain-grace-period="<node_drain_grace_period_value>" --cluster=<cluster_name> <machinepool_name>

Note: Changing the node drain grace period during a machine pool upgrade applies to future upgrades, not in-progress upgrades.
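For example, a sketch that sets a 30 minute grace period on the workers machine pool (the cluster name is illustrative; the value format follows the describe output shown in the verification step):

$ rosa edit machinepool --node-drain-grace-period="30 minutes" --cluster=mycluster workers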
Verification
Check the node drain grace period for a machine pool by running the following command:
$ rosa describe machinepool --cluster <cluster_name> <machinepool_name>

Example output

ID:                         workers
Cluster ID:                 2a90jdl0i4p9r9k9956v5ocv40se1kqs
...
Node drain grace period:    30 minutes
...

- Verify the correct Node drain grace period for your machine pool in the output.
4.2.11. Additional resources
4.3. About autoscaling nodes on a cluster
The autoscaler option can be configured to automatically scale the number of machines in a machine pool.
The cluster autoscaler increases the size of the machine pool when there are pods that failed to schedule on any of the current nodes due to insufficient resources or when another node is necessary to meet deployment needs. The cluster autoscaler does not increase the cluster resources beyond the limits that you specify.
Additionally, the cluster autoscaler decreases the size of the machine pool when some nodes are consistently not needed for a significant period, such as when a node has low resource use and all of its important pods can fit on other nodes.
When you enable autoscaling, you must also set a minimum and maximum number of worker nodes.
Only cluster owners and organization admins can scale or delete a cluster.
4.3.1. Enabling autoscaling nodes on a cluster
You can enable autoscaling on worker nodes to increase or decrease the number of nodes available by editing the machine pool definition for an existing cluster.
Enabling autoscaling nodes in an existing cluster using Red Hat OpenShift Cluster Manager
Enable autoscaling for worker nodes in the machine pool definition from the OpenShift Cluster Manager console.
Procedure
- From OpenShift Cluster Manager, navigate to the Cluster List page and select the cluster that you want to enable autoscaling for.
- On the selected cluster, select the Machine pools tab.
- Click the Options menu at the end of the machine pool that you want to enable autoscaling for and select Edit.
- On the Edit machine pool dialog, select the Enable autoscaling checkbox.
- Select Save to save these changes and enable autoscaling for the machine pool.
Enabling autoscaling nodes in an existing cluster using the ROSA CLI
Configure autoscaling to dynamically scale the number of worker nodes up or down based on load.
Successful autoscaling is dependent on having the correct AWS resource quotas in your AWS account. Verify resource quotas and request quota increases from the AWS console.
Procedure
To identify the machine pool IDs in a cluster, enter the following command:
$ rosa list machinepools --cluster=<cluster_name>

Example output

ID       AUTOSCALING  REPLICAS  INSTANCE TYPE  LABELS  TAINTS  AVAILABILITY ZONE  SUBNET                    VERSION  AUTOREPAIR
workers  No           2/2       m5.xlarge                      us-east-2a         subnet-03c2998b482bf3b20  4.16.6   Yes
mp1      No           2/2       m5.xlarge                      us-east-2a         subnet-03c2998b482bf3b20  4.16.6   Yes

- Get the ID of the machine pools that you want to configure.
To enable autoscaling on a machine pool, enter the following command:
$ rosa edit machinepool --cluster=<cluster_name> <machinepool_ID> --enable-autoscaling --min-replicas=<number> --max-replicas=<number>

Example

Enable autoscaling on a machine pool with the ID mp1 on a cluster named mycluster, with the number of replicas set to scale between 2 and 5 worker nodes:

$ rosa edit machinepool --cluster=mycluster mp1 --enable-autoscaling --min-replicas=2 --max-replicas=5
4.3.2. Disabling autoscaling nodes on a cluster
You can disable autoscaling on worker nodes by editing the machine pool definition for an existing cluster.
You can disable autoscaling on a cluster using Red Hat OpenShift Cluster Manager or the ROSA command-line interface (CLI) (rosa).
Disabling autoscaling nodes in an existing cluster using Red Hat OpenShift Cluster Manager
Disable autoscaling for worker nodes in the machine pool definition from OpenShift Cluster Manager.
Procedure
- From OpenShift Cluster Manager, navigate to the Cluster List page and select the cluster with autoscaling that must be disabled.
- On the selected cluster, select the Machine pools tab.
- Click the Options menu at the end of the machine pool with autoscaling and select Edit.
- On the Edit machine pool dialog, deselect the Enable autoscaling checkbox.
- Select Save to save these changes and disable autoscaling from the machine pool.
Disabling autoscaling nodes in an existing cluster using the ROSA CLI
Disable autoscaling for worker nodes in the machine pool definition using the ROSA command-line interface (CLI) (rosa).
Procedure
Enter the following command:
$ rosa edit machinepool --cluster=<cluster_name> <machinepool_ID> --enable-autoscaling=false --replicas=<number>

Example

Disable autoscaling on the default machine pool on a cluster named mycluster:

$ rosa edit machinepool --cluster=mycluster default --enable-autoscaling=false --replicas=3
4.3.3. Additional resources
4.4. Configuring cluster memory to meet container memory and risk requirements
As a cluster administrator, you can help your clusters operate efficiently through managing application memory by:
- Determining the memory and risk requirements of a containerized application component and configuring the container memory parameters to suit those requirements.
- Configuring containerized application runtimes (for example, OpenJDK) to adhere optimally to the configured container memory parameters.
- Diagnosing and resolving memory-related error conditions associated with running in a container.
4.4.1. Understanding managing application memory
It is recommended to fully read the overview of how Red Hat OpenShift Service on AWS manages Compute Resources before proceeding.
For each kind of resource (memory, CPU, storage), Red Hat OpenShift Service on AWS allows optional request and limit values to be placed on each container in a pod.
Note the following about memory requests and memory limits:
Memory request
- The memory request value, if specified, influences the Red Hat OpenShift Service on AWS scheduler. The scheduler considers the memory request when scheduling a container to a node, then fences off the requested memory on the chosen node for the use of the container.
- If a node’s memory is exhausted, Red Hat OpenShift Service on AWS prioritizes evicting its containers whose memory usage most exceeds their memory request. In serious cases of memory exhaustion, the node OOM killer may select and kill a process in a container based on a similar metric.
- The cluster administrator can assign quota or assign default values for the memory request value.
- The cluster administrator can override the memory request values that a developer specifies, to manage cluster overcommit.
Memory limit
- The memory limit value, if specified, provides a hard limit on the memory that can be allocated across all the processes in a container.
- If the memory allocated by all of the processes in a container exceeds the memory limit, the node Out of Memory (OOM) killer will immediately select and kill a process in the container.
- If both memory request and limit are specified, the memory limit value must be greater than or equal to the memory request.
- The cluster administrator can assign quota or assign default values for the memory limit value.
- The minimum memory limit is 12 MB. If a container fails to start due to a Cannot allocate memory pod event, the memory limit is too low. Either increase or remove the memory limit. Removing the limit allows pods to consume unbounded node resources.
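For illustration, a minimal sketch of a pod that sets both values (the pod name, image, and sizes are arbitrary):

apiVersion: v1
kind: Pod
metadata:
  name: memory-demo
spec:
  containers:
  - name: app
    image: registry.access.redhat.com/ubi9/ubi-minimal
    command: ["sleep", "3600"]
    resources:
      requests:
        memory: 256Mi   # the scheduler reserves this much memory on the chosen node
      limits:
        memory: 512Mi   # hard ceiling; exceeding it triggers the OOM killer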
4.4.1.1. Managing application memory strategy
The steps for sizing application memory on Red Hat OpenShift Service on AWS are as follows:
Determine expected container memory usage
Determine expected mean and peak container memory usage, empirically if necessary (for example, by separate load testing). Remember to consider all the processes that may potentially run in parallel in the container: for example, does the main application spawn any ancillary scripts?
Determine risk appetite
Determine risk appetite for eviction. If the risk appetite is low, the container should request memory according to the expected peak usage plus a percentage safety margin. If the risk appetite is higher, it may be more appropriate to request memory according to the expected mean usage.
Set container memory request
Set container memory request based on the above. The more accurately the request represents the application memory usage, the better. If the request is too high, cluster and quota usage will be inefficient. If the request is too low, the chances of application eviction increase.
Set container memory limit, if required
Set container memory limit, if required. Setting a limit has the effect of immediately killing a container process if the combined memory usage of all processes in the container exceeds the limit, and is therefore a mixed blessing. On the one hand, it may make unanticipated excess memory usage obvious early ("fail fast"); on the other hand it also terminates processes abruptly.
Note that some Red Hat OpenShift Service on AWS clusters may require a limit value to be set; some may override the request based on the limit; and some application images rely on a limit value being set as this is easier to detect than a request value.
If the memory limit is set, it should not be set to less than the expected peak container memory usage plus a percentage safety margin.
Ensure application is tuned
Ensure application is tuned with respect to configured request and limit values, if appropriate. This step is particularly relevant to applications which pool memory, such as the JVM. The rest of this page discusses this.
4.4.2. Understanding OpenJDK settings for Red Hat OpenShift Service on AWS
The default OpenJDK settings do not work well with containerized environments. As a result, some additional Java memory settings must always be provided whenever running the OpenJDK in a container.
The JVM memory layout is complex, version dependent, and describing it in detail is beyond the scope of this documentation. However, as a starting point for running OpenJDK in a container, at least the following three memory-related tasks are key:
- Overriding the JVM maximum heap size.
- Encouraging the JVM to release unused memory to the operating system, if appropriate.
- Ensuring all JVM processes within a container are appropriately configured.
Optimally tuning JVM workloads for running in a container is beyond the scope of this documentation, and may involve setting multiple additional JVM options.
4.4.2.1. Understanding how to override the JVM maximum heap size
OpenJDK defaults to using a maximum of 25% of available memory (recognizing any container memory limits in place) for "heap" memory. This default value is conservative, and, in a properly-configured container environment, this value would result in 75% of the memory assigned to a container being mostly unused. A much higher percentage for the JVM to use for heap memory, such as 80%, is more suitable in a container context where memory limits are imposed on the container level.
Most of the Red Hat containers include a startup script that replaces the OpenJDK default by setting updated values when the JVM launches.
For example, the Red Hat build of OpenJDK containers have a default value of 80%. This value can be set to a different percentage by defining the JAVA_MAX_RAM_RATIO environment variable.
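For instance, a sketch of setting that variable on a deployment with oc set env (the deployment name and percentage are illustrative; confirm the expected value format in the container image documentation):

$ oc set env deployment/<app_name> JAVA_MAX_RAM_RATIO=60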
For other OpenJDK deployments, the default value of 25% can be changed by using the following command:
Example
$ java -XX:MaxRAMPercentage=80.0
4.4.2.2. Understanding how to encourage the JVM to release unused memory to the operating system
By default, the OpenJDK does not aggressively return unused memory to the operating system. This may be appropriate for many containerized Java workloads, but notable exceptions include workloads where additional active processes co-exist with a JVM within a container, whether those additional processes are native, additional JVMs, or a combination of the two.
Java-based agents can use the following JVM arguments to encourage the JVM to release unused memory to the operating system:
-XX:+UseParallelGC -XX:MinHeapFreeRatio=5 -XX:MaxHeapFreeRatio=10 -XX:GCTimeRatio=4 -XX:AdaptiveSizePolicyWeight=90
These arguments are intended to return heap memory to the operating system whenever allocated memory exceeds 110% of in-use memory (-XX:MaxHeapFreeRatio), spending up to 20% of CPU time in the garbage collector (-XX:GCTimeRatio). At no time will the application heap allocation be less than the initial heap allocation (overridden by -XX:InitialHeapSize / -Xms). Detailed additional information is available in Tuning Java’s footprint in OpenShift (Part 1), Tuning Java’s footprint in OpenShift (Part 2), and OpenJDK and Containers.
4.4.2.3. Understanding how to ensure all JVM processes within a container are appropriately configured
In the case that multiple JVMs run in the same container, it is essential to ensure that they are all configured appropriately. For many workloads it will be necessary to grant each JVM a percentage memory budget, leaving a perhaps substantial additional safety margin.
Many Java tools use different environment variables (JAVA_OPTS, GRADLE_OPTS, and so on) to configure their JVMs, and it can be challenging to ensure that the right settings are being passed to the right JVM.
The JAVA_TOOL_OPTIONS environment variable is always respected by the OpenJDK, and values specified in JAVA_TOOL_OPTIONS will be overridden by other options specified on the JVM command line. To ensure that these options are used by default for all JVM workloads run in the Java-based agent image, the Red Hat OpenShift Service on AWS Jenkins Maven agent image sets the following variable:
JAVA_TOOL_OPTIONS="-Dsun.zip.disableMemoryMapping=true"
This does not guarantee that additional options are not required, but is intended to be a helpful starting point.
4.4.3. Finding the memory request and limit from within a pod
An application wishing to dynamically discover its memory request and limit from within a pod should use the Downward API.
Procedure
Configure the pod to add the MEMORY_REQUEST and MEMORY_LIMIT stanzas:
Create a YAML file similar to the following:
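A sketch of such a pod specification, consistent with the example output shown in the verification steps (384Mi request, 512Mi limit); the image is illustrative:

apiVersion: v1
kind: Pod
metadata:
  name: test
spec:
  containers:
  - name: test
    image: registry.access.redhat.com/ubi9/ubi-minimal
    command: ["sleep", "3600"]
    resources:
      requests:
        memory: 384Mi
      limits:
        memory: 512Mi
    env:
    - name: MEMORY_REQUEST        # exposes the container memory request through the Downward API
      valueFrom:
        resourceFieldRef:
          containerName: test
          resource: requests.memory
    - name: MEMORY_LIMIT          # exposes the container memory limit through the Downward API
      valueFrom:
        resourceFieldRef:
          containerName: test
          resource: limits.memory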
Create the pod by running the following command:

$ oc create -f <file_name>.yaml
Verification
Access the pod using a remote shell:
$ oc rsh test

Check that the requested values were applied:

$ env | grep MEMORY | sort

Example output

MEMORY_LIMIT=536870912
MEMORY_REQUEST=402653184
The memory limit value can also be read from inside the container from the /sys/fs/cgroup/memory/memory.limit_in_bytes file.
4.4.4. Understanding OOM kill policy
Red Hat OpenShift Service on AWS can kill a process in a container if the total memory usage of all the processes in the container exceeds the memory limit, or in serious cases of node memory exhaustion.
When a process is Out of Memory (OOM) killed, this might result in the container exiting immediately. If the container PID 1 process receives the SIGKILL, the container will exit immediately. Otherwise, the container behavior is dependent on the behavior of the other processes.
For example, a container process that exits with code 137 indicates that it received a SIGKILL signal.
If the container does not exit immediately, an OOM kill is detectable as follows:
Access the pod using a remote shell:
oc rsh <pod name>
# oc rsh <pod name>
Copy to Clipboard Copied! Toggle word wrap Toggle overflow Run the following command to see the current OOM kill count in
/sys/fs/cgroup/memory/memory.oom_control
:grep '^oom_kill ' /sys/fs/cgroup/memory/memory.oom_control
$ grep '^oom_kill ' /sys/fs/cgroup/memory/memory.oom_control
Copy to Clipboard Copied! Toggle word wrap Toggle overflow Example output
oom_kill 0
oom_kill 0
Copy to Clipboard Copied! Toggle word wrap Toggle overflow Run the following command to provoke an OOM kill:
sed -e '' </dev/zero
$ sed -e '' </dev/zero
Copy to Clipboard Copied! Toggle word wrap Toggle overflow Example output
Killed
Killed
Copy to Clipboard Copied! Toggle word wrap Toggle overflow Run the following command to see that the OOM kill counter in
/sys/fs/cgroup/memory/memory.oom_control
incremented:grep '^oom_kill ' /sys/fs/cgroup/memory/memory.oom_control
$ grep '^oom_kill ' /sys/fs/cgroup/memory/memory.oom_control
Example output
oom_kill 1
If one or more processes in a pod are OOM killed, when the pod subsequently exits, whether immediately or not, it will have phase Failed and reason OOMKilled. An OOM-killed pod might be restarted depending on the value of restartPolicy. If not restarted, controllers such as the replication controller will notice the pod's failed status and create a new pod to replace the old one.
Use the following command to get the pod status:
$ oc get pod test
Example output
NAME      READY   STATUS      RESTARTS   AGE
test      0/1     OOMKilled   0          1m
If the pod has not restarted, run the following command to view the pod:
$ oc get pod test -o yaml
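Output similar to the following abbreviated status section indicates that the container was OOM killed and that the pod has failed; the exact fields in your output might differ:
...
status:
  containerStatuses:
  - name: test
    ready: false
    restartCount: 0
    state:
      terminated:
        exitCode: 137
        reason: OOMKilled
  phase: Failed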
If the pod has restarted, run the following command to view the pod:
$ oc get pod test -o yaml
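In this case, output similar to the following abbreviated status section shows the earlier OOM kill under lastState while the container runs again; the timestamp is a placeholder and the exact fields in your output might differ:
...
status:
  containerStatuses:
  - name: test
    ready: true
    restartCount: 1
    lastState:
      terminated:
        exitCode: 137
        reason: OOMKilled
    state:
      running:
        startedAt: "2025-01-01T00:00:00Z"
  phase: Running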
4.4.5. Understanding pod eviction
Red Hat OpenShift Service on AWS may evict a pod from its node when the node's memory is exhausted. Depending on the extent of memory exhaustion, the eviction may or may not be graceful. Graceful eviction means that the main process (PID 1) of each container receives a SIGTERM signal, followed some time later by a SIGKILL signal if the process has not already exited. Non-graceful eviction means that the main process of each container immediately receives a SIGKILL signal.
An evicted pod has phase Failed and reason Evicted. It will not be restarted, regardless of the value of restartPolicy. However, controllers such as the replication controller will notice the pod's failed status and create a new pod to replace the old one.
Use the following command to get the pod status:
$ oc get pod test
Example output
NAME      READY   STATUS    RESTARTS   AGE
test      0/1     Evicted   0          1m
Run the following command to view the pod details:
$ oc get pod test -o yaml
Example output
...
status:
message: 'Pod The node was low on resource: [MemoryPressure].'
phase: Failed
reason: Evicted
Chapter 5. Configuring PID limits
A process identifier (PID) is a unique identifier assigned by the Linux kernel to each process or thread currently running on a system. The number of processes that can run simultaneously on a system is limited to 4,194,304 by the Linux kernel. This number might also be affected by limited access to other system resources such as memory, CPU, and disk space.
In Red Hat OpenShift Service on AWS 4.11 and later, by default, a pod can have a maximum of 4,096 PIDs. If your workload requires more than that, you can increase the allowed maximum number of PIDs by configuring a KubeletConfig object.
Red Hat OpenShift Service on AWS clusters running versions earlier than 4.11 use a default PID limit of 1024.
5.1. Understanding process ID limits
In Red Hat OpenShift Service on AWS, consider these two supported limits for process ID (PID) usage before you schedule work on your cluster:
- Maximum number of PIDs per pod: The default value is 4,096 in Red Hat OpenShift Service on AWS 4.11 and later. This value is controlled by the podPidsLimit parameter set on the node.
- Maximum number of PIDs per node: The default value depends on node resources. In Red Hat OpenShift Service on AWS, this value is controlled by the --system-reserved parameter, which reserves PIDs on each node based on the total resources of the node.
When a pod exceeds the allowed maximum number of PIDs per pod, the pod might stop functioning correctly and might be evicted from the node. See the Kubernetes documentation for eviction signals and thresholds for more information.
When a node exceeds the allowed maximum number of PIDs per node, the node can become unstable because new processes cannot have PIDs assigned. If existing processes cannot complete without creating additional processes, the entire node can become unusable and require a reboot. This situation can result in data loss, depending on the processes and applications being run. Customer administrators and Red Hat Site Reliability Engineering are notified when this threshold is reached, and a Worker node is experiencing PIDPressure warning appears in the cluster logs.
5.2. Risks of setting higher process ID limits for Red Hat OpenShift Service on AWS pods
The podPidsLimit
parameter for a pod controls the maximum number of processes and threads that can run simultaneously in that pod.
You can increase the value for podPidsLimit
from the default of 4,096 to a maximum of 16,384. Changing this value might incur downtime for applications, because changing the podPidsLimit
requires rebooting the affected node.
If you are running a large number of pods per node, and you have a high podPidsLimit
value on your nodes, you risk exceeding the PID maximum for the node.
To find the maximum number of pods that you can run simultaneously on a single node without exceeding the PID maximum for the node, divide 3,650,000 by your podPidsLimit
value. For example, if your podPidsLimit
value is 16,384, and you expect the pods to use close to that number of process IDs, you can safely run 222 pods on a single node.
Memory, CPU, and available storage can also limit the maximum number of pods that can run simultaneously, even when the podPidsLimit
value is set appropriately. For more information, see "Planning your environment" and "Limits and scalability".
5.3. Setting a higher process ID limit on a machine pool in a Red Hat OpenShift Service on AWS cluster
You can set a higher podPidsLimit
for machine pools in an existing Red Hat OpenShift Service on AWS cluster by creating or editing a KubeletConfig
object that changes the --pod-pids-limit
parameter.
Changing the podPidsLimit
on an existing machine pool triggers nodes in the machine pool to reboot one at a time. Make this change outside of peak usage hours for workloads in your machine pool and avoid upgrading or hibernating your cluster until all nodes have rebooted.
Prerequisites
- You have a Red Hat OpenShift Service on AWS cluster.
- You have installed the ROSA command-line interface (CLI) (rosa).
- You have logged in to your Red Hat account by using the ROSA CLI.
Procedure
Create a new KubeletConfig object for your cluster that specifies a new --pod-pids-limit:
$ rosa create kubeletconfig -c <cluster_name> --name=<kubeletconfig_name> --pod-pids-limit=<value>
For example, the following command creates a set-high-pids KubeletConfig object for the my-cluster cluster that sets a maximum of 16,384 PIDs per pod:
$ rosa create kubeletconfig -c my-cluster --name=set-high-pids --pod-pids-limit=16384
Associate the new KubeletConfig object with a new or existing machine pool.
For a new machine pool:
$ rosa create machinepool -c <cluster_name> --name <machinepool_name> --kubelet-configs=<kubeletconfig_name>
For an existing machine pool:
$ rosa edit machinepool -c <cluster_name> --kubelet-configs=<kubeletconfig_name> <machinepool_name>
Example output
Editing the kubelet config will cause the Nodes for your Machine Pool to be recreated. This may cause outages to your applications. Do you wish to continue? (y/N)
For example, the following command associates the set-high-pids KubeletConfig object with the high-pid-pool machine pool in the my-cluster cluster:
$ rosa edit machinepool -c my-cluster --kubelet-configs=set-high-pids high-pid-pool
A rolling reboot of worker nodes is triggered when a new KubeletConfig object is attached to an existing machine pool. You can check the progress of the rollout in the machine pool description:
$ rosa describe machinepool --cluster <cluster_name> --machinepool <machinepool_name>
Verification
Confirm that the new setting is in place on nodes in the machine pool:
$ rosa describe kubeletconfig --cluster=<cluster_name> --name <kubeletconfig_name>
The new PIDs limit appears in the output, as shown in the following example:
Example output
Pod Pids Limit: 16384
5.4. Removing custom configuration from a machine pool
You can remove custom configuration from your machine pools by removing the KubeletConfig object that contains the configuration details.
Prerequisites
- You have an existing Red Hat OpenShift Service on AWS cluster.
- You have installed the ROSA CLI (rosa).
- You have logged in to your Red Hat account by using the ROSA CLI.
Procedure
Edit the machine pool and set the --kubelet-configs parameter so that the KubeletConfig object you want to remove is omitted.
To remove all KubeletConfig objects from the machine pool, set an empty value for the --kubelet-configs parameter, for example:
$ rosa edit machinepool -c <cluster_name> --kubelet-configs="" <machinepool_name>
Verification steps
Confirm that the KubeletConfig object you removed is not visible in the machine pool description:
$ rosa describe machinepool --cluster <cluster_name> --machinepool=<machinepool_name>
Chapter 6. Managing a cluster with multi-architecture compute machines
Managing a cluster that has nodes with multiple architectures requires you to consider node architecture as you monitor the cluster and manage your workloads. You must take these additional considerations into account when you schedule workloads in a multi-architecture cluster.
6.1. Scheduling workloads on clusters with multi-architecture compute machines
When you deploy workloads on a cluster with compute nodes that use different architectures, you must align pod architecture with the architecture of the underlying node. Your workload might also require additional configuration for particular resources, depending on the underlying node architecture.
You can use the Multiarch Tuning Operator to enable architecture-aware scheduling of workloads on clusters with multi-architecture compute machines. The Multiarch Tuning Operator implements additional scheduler predicates in the pod specifications based on the architectures that the pods can support at creation time.
For information about the Multiarch Tuning Operator, see Managing workloads on multi-architecture clusters by using the Multiarch Tuning Operator.
6.1.1. Sample multi-architecture node workload deployments
Scheduling a workload to an appropriate node based on architecture works in the same way as scheduling based on any other node characteristic.
- Using nodeAffinity to schedule workloads on nodes with specific architectures
To allow a workload to be scheduled only on nodes with architectures supported by its images, set the spec.affinity.nodeAffinity field in your pod's template specification.
Example deployment with node affinity set
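A deployment specification similar to the following restricts scheduling to nodes that support the listed architectures. The metadata, labels, and container details shown here are placeholders; only the affinity stanza is significant:
apiVersion: apps/v1
kind: Deployment
metadata:
  name: example-multiarch
spec:
  replicas: 1
  selector:
    matchLabels:
      app: example-multiarch
  template:
    metadata:
      labels:
        app: example-multiarch
    spec:
      affinity:
        nodeAffinity:
          requiredDuringSchedulingIgnoredDuringExecution:
            nodeSelectorTerms:
            - matchExpressions:
              - key: kubernetes.io/arch   # well-known node label for CPU architecture
                operator: In
                values: # 1
                - amd64
                - arm64
      containers:
      - name: example-multiarch
        image: registry.example.com/example/multiarch-image:latest   # placeholder image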
1 Specify the supported architectures. Valid values include amd64, arm64, or both values.
Legal Notice
Copyright © 2025 Red Hat
OpenShift documentation is licensed under the Apache License 2.0 (https://www.apache.org/licenses/LICENSE-2.0).
Modified versions must remove all Red Hat trademarks.
Portions adapted from https://github.com/kubernetes-incubator/service-catalog/ with modifications by Red Hat.
Red Hat, Red Hat Enterprise Linux, the Red Hat logo, the Shadowman logo, JBoss, OpenShift, Fedora, the Infinity logo, and RHCE are trademarks of Red Hat, Inc., registered in the United States and other countries.
Linux® is the registered trademark of Linus Torvalds in the United States and other countries.
Java® is a registered trademark of Oracle and/or its affiliates.
XFS® is a trademark of Silicon Graphics International Corp. or its subsidiaries in the United States and/or other countries.
MySQL® is a registered trademark of MySQL AB in the United States, the European Union and other countries.
Node.js® is an official trademark of Joyent. Red Hat Software Collections is not formally related to or endorsed by the official Joyent Node.js open source or commercial project.
The OpenStack® Word Mark and OpenStack logo are either registered trademarks/service marks or trademarks/service marks of the OpenStack Foundation, in the United States and other countries and are used with the OpenStack Foundation’s permission. We are not affiliated with, endorsed or sponsored by the OpenStack Foundation, or the OpenStack community.
All other trademarks are the property of their respective owners.