8.11. 集群部署故障排除
本文档论述了如何对集群部署错误进行故障排除。
8.11.1. 获取失败集群的信息
如果集群部署失败,集群将进入"错误"状态。
流程
运行以下命令来获取更多信息:
$ rosa describe cluster -c <my_cluster_name> --debug
8.11.2. 无法创建具有 osdCcsAdmin
错误的集群
如果集群创建操作失败,您可以收到以下出错信息。
输出示例
Failed to create cluster: Unable to create cluster spec: Failed to get access keys for user 'osdCcsAdmin': NoSuchEntity: The user with name osdCcsAdmin cannot be found.
流程
解决此问题的步骤:
删除堆栈:
$ rosa init --delete
重新初始化您的帐户:
$ rosa init
8.11.3. 创建 Elastic Load Balancing (ELB) 服务链接的角色
如果您还没有在 AWS 帐户中创建负载均衡器,则 Elastic Load Balancing (ELB) 的服务链接角色可能尚不存在。您可能会收到以下错误:
Error: Error creating network Load Balancer: AccessDenied: User: arn:aws:sts::xxxxxxxxxxxx:assumed-role/ManagedOpenShift-Installer-Role/xxxxxxxxxxxxxxxxxxx is not authorized to perform: iam:CreateServiceLinkedRole on resource: arn:aws:iam::xxxxxxxxxxxx:role/aws-service-role/elasticloadbalancing.amazonaws.com/AWSServiceRoleForElasticLoadBalancing"
流程
要解决这个问题,请确保您的 AWS 帐户中存在角色。如果没有,使用以下命令创建此角色:
aws iam get-role --role-name "AWSServiceRoleForElasticLoadBalancing" || aws iam create-service-linked-role --aws-service-name "elasticloadbalancing.amazonaws.com"
注意
此命令只需要为每个帐户执行一次。
8.11.4. 修复无法删除的集群
在某些情况下,如果您尝试删除集群,OpenShift Cluster Manager 中会出现以下错误。
Error deleting cluster CLUSTERS-MGMT-400: Failed to delete cluster <hash>: sts_user_role is not linked to your account. sts_ocm_role is linked to your organization <org number> which requires sts_user_role to be linked to your Red Hat account <account ID>.Please create a user role and link it to the account: User Account <account ID> is not authorized to perform STS cluster operations Operation ID: b0572d6e-fe54-499b-8c97-46bf6890011c
如果您尝试从 CLI 删除集群,则会出现以下错误。
E: Failed to delete cluster <hash>: sts_user_role is not linked to your account. sts_ocm_role is linked to your organization <org_number> which requires sts_user_role to be linked to your Red Hat account <account_id>.Please create a user role and link it to the account: User Account <account ID> is not authorized to perform STS cluster operations
当 user-role
被取消链接或删除,会发生此错误。
流程
运行以下命令来创建
user-role
IAM 资源:$ rosa create user-role
在可以看到创建的角色后,您可以删除集群。以下确认,角色已创建并被链接:
I: Successfully linked role ARN <user role ARN> with account <account ID>