Chapter 7. Quotas

7.1. Resource quotas per project
Copy link

A resource quota, defined by a ResourceQuota object, provides constraints that limit aggregate resource consumption per project. It can limit the quantity of objects that can be created in a project by type, as well as the total amount of compute resources and storage that may be consumed by resources in that project.

This guide describes how resource quotas work, how cluster administrators can set and manage resource quotas on a per project basis, and how developers and cluster administrators can view them.

7.1.1. Resources managed by quotas
Copy link

The following describes the set of compute resources and object types that can be managed by a quota.

Note

A pod is in a terminal state if status.phase in (Failed, Succeeded) is true.

Expand

Table 7.1. Compute resources managed by quota
Resource Name	Description
`cpu`	The sum of CPU requests across all pods in a non-terminal state cannot exceed this value. `cpu` and `requests.cpu` are the same value and can be used interchangeably.
`memory`	The sum of memory requests across all pods in a non-terminal state cannot exceed this value. `memory` and `requests.memory` are the same value and can be used interchangeably.
`ephemeral-storage`	The sum of local ephemeral storage requests across all pods in a non-terminal state cannot exceed this value. `ephemeral-storage` and `requests.ephemeral-storage` are the same value and can be used interchangeably. This resource is available only if you enabled the ephemeral storage technology preview. This feature is disabled by default.
`requests.cpu`	The sum of CPU requests across all pods in a non-terminal state cannot exceed this value. `cpu` and `requests.cpu` are the same value and can be used interchangeably.
`requests.memory`	The sum of memory requests across all pods in a non-terminal state cannot exceed this value. `memory` and `requests.memory` are the same value and can be used interchangeably.
`requests.ephemeral-storage`	The sum of ephemeral storage requests across all pods in a non-terminal state cannot exceed this value. `ephemeral-storage` and `requests.ephemeral-storage` are the same value and can be used interchangeably. This resource is available only if you enabled the ephemeral storage technology preview. This feature is disabled by default.
`limits.cpu`	The sum of CPU limits across all pods in a non-terminal state cannot exceed this value.
`limits.memory`	The sum of memory limits across all pods in a non-terminal state cannot exceed this value.
`limits.ephemeral-storage`	The sum of ephemeral storage limits across all pods in a non-terminal state cannot exceed this value. This resource is available only if you enabled the ephemeral storage technology preview. This feature is disabled by default.

Expand

Table 7.2. Storage resources managed by quota
Resource Name	Description
`requests.storage`	The sum of storage requests across all persistent volume claims in any state cannot exceed this value.
`persistentvolumeclaims`	The total number of persistent volume claims that can exist in the project.
`<storage-class-name>.storageclass.storage.k8s.io/requests.storage`	The sum of storage requests across all persistent volume claims in any state that have a matching storage class, cannot exceed this value.
`<storage-class-name>.storageclass.storage.k8s.io/persistentvolumeclaims`	The total number of persistent volume claims with a matching storage class that can exist in the project.

Expand

Table 7.3. Object counts managed by quota
Resource Name	Description
`pods`	The total number of pods in a non-terminal state that can exist in the project.
`replicationcontrollers`	The total number of ReplicationControllers that can exist in the project.
`resourcequotas`	The total number of resource quotas that can exist in the project.
`services`	The total number of services that can exist in the project.
`services.loadbalancers`	The total number of services of type `LoadBalancer` that can exist in the project.
`services.nodeports`	The total number of services of type `NodePort` that can exist in the project.
`secrets`	The total number of secrets that can exist in the project.
`configmaps`	The total number of `ConfigMap` objects that can exist in the project.
`persistentvolumeclaims`	The total number of persistent volume claims that can exist in the project.
`openshift.io/imagestreams`	The total number of imagestreams that can exist in the project.

7.1.2. Quota scopes
Copy link

Each quota can have an associated set of scopes. A quota only measures usage for a resource if it matches the intersection of enumerated scopes.

Adding a scope to a quota restricts the set of resources to which that quota can apply. Specifying a resource outside of the allowed set results in a validation error.

Expand

Scope	Description
`Terminating`	Match pods where `spec.activeDeadlineSeconds >= 0`.
`NotTerminating`	Match pods where `spec.activeDeadlineSeconds` is `nil`.
`BestEffort`	Match pods that have best effort quality of service for either `cpu` or `memory`.
`NotBestEffort`	Match pods that do not have best effort quality of service for `cpu` and `memory`.

A BestEffort scope restricts a quota to limiting the following resources:

pods

A Terminating, NotTerminating, and NotBestEffort scope restricts a quota to tracking the following resources:

pods
memory
requests.memory
limits.memory
cpu
requests.cpu
limits.cpu
ephemeral-storage
requests.ephemeral-storage
limits.ephemeral-storage

Note

Ephemeral storage requests and limits apply only if you enabled the ephemeral storage technology preview. This feature is disabled by default.

7.1.3. Quota enforcement
Copy link

After a resource quota for a project is first created, the project restricts the ability to create any new resources that may violate a quota constraint until it has calculated updated usage statistics.

After a quota is created and usage statistics are updated, the project accepts the creation of new content. When you create or modify resources, your quota usage is incremented immediately upon the request to create or modify the resource.

When you delete a resource, your quota use is decremented during the next full recalculation of quota statistics for the project. A configurable amount of time determines how long it takes to reduce quota usage statistics to their current observed system value.

If project modifications exceed a quota usage limit, the server denies the action, and an appropriate error message is returned to the user explaining the quota constraint violated, and what their currently observed usage statistics are in the system.

7.1.4. Requests versus limits
Copy link

When allocating compute resources, each container might specify a request and a limit value each for CPU, memory, and ephemeral storage. Quotas can restrict any of these values.

If the quota has a value specified for requests.cpu or requests.memory, then it requires that every incoming container make an explicit request for those resources. If the quota has a value specified for limits.cpu or limits.memory, then it requires that every incoming container specify an explicit limit for those resources.

7.1.5. Sample resource quota definitions
Copy link

core-object-counts.yaml

apiVersion: v1
kind: ResourceQuota
metadata:
  name: core-object-counts
spec:
  hard:
    configmaps: "10"


    persistentvolumeclaims: "4"


    replicationcontrollers: "20"


    secrets: "10"


    services: "10"


    services.loadbalancers: "2"

1: The total number of ConfigMap objects that can exist in the project.
2: The total number of persistent volume claims (PVCs) that can exist in the project.
3: The total number of ReplicationControllers that can exist in the project.
4: The total number of secrets that can exist in the project.
5: The total number of services that can exist in the project.
6: The total number of services of type LoadBalancer that can exist in the project.

openshift-object-counts.yaml

apiVersion: v1
kind: ResourceQuota
metadata:
  name: openshift-object-counts
spec:
  hard:
    openshift.io/imagestreams: "10"

1: The total number of imagestreams that can exist in the project.

compute-resources.yaml

apiVersion: v1
kind: ResourceQuota
metadata:
  name: compute-resources
spec:
  hard:
    pods: "4"


    requests.cpu: "1"


    requests.memory: 1Gi


    requests.ephemeral-storage: 2Gi


    limits.cpu: "2"


    limits.memory: 2Gi


    limits.ephemeral-storage: 4Gi

1: The total number of pods in a non-terminal state that can exist in the project.
2: Across all pods in a non-terminal state, the sum of CPU requests cannot exceed 1 core.
3: Across all pods in a non-terminal state, the sum of memory requests cannot exceed 1Gi.
4: Across all pods in a non-terminal state, the sum of ephemeral storage requests cannot exceed 2Gi.
5: Across all pods in a non-terminal state, the sum of CPU limits cannot exceed 2 cores.
6: Across all pods in a non-terminal state, the sum of memory limits cannot exceed 2Gi.
7: Across all pods in a non-terminal state, the sum of ephemeral storage limits cannot exceed 4Gi.

besteffort.yaml

apiVersion: v1
kind: ResourceQuota
metadata:
  name: besteffort
spec:
  hard:
    pods: "1"


  scopes:
  - BestEffort

1: The total number of pods in a non-terminal state with BestEffort quality of service that can exist in the project.
2: Restricts the quota to only matching pods that have BestEffort quality of service for either memory or CPU.

compute-resources-long-running.yaml

apiVersion: v1
kind: ResourceQuota
metadata:
  name: compute-resources-long-running
spec:
  hard:
    pods: "4"


    limits.cpu: "4"


    limits.memory: "2Gi"


    limits.ephemeral-storage: "4Gi"


  scopes:
  - NotTerminating

1: The total number of pods in a non-terminal state.
2: Across all pods in a non-terminal state, the sum of CPU limits cannot exceed this value.
3: Across all pods in a non-terminal state, the sum of memory limits cannot exceed this value.
4: Across all pods in a non-terminal state, the sum of ephemeral storage limits cannot exceed this value.
5: Restricts the quota to only matching pods where spec.activeDeadlineSeconds is set to nil. Build pods will fall under NotTerminating unless the RestartNever policy is applied.

compute-resources-time-bound.yaml

apiVersion: v1
kind: ResourceQuota
metadata:
  name: compute-resources-time-bound
spec:
  hard:
    pods: "2"


    limits.cpu: "1"


    limits.memory: "1Gi"


    limits.ephemeral-storage: "1Gi"


  scopes:
  - Terminating

1: The total number of pods in a non-terminal state.
2: Across all pods in a non-terminal state, the sum of CPU limits cannot exceed this value.
3: Across all pods in a non-terminal state, the sum of memory limits cannot exceed this value.
4: Across all pods in a non-terminal state, the sum of ephemeral storage limits cannot exceed this value.
5: Restricts the quota to only matching pods where spec.activeDeadlineSeconds >=0. For example, this quota would charge for build or deployer pods, but not long running pods like a web server or database.

storage-consumption.yaml

apiVersion: v1
kind: ResourceQuota
metadata:
  name: storage-consumption
spec:
  hard:
    persistentvolumeclaims: "10"


    requests.storage: "50Gi"


    gold.storageclass.storage.k8s.io/requests.storage: "10Gi"


    silver.storageclass.storage.k8s.io/requests.storage: "20Gi"


    silver.storageclass.storage.k8s.io/persistentvolumeclaims: "5"


    bronze.storageclass.storage.k8s.io/requests.storage: "0"


    bronze.storageclass.storage.k8s.io/persistentvolumeclaims: "0"

1: The total number of persistent volume claims in a project
2: Across all persistent volume claims in a project, the sum of storage requested cannot exceed this value.
3: Across all persistent volume claims in a project, the sum of storage requested in the gold storage class cannot exceed this value.
4: Across all persistent volume claims in a project, the sum of storage requested in the silver storage class cannot exceed this value.
5: Across all persistent volume claims in a project, the total number of claims in the silver storage class cannot exceed this value.
6: Across all persistent volume claims in a project, the sum of storage requested in the bronze storage class cannot exceed this value. When this is set to 0, it means bronze storage class cannot request storage.
7: Across all persistent volume claims in a project, the sum of storage requested in the bronze storage class cannot exceed this value. When this is set to 0, it means bronze storage class cannot create claims.

7.1.6. Creating a quota
Copy link

You can create a quota to constrain resource usage in a given project.

Procedure

Define the quota in a file.

Use the file to create the quota and apply it to a project:

$ oc create -f <file> [-n <project_name>]

For example:

$ oc create -f core-object-counts.yaml -n demoproject

7.1.6.1. Creating object count quotas
Copy link

You can create an object count quota for all OpenShift Container Platform standard namespaced resource types, such as BuildConfig, and DeploymentConfig. An object quota count places a defined quota on all standard namespaced resource types.

When using a resource quota, an object is charged against the quota if it exists in server storage. These types of quotas are useful to protect against exhaustion of storage resources.

Procedure

To configure an object count quota for a resource:

Run the following command:
```
$ oc create quota <name> \
    --hard=count/<resource>.<group>=<quota>,count/<resource>.<group>=<quota> 
```
1
1
<resource> is the name of the resource, and <group> is the API group, if applicable. Use the oc api-resources command for a list of resources and their associated API groups.
For example:
```
$ oc create quota test \
    --hard=count/deployments.extensions=2,count/replicasets.extensions=4,count/pods=3,count/secrets=4
resourcequota "test" created
```
This example limits the listed resources to the hard limit in each project in the cluster.

Verify that the quota was created:

$ oc describe quota test
Name:                         test
Namespace:                    quota
Resource                      Used  Hard
--------                      ----  ----
count/deployments.extensions  0     2
count/pods                    0     3
count/replicasets.extensions  0     4
count/secrets                 0     4

7.1.6.2. Setting resource quota for extended resources
Copy link

Overcommitment of resources is not allowed for extended resources, so you must specify requests and limits for the same extended resource in a quota. Currently, only quota items with the prefix requests. is allowed for extended resources. The following is an example scenario of how to set resource quota for the GPU resource nvidia.com/gpu.

Procedure

Determine how many GPUs are available on a node in your cluster. For example:

# oc describe node ip-172-31-27-209.us-west-2.compute.internal | egrep 'Capacity|Allocatable|gpu'
                    openshift.com/gpu-accelerator=true
Capacity:
 nvidia.com/gpu:  2
Allocatable:
 nvidia.com/gpu:  2
  nvidia.com/gpu  0           0

In this example, 2 GPUs are available.

Set a quota in the namespace nvidia. In this example, the quota is 1:

# cat gpu-quota.yaml
apiVersion: v1
kind: ResourceQuota
metadata:
  name: gpu-quota
  namespace: nvidia
spec:
  hard:
    requests.nvidia.com/gpu: 1

Create the quota:

# oc create -f gpu-quota.yaml
resourcequota/gpu-quota created

Verify that the namespace has the correct quota set:

# oc describe quota gpu-quota -n nvidia
Name:                    gpu-quota
Namespace:               nvidia
Resource                 Used  Hard
--------                 ----  ----
requests.nvidia.com/gpu  0     1

Run a pod that asks for a single GPU:

# oc create -f gpu-pod.yaml

apiVersion: v1
kind: Pod
metadata:
  generateName: gpu-pod-
  namespace: nvidia
spec:
  restartPolicy: OnFailure
  containers:
  - name: rhel7-gpu-pod
    image: rhel7
    env:
      - name: NVIDIA_VISIBLE_DEVICES
        value: all
      - name: NVIDIA_DRIVER_CAPABILITIES
        value: "compute,utility"
      - name: NVIDIA_REQUIRE_CUDA
        value: "cuda>=5.0"
    command: ["sleep"]
    args: ["infinity"]
    resources:
      limits:
        nvidia.com/gpu: 1

Verify that the pod is running:

# oc get pods
NAME              READY     STATUS      RESTARTS   AGE
gpu-pod-s46h7     1/1       Running     0          1m

Verify that the quota Used counter is correct:

# oc describe quota gpu-quota -n nvidia
Name:                    gpu-quota
Namespace:               nvidia
Resource                 Used  Hard
--------                 ----  ----
requests.nvidia.com/gpu  1     1

Attempt to create a second GPU pod in the nvidia namespace. This is technically available on the node because it has 2 GPUs:
```
# oc create -f gpu-pod.yaml
Error from server (Forbidden): error when creating "gpu-pod.yaml": pods "gpu-pod-f7z2w" is forbidden: exceeded quota: gpu-quota, requested: requests.nvidia.com/gpu=1, used: requests.nvidia.com/gpu=1, limited: requests.nvidia.com/gpu=1
```
This Forbidden error message is expected because you have a quota of 1 GPU and this pod tried to allocate a second GPU, which exceeds its quota.

7.1.7. Viewing a quota
Copy link

You can view usage statistics related to any hard limits defined in a project’s quota by navigating in the web console to the project’s Quota page.

You can also use the CLI to view quota details.

Procedure

Get the list of quotas defined in the project. For example, for a project called demoproject:

$ oc get quota -n demoproject
NAME                AGE
besteffort          11m
compute-resources   2m
core-object-counts  29m

Describe the quota you are interested in, for example the core-object-counts quota:

$ oc describe quota core-object-counts -n demoproject
Name:			core-object-counts
Namespace:		demoproject
Resource		Used	Hard
--------		----	----
configmaps		3	10
persistentvolumeclaims	0	4
replicationcontrollers	3	20
secrets			9	10
services		2	10

7.1.8. Configuring quota synchronization period
Copy link

When a set of resources are deleted, but before quota usage is restored, a user might encounter problems when attempting to reuse the resources. The synchronization time frame of resources is determined by the resource-quota-sync-period setting, which can be configured by a cluster administrator.

Adjusting the regeneration time can be helpful for creating resources and determining resource usage when automation is used.

Note

The resource-quota-sync-period setting is designed to balance system performance. Reducing the sync period can result in a heavy load on the master.

Procedure

To configure the quota synchronization period:

Edit the Kubernetes controller manager.
```
$ oc edit kubecontrollermanager cluster
```
Change the unsupportedconfigOverrides field to have the following settings, specifying the amount of time, in seconds, for the resource-quota-sync-period field:
```
  unsupportedConfigOverrides:
    extendedArguments:
      resource-quota-sync-period:
      - 60s
```

7.2. Resource quotas across multiple projects
Copy link

A multi-project quota, defined by a ClusterResourceQuota object, allows quotas to be shared across multiple projects. Resources used in each selected project are aggregated and that aggregate is used to limit resources across all the selected projects.

This guide describes how cluster administrators can set and manage resource quotas across multiple projects.

7.2.1. Selecting multiple projects during quota creation
Copy link

When creating quotas, you can select multiple projects based on annotation selection, label selection, or both.

Procedure

To select projects based on annotations, run the following command:

$ oc create clusterquota for-user \
     --project-annotation-selector openshift.io/requester=<user_name> \
     --hard pods=10 \
     --hard secrets=20

This creates the following ClusterResourceQuota object:

apiVersion: v1
kind: ClusterResourceQuota
metadata:
  name: for-user
spec:
  quota:


    hard:
      pods: "10"
      secrets: "20"
  selector:
    annotations:


      openshift.io/requester: <user_name>
    labels: null


status:
  namespaces:


  - namespace: ns-one
    status:
      hard:
        pods: "10"
        secrets: "20"
      used:
        pods: "1"
        secrets: "9"
  total:


    hard:
      pods: "10"
      secrets: "20"
    used:
      pods: "1"
      secrets: "9"

1: The ResourceQuotaSpec object that will be enforced over the selected projects.
2: A simple key-value selector for annotations.
3: A label selector that can be used to select projects.
4: A per-namespace map that describes current quota usage in each selected project.
5: The aggregate usage across all selected projects.

This multi-project quota document controls all projects requested by <user_name> using the default project request endpoint. You are limited to 10 pods and 20 secrets.

Similarly, to select projects based on labels, run this command:

$  oc create clusterresourcequota for-name \


    --project-label-selector=name=frontend \


    --hard=pods=10 --hard=secrets=20

1: Both clusterresourcequota and clusterquota are aliases of the same command. for-name is the name of the ClusterResourceQuota object.
2: To select projects by label, provide a key-value pair by using the format --project-label-selector=key=value.

This creates the following ClusterResourceQuota object definition:

apiVersion: v1
kind: ClusterResourceQuota
metadata:
  creationTimestamp: null
  name: for-name
spec:
  quota:
    hard:
      pods: "10"
      secrets: "20"
  selector:
    annotations: null
    labels:
      matchLabels:
        name: frontend

7.2.2. Viewing applicable ClusterResourceQuotas
Copy link

A project administrator is not allowed to create or modify the multi-project quota that limits his or her project, but the administrator is allowed to view the multi-project quota documents that are applied to his or her project. The project administrator can do this via the AppliedClusterResourceQuota resource.

Procedure

To view quotas applied to a project, run:

$ oc describe AppliedClusterResourceQuota

For example:

Name:   for-user
Namespace:  <none>
Created:  19 hours ago
Labels:   <none>
Annotations:  <none>
Label Selector: <null>
AnnotationSelector: map[openshift.io/requester:<user-name>]
Resource  Used  Hard
--------  ----  ----
pods        1     10
secrets     9     20

7.2.3. Selection granularity
Copy link

Because of the locking consideration when claiming quota allocations, the number of active projects selected by a multi-project quota is an important consideration. Selecting more than 100 projects under a single multi-project quota can have detrimental effects on API server responsiveness in those projects.

Chapter 7. Quotas

7.1. Resource quotas per project
Copy link

7.1.1. Resources managed by quotas
Copy link

7.1.2. Quota scopes
Copy link

7.1.3. Quota enforcement
Copy link

7.1.4. Requests versus limits
Copy link

7.1.5. Sample resource quota definitions
Copy link

7.1.6. Creating a quota
Copy link

7.1.6.1. Creating object count quotas
Copy link

7.1.6.2. Setting resource quota for extended resources
Copy link

7.1.7. Viewing a quota
Copy link

7.1.8. Configuring quota synchronization period
Copy link

7.2. Resource quotas across multiple projects
Copy link

7.2.1. Selecting multiple projects during quota creation
Copy link

7.2.2. Viewing applicable ClusterResourceQuotas
Copy link

7.2.3. Selection granularity
Copy link

Learn

Try, buy, & sell

Communities

About Red Hat

Making open source more inclusive

About Red Hat Documentation

Theme

Red Hat legal and privacy links

Red Hat legal and privacy links

Chapter 7. Quotas

7.1. Resource quotas per projectCopy linkLink copied to clipboard!

7.1.1. Resources managed by quotasCopy linkLink copied to clipboard!

7.1.2. Quota scopesCopy linkLink copied to clipboard!

7.1.3. Quota enforcementCopy linkLink copied to clipboard!

7.1.4. Requests versus limitsCopy linkLink copied to clipboard!

7.1.5. Sample resource quota definitionsCopy linkLink copied to clipboard!

7.1.6. Creating a quotaCopy linkLink copied to clipboard!

7.1.6.1. Creating object count quotasCopy linkLink copied to clipboard!

7.1.6.2. Setting resource quota for extended resourcesCopy linkLink copied to clipboard!

7.1.7. Viewing a quotaCopy linkLink copied to clipboard!

7.1.8. Configuring quota synchronization periodCopy linkLink copied to clipboard!

7.2. Resource quotas across multiple projectsCopy linkLink copied to clipboard!

7.2.1. Selecting multiple projects during quota creationCopy linkLink copied to clipboard!

7.2.2. Viewing applicable ClusterResourceQuotasCopy linkLink copied to clipboard!

7.2.3. Selection granularityCopy linkLink copied to clipboard!

Learn

Try, buy, & sell

Communities

About Red Hat

Making open source more inclusive

About Red Hat Documentation

Theme

Red Hat legal and privacy links

Red Hat legal and privacy links

7.1. Resource quotas per project
Copy link

7.1.1. Resources managed by quotas
Copy link

7.1.2. Quota scopes
Copy link

7.1.3. Quota enforcement
Copy link

7.1.4. Requests versus limits
Copy link

7.1.5. Sample resource quota definitions
Copy link

7.1.6. Creating a quota
Copy link

7.1.6.1. Creating object count quotas
Copy link

7.1.6.2. Setting resource quota for extended resources
Copy link

7.1.7. Viewing a quota
Copy link

7.1.8. Configuring quota synchronization period
Copy link

7.2. Resource quotas across multiple projects
Copy link

7.2.1. Selecting multiple projects during quota creation
Copy link

7.2.2. Viewing applicable ClusterResourceQuotas
Copy link

7.2.3. Selection granularity
Copy link