Chapter 3. Automatically scaling pods with the Custom Metrics Autoscaler Operator

1: Specifies Prometheus as the trigger type.
2: Specifies the address of the Prometheus server. This example uses OpenShift Container Platform monitoring.
3: Optional: Specifies the namespace of the object you want to scale. This parameter is mandatory if using OpenShift Container Platform monitoring as a source for the metrics.
4: Specifies the name to identify the metric in the external.metrics.k8s.io API. If you are using more than one trigger, all metric names must be unique.
5: Specifies the value that triggers scaling. Must be specified as a quoted string value.
6: Specifies the Prometheus query to use.
7: Specifies the authentication method to use. Prometheus scalers support bearer authentication (bearer), basic authentication (basic), or TLS authentication (tls). You configure the specific authentication parameters in a trigger authentication, as discussed in a following section. As needed, you can also use a secret.
8: Optional: Passes the X-Scope-OrgID header to multi-tenant Cortex or Mimir storage for Prometheus. This parameter is required only with multi-tenant Prometheus storage, to indicate which data Prometheus should return.
9: Optional: Specifies how the trigger should proceed if the Prometheus target is lost.
  If true, the trigger continues to operate if the Prometheus target is lost. This is the default behavior.
  If false, the trigger returns an error if the Prometheus target is lost.
10: Optional: Specifies whether the certificate check should be skipped. For example, you might skip the check if you are running in a test environment and using self-signed certificates at the Prometheus endpoint.
  If false, the certificate check is performed. This is the default behavior.
  If true, the certificate check is not performed.
  Important: Skipping the check is not recommended.
11: Optional: Specifies an HTTP request timeout in milliseconds for the HTTP client used by this Prometheus trigger. This value overrides any global timeout setting.

3.4.1.1. Configuring GPU-based autoscaling with Prometheus and DCGM metrics

You can use the Custom Metrics Autoscaler with NVIDIA Data Center GPU Manager (DCGM) metrics to scale workloads based on GPU utilization. This is particularly useful for AI and machine learning workloads that require GPU resources.

Example scaled object with a Prometheus target for GPU-based autoscaling

apiVersion: keda.sh/v1alpha1
kind: ScaledObject
metadata:
  name: gpu-scaledobject
  namespace: my-namespace
spec:
  scaleTargetRef:
    kind: Deployment
    name: gpu-deployment
  minReplicaCount: 1 # <1>
  maxReplicaCount: 5 # <2>
  triggers:
  - type: prometheus
    metadata:
      serverAddress: https://thanos-querier.openshift-monitoring.svc.cluster.local:9092
      namespace: my-namespace
      metricName: gpu_utilization
      threshold: '90' # <3>
      query: SUM(DCGM_FI_DEV_GPU_UTIL{instance=~".+", gpu=~".+"}) # <4>
      authModes: bearer
    authenticationRef:
      name: keda-trigger-auth-prometheus

1: Specifies the minimum number of replicas to maintain. For GPU workloads, this should not be set to 0 to ensure that metrics continue to be collected.
2: Specifies the maximum number of replicas allowed during scale-up operations.
3: Specifies the GPU utilization percentage threshold that triggers scaling. When the average GPU utilization exceeds 90%, the autoscaler scales up the deployment.
4: Specifies a Prometheus query using NVIDIA DCGM metrics to monitor GPU utilization across all GPU devices. The DCGM_FI_DEV_GPU_UTIL metric provides GPU utilization percentages.
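
The relationship between the summed query result, the threshold, and the replica limits can be sketched as follows. This is a minimal illustration, assuming that the trigger treats the threshold as a per-replica average, so that the query result is divided by the threshold and rounded up. The function name desired_replicas is hypothetical, and the sketch ignores the tolerance and stabilization behavior of the horizontal pod autoscaler.

```python
import math


def desired_replicas(metric_value: float, threshold: float,
                     min_replicas: int, max_replicas: int) -> int:
    """Estimate the replica count for an average-value external metric.

    Assumption: the value returned by the query is divided by the
    threshold and rounded up, then clamped to the configured bounds.
    """
    wanted = math.ceil(metric_value / threshold)
    return max(min_replicas, min(max_replicas, wanted))


# Three GPUs reporting 95%, 90%, and 85% utilization sum to 270.
print(desired_replicas(270, 90, min_replicas=1, max_replicas=5))  # 3
# A sum of 900 would call for 10 replicas, but maxReplicaCount caps it at 5.
print(desired_replicas(900, 90, min_replicas=1, max_replicas=5))  # 5
```

Because the query sums utilization over all GPU devices, the per-replica average only matches the per-GPU utilization when each replica uses one GPU.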

3.4.1.2. Configuring the custom metrics autoscaler to use OpenShift Container Platform monitoring

You can use the installed OpenShift Container Platform Prometheus monitoring as a source for the metrics used by the custom metrics autoscaler. However, there are some additional configurations you must perform.

For your scaled objects to be able to read the OpenShift Container Platform Prometheus metrics, you must use a trigger authentication or a cluster trigger authentication in order to provide the authentication information required. The following procedure differs depending on which trigger authentication method you use. For more information on trigger authentications, see "Understanding custom metrics autoscaler trigger authentications".

Note

These steps are not required for an external Prometheus source.

You must perform the following tasks, as described in this section:

Create a service account.
Create the trigger authentication.
Create a role.
Add that role to the service account.
Reference the token in the trigger authentication object used by Prometheus.

Prerequisites

OpenShift Container Platform monitoring must be installed.
Monitoring of user-defined workloads must be enabled in OpenShift Container Platform monitoring, as described in the Creating a user-defined workload monitoring config map section.
The Custom Metrics Autoscaler Operator must be installed.

Procedure

Change to the appropriate project:
```
$ oc project <project_name>
```
Replace <project_name> with one of the following projects:
If you are using a trigger authentication, specify the project with the object you want to scale.
If you are using a cluster trigger authentication, specify the openshift-keda project.
Create a service account if your cluster does not have one:
1. Create a service account object by using the following command:
  $ oc create serviceaccount thanos
  In this command, thanos is the name of the service account.
Create a trigger authentication with the service account token:
1. Create a YAML file similar to the following:
  apiVersion: keda.sh/v1alpha1
  kind: <authentication_method> # <1>
  metadata:
    name: keda-trigger-auth-prometheus
  spec:
    boundServiceAccountToken: # <2>
    - parameter: bearerToken # <3>
      serviceAccountName: thanos # <4>

  1: Specifies one of the following trigger authentication methods:
    If you are using a trigger authentication, specify TriggerAuthentication. This example configures a trigger authentication.
    If you are using a cluster trigger authentication, specify ClusterTriggerAuthentication.
  2: Specifies that this trigger authentication uses a bound service account token for authorization when connecting to the metrics endpoint.
  3: Specifies the authentication parameter to supply by using the token. Here, the example uses bearer authentication.
  4: Specifies the name of the service account to use.
2. Create the CR object:
  $ oc create -f <file-name>.yaml

Create a role for reading Thanos metrics:

Create a YAML file with the following parameters:

apiVersion: rbac.authorization.k8s.io/v1
kind: Role
metadata:
  name: thanos-metrics-reader
rules:
- apiGroups:
  - ""
  resources:
  - pods
  verbs:
  - get
- apiGroups:
  - metrics.k8s.io
  resources:
  - pods
  - nodes
  verbs:
  - get
  - list
  - watch

Create the CR object:
```
$ oc create -f <file-name>.yaml
```

Create a role binding for reading Thanos metrics:
1. Create a YAML file similar to the following:
  apiVersion: rbac.authorization.k8s.io/v1
  kind: <binding_type> # <1>
  metadata:
    name: thanos-metrics-reader # <2>
    namespace: my-project # <3>
  roleRef:
    apiGroup: rbac.authorization.k8s.io
    kind: Role
    name: thanos-metrics-reader
  subjects:
  - kind: ServiceAccount
    name: thanos # <4>
    namespace: <namespace_name> # <5>

  1: Specifies one of the following object types:
    If you are using a trigger authentication, specify RoleBinding.
    If you are using a cluster trigger authentication, specify ClusterRoleBinding.
  2: Specifies the name of the role you created.
  3: Specifies one of the following projects:
    If you are using a trigger authentication, specify the project with the object you want to scale.
    If you are using a cluster trigger authentication, specify the openshift-keda project.
  4: Specifies the name of the service account to bind to the role.
  5: Specifies the project where you previously created the service account.
2. Create the CR object:
  $ oc create -f <file-name>.yaml

You can now deploy a scaled object or scaled job to enable autoscaling for your application, as described in "Understanding how to add custom metrics autoscalers". To use OpenShift Container Platform monitoring as the source, in the trigger, or scaler, you must include the following parameters:

triggers.type must be prometheus
triggers.metadata.serverAddress must be https://thanos-querier.openshift-monitoring.svc.cluster.local:9092
triggers.metadata.authModes must be bearer
triggers.metadata.namespace must be set to the namespace of the object to scale
triggers.authenticationRef must point to the trigger authentication resource specified in the previous step
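
The parameter requirements above can be expressed as a small checker. This is only a sketch: the function name check_monitoring_trigger and the metric name, threshold, and query in the sample trigger are hypothetical values chosen for illustration, and the trigger is represented as a plain Python dictionary rather than a real scaled object.

```python
THANOS_ADDRESS = "https://thanos-querier.openshift-monitoring.svc.cluster.local:9092"


def check_monitoring_trigger(trigger: dict) -> list:
    """Return a list of problems found in a trigger definition (empty if none)."""
    problems = []
    metadata = trigger.get("metadata", {})
    if trigger.get("type") != "prometheus":
        problems.append("type must be prometheus")
    if metadata.get("serverAddress") != THANOS_ADDRESS:
        problems.append("serverAddress must point to the Thanos querier")
    if metadata.get("authModes") != "bearer":
        problems.append("authModes must be bearer")
    if not metadata.get("namespace"):
        problems.append("namespace must be set")
    if not trigger.get("authenticationRef", {}).get("name"):
        problems.append("authenticationRef must name a trigger authentication")
    return problems


# Hypothetical trigger; metricName, threshold, and query are placeholders.
trigger = {
    "type": "prometheus",
    "metadata": {
        "serverAddress": THANOS_ADDRESS,
        "namespace": "my-namespace",
        "metricName": "http_requests_total",
        "threshold": "5",
        "query": "sum(rate(http_requests_total[1m]))",
        "authModes": "bearer",
    },
    "authenticationRef": {"name": "keda-trigger-auth-prometheus"},
}
print(check_monitoring_trigger(trigger))  # []
```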

3.4.2. Understanding the CPU trigger

You can scale pods based on CPU metrics. This trigger uses cluster metrics as the source for metrics.

The custom metrics autoscaler scales the pods associated with an object to maintain the CPU usage that you specify. The autoscaler increases or decreases the number of replicas between the minimum and maximum numbers to maintain the specified CPU utilization across all pods. The CPU trigger considers the CPU utilization of the entire pod. If the pod has multiple containers, the CPU trigger considers the total CPU utilization of all containers in the pod.

Note

This trigger cannot be used with the ScaledJob custom resource.
When using a CPU trigger to scale an object, the object does not scale to 0, even if you are using multiple triggers.

Example scaled object with a CPU target

apiVersion: keda.sh/v1alpha1
kind: ScaledObject
metadata:
  name: cpu-scaledobject
  namespace: my-namespace
spec:
# ...
  triggers:
  - type: cpu # <1>
    metricType: Utilization # <2>
    metadata:
      value: '60' # <3>
  minReplicaCount: 1 # <4>

1: Specifies CPU as the trigger type.
2: Specifies the type of metric to use, either Utilization or AverageValue.
3: Specifies the value that triggers scaling. Must be specified as a quoted string value.
  When using Utilization, the target value is the average of the resource metrics across all relevant pods, represented as a percentage of the requested value of the resource for the pods.
  When using AverageValue, the target value is the average of the metrics across all relevant pods.
4: Specifies the minimum number of replicas when scaling down. For a CPU trigger, enter a value of 1 or greater, because the HPA cannot scale to zero if you are using only CPU metrics.
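
As a rough illustration of how a Utilization target drives the replica count, the following sketch applies the standard horizontal pod autoscaler ratio (current replicas multiplied by current utilization, divided by the target). The function name and the millicore figures are hypothetical, every pod is assumed to request the same amount of CPU, and the autoscaler's tolerance and stabilization windows are ignored.

```python
import math


def cpu_desired_replicas(current_replicas: int, usage_millicores: list,
                         request_millicores: int, target_percent: int) -> int:
    """Apply the HPA ratio formula to the average CPU utilization of the pods."""
    average_usage = sum(usage_millicores) / len(usage_millicores)
    utilization = 100 * average_usage / request_millicores
    # A CPU-only trigger cannot scale to zero, so keep at least one replica.
    return max(1, math.ceil(current_replicas * utilization / target_percent))


# Two pods that each request 500m and use 450m run at 90% utilization.
# With a target of 60%, the formula asks for ceil(2 * 90 / 60) = 3 replicas.
print(cpu_desired_replicas(2, [450, 450], 500, 60))  # 3
# At 20% utilization the same formula settles on a single replica.
print(cpu_desired_replicas(2, [100, 100], 500, 60))  # 1
```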

3.4.3. Understanding the memory trigger

You can scale pods based on memory metrics. This trigger uses cluster metrics as the source for metrics.

The custom metrics autoscaler scales the pods associated with an object to maintain the average memory usage that you specify. The autoscaler increases and decreases the number of replicas between the minimum and maximum numbers to maintain the specified memory utilization across all pods. The memory trigger considers the memory utilization of the entire pod. If the pod has multiple containers, the memory utilization is the sum of all of the containers.

Note

This trigger cannot be used with the ScaledJob custom resource.
When using a memory trigger to scale an object, the object does not scale to 0, even if you are using multiple triggers.

Example scaled object with a memory target

apiVersion: keda.sh/v1alpha1
kind: ScaledObject
metadata:
  name: memory-scaledobject
  namespace: my-namespace
spec:
# ...
  triggers:
  - type: memory # <1>
    metricType: Utilization # <2>
    metadata:
      value: '60' # <3>
      containerName: api # <4>

1: Specifies memory as the trigger type.
2: Specifies the type of metric to use, either Utilization or AverageValue.
3: Specifies the value that triggers scaling. Must be specified as a quoted string value.
  When using Utilization, the target value is the average of the resource metrics across all relevant pods, represented as a percentage of the requested value of the resource for the pods.
  When using AverageValue, the target value is the average of the metrics across all relevant pods.
4: Optional: Specifies an individual container to base scaling on, so that only the memory utilization of that container is considered, rather than the entire pod. In this example, scaling is based on only the container named api.
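
The difference between pod-level and container-level utilization can be sketched as follows. This is an illustration only: the function name, the container names, and the MiB figures are hypothetical, and pod-level utilization is assumed to be the summed usage of all containers divided by their summed requests.

```python
from typing import Dict, Optional


def memory_utilization(usage_mib: Dict[str, int], request_mib: Dict[str, int],
                       container_name: Optional[str] = None) -> float:
    """Return memory utilization as a percentage of the requested memory."""
    if container_name is not None:
        # containerName set: only that container is considered.
        return 100 * usage_mib[container_name] / request_mib[container_name]
    # No containerName: the whole pod is considered.
    return 100 * sum(usage_mib.values()) / sum(request_mib.values())


usage = {"api": 300, "sidecar": 50}      # hypothetical usage in MiB
requests = {"api": 400, "sidecar": 100}  # hypothetical requests in MiB

print(memory_utilization(usage, requests))         # 70.0
print(memory_utilization(usage, requests, "api"))  # 75.0
```

With a target value of 60, both readings are above the target, but the container-level reading is further above it and would drive a larger scale-up.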

3.4.4. Understanding the Kafka trigger

You can scale pods based on an Apache Kafka topic or other services that support the Kafka protocol. The custom metrics autoscaler does not scale higher than the number of Kafka partitions, unless you set the allowIdleConsumers parameter to true in the scaled object or scaled job.

Note

If the number of consumer groups exceeds the number of partitions in a topic, the extra consumer groups remain idle. To avoid this, by default the number of replicas does not exceed:

The number of partitions on a topic, if a topic is specified
The number of partitions of all topics in the consumer group, if no topic is specified
The maxReplicaCount specified in scaled object or scaled job CR

You can use the allowIdleConsumers parameter to disable these default behaviors.
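
The replica limits described in this note can be sketched as a small calculation. This is a simplified model, not the scaler's actual implementation: the function name is hypothetical, the replica count is assumed to be the total lag divided by the lag threshold and rounded up, and maxReplicaCount is assumed to remain in force even when idle consumers are allowed.

```python
import math


def kafka_desired_replicas(total_lag: int, lag_threshold: int, partitions: int,
                           max_replicas: int,
                           allow_idle_consumers: bool = False) -> int:
    """Estimate the replica count for a Kafka trigger under the default caps."""
    wanted = math.ceil(total_lag / lag_threshold)
    if not allow_idle_consumers:
        # By default, do not run more consumers than there are partitions.
        wanted = min(wanted, partitions)
    return min(wanted, max_replicas)


# A lag of 120 with a threshold of 10 asks for 12 replicas.
print(kafka_desired_replicas(120, 10, partitions=8, max_replicas=20))  # 8
print(kafka_desired_replicas(120, 10, partitions=8, max_replicas=20,
                             allow_idle_consumers=True))               # 12
```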

Example scaled object with a Kafka target

apiVersion: keda.sh/v1alpha1
kind: ScaledObject
metadata:
  name: kafka-scaledobject
  namespace: my-namespace
spec:
# ...
  triggers:
  - type: kafka # <1>
    metadata:
      topic: my-topic # <2>
      bootstrapServers: my-cluster-kafka-bootstrap.openshift-operators.svc:9092 # <3>
      consumerGroup: my-group # <4>
      lagThreshold: '10' # <5>
      activationLagThreshold: '5' # <6>
      offsetResetPolicy: latest # <7>
      allowIdleConsumers: true # <8>
      scaleToZeroOnInvalidOffset: false # <9>
      excludePersistentLag: false # <10>
      version: '1.0.0' # <11>
      partitionLimitation: '1,2,10-20,31' # <12>
      tls: enable # <13>

1: Specifies Kafka as the trigger type.
2: Specifies the name of the Kafka topic on which Kafka is processing the offset lag.
3: Specifies a comma-separated list of Kafka brokers to connect to.
4: Specifies the name of the Kafka consumer group used for checking the offset on the topic and processing the related lag.
5: Optional: Specifies the average target value that triggers scaling. Must be specified as a quoted string value. The default is 5.
6: Optional: Specifies the target value for the activation phase. Must be specified as a quoted string value.
7: Optional: Specifies the Kafka offset reset policy for the Kafka consumer. The available values are: latest and earliest. The default is latest.
8: Optional: Specifies whether the number of Kafka replicas can exceed the number of partitions on a topic.
  If true, the number of Kafka replicas can exceed the number of partitions on a topic. This allows for idle Kafka consumers.
  If false, the number of Kafka replicas cannot exceed the number of partitions on a topic. This is the default.
9: Specifies how the trigger behaves when a Kafka partition does not have a valid offset.
  If true, the consumers are scaled to zero for that partition.
  If false, the scaler keeps a single consumer for that partition. This is the default.
10: Optional: Specifies whether the trigger includes or excludes partition lag for partitions whose current offset is the same as the current offset of the previous polling cycle.
  If true, the scaler excludes partition lag in these partitions.
  If false, the trigger includes all consumer lag in all partitions. This is the default.
11: Optional: Specifies the version of your Kafka brokers. Must be specified as a quoted string value. The default is 1.0.0.
12: Optional: Specifies a comma-separated list of partition IDs to scope the scaling on. If set, only the listed IDs are considered when calculating lag. Must be specified as a quoted string value. The default is to consider all partitions.
13: Optional: Specifies whether to use TLS client authentication for Kafka. The default is disable. For information on configuring TLS, see "Understanding custom metrics autoscaler trigger authentications".
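
To show how a partitionLimitation value such as '1,2,10-20,31' narrows the lag calculation, the following sketch expands the string into a set of partition IDs and sums the lag for only those partitions. The function names and the lag figures are hypothetical, and ranges such as 10-20 are assumed to include both endpoints.

```python
def parse_partition_limitation(value: str) -> set:
    """Expand a string such as '1,2,10-20,31' into a set of partition IDs."""
    partitions = set()
    for part in value.split(","):
        part = part.strip()
        if not part:
            continue
        if "-" in part:
            start, end = (int(p) for p in part.split("-", 1))
            partitions.update(range(start, end + 1))  # assumed inclusive
        else:
            partitions.add(int(part))
    return partitions


def scoped_lag(lag_by_partition: dict, limitation: str) -> int:
    """Sum the lag of only the partitions named in the limitation string."""
    allowed = parse_partition_limitation(limitation)
    return sum(lag for pid, lag in lag_by_partition.items() if pid in allowed)


ids = parse_partition_limitation("1,2,10-20,31")
print(len(ids))  # 14

# Hypothetical lag per partition ID; partition 3 is outside the limitation.
print(scoped_lag({1: 4, 3: 100, 15: 6, 31: 2}, "1,2,10-20,31"))  # 12
```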

3.4.5. Understanding the Cron trigger

You can scale pods based on a time range.

When the time range starts, the custom metrics autoscaler scales the pods associated with an object from the configured minimum number of pods to the specified number of desired pods. At the end of the time range, the pods are scaled back to the configured minimum. The time period must be configured in cron format.

The following example scales the pods associated with this scaled object from 0 to 100 from 6:00 AM to 6:30 PM India Standard Time.

Example scaled object with a Cron trigger

apiVersion: keda.sh/v1alpha1
kind: ScaledObject
metadata:
  name: cron-scaledobject
  namespace: default
spec:
  scaleTargetRef:
    name: my-deployment
  minReplicaCount: 0 # <1>
  maxReplicaCount: 100 # <2>
  cooldownPeriod: 300
  triggers:
  - type: cron # <3>
    metadata:
      timezone: Asia/Kolkata # <4>
      start: "0 6 * * *" # <5>
      end: "30 18 * * *" # <6>
      desiredReplicas: "100" # <7>

1: Specifies the minimum number of pods to scale down to at the end of the time frame.
2: Specifies the maximum number of replicas when scaling up. This value should be the same as desiredReplicas. The default is 100.
3: Specifies a Cron trigger.
4: Specifies the timezone for the time frame. This value must be from the IANA Time Zone Database.
5: Specifies the start of the time frame.
6: Specifies the end of the time frame.
7: Specifies the number of pods to scale to between the start and end of the time frame. This value should be the same as maxReplicaCount.
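
The effect of the time frame in this example can be sketched as a simple window check. This is an illustration under stated assumptions: it handles only a daily window such as 06:00 to 18:30, it models Asia/Kolkata as a fixed UTC+05:30 offset rather than reading the IANA database, the function name is hypothetical, and the cooldownPeriod delay is not modeled.

```python
from datetime import datetime, time, timedelta, timezone

# Fixed offset standing in for the Asia/Kolkata time zone (no daylight saving).
IST = timezone(timedelta(hours=5, minutes=30))


def cron_replicas(now: datetime, start: time, end: time,
                  desired: int, minimum: int) -> int:
    """Return the replica count for a daily window such as 06:00 to 18:30."""
    local = now.astimezone(IST).time()
    return desired if start <= local < end else minimum


noon_ist = datetime(2024, 1, 15, 6, 30, tzinfo=timezone.utc)     # 12:00 IST
evening_ist = datetime(2024, 1, 15, 15, 0, tzinfo=timezone.utc)  # 20:30 IST
print(cron_replicas(noon_ist, time(6, 0), time(18, 30), 100, 0))     # 100
print(cron_replicas(evening_ist, time(6, 0), time(18, 30), 100, 0))  # 0
```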

3.4.6. Understanding the Kubernetes workload trigger

You can scale pods based on the number of pods matching a specific label selector.

The Custom Metrics Autoscaler Operator tracks the number of pods with a specific label that are in the same namespace, then calculates a relation based on the number of labeled pods to the pods for the scaled object. Using this relation, the Custom Metrics Autoscaler Operator scales the object according to the scaling policy in the ScaledObject or ScaledJob specification.

The pod count includes pods with a Succeeded or Failed phase.

For example, if you have a frontend deployment and a backend deployment, you can use a kubernetes-workload trigger to scale the backend deployment based on the number of frontend pods. If the number of frontend pods goes up, the Operator scales the backend pods to maintain the specified ratio. In this example, the scaled object sets a relation of 0.5, where the relation is the number of pods that match the selector divided by the number of scaled workload pods. If there are 10 pods with the app=frontend pod selector, the Operator scales the backend pods to 20 in order to maintain the 0.5 ratio (10 / 20 = 0.5).

Example scaled object with a Kubernetes workload trigger

apiVersion: keda.sh/v1alpha1
kind: ScaledObject
metadata:
  name: workload-scaledobject
  namespace: my-namespace
spec:
  triggers:
  - type: kubernetes-workload # <1>
    metadata:
      podSelector: 'app=frontend' # <2>
      value: '0.5' # <3>
      activationValue: '3.1' # <4>

1: Specifies a Kubernetes workload trigger.
2: Specifies one or more pod selectors and/or set-based selectors, separated with commas, to use to get the pod count.
3: Specifies the target relation between the scaled workload and the number of pods that match the selector. The relation is calculated by using the following formula:
  relation = (pods that match the selector) / (scaled workload pods)
4: Optional: Specifies the target value for the scaler activation phase. The default is 0.
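
The relation formula can be rearranged to estimate the number of scaled workload pods, as in the following sketch. The function names are hypothetical, the result is assumed to be rounded up to a whole pod, and activation is assumed to occur when the number of matching pods is greater than activationValue.

```python
import math


def workload_desired_replicas(matching_pods: int, relation: float) -> int:
    """Solve relation = matching_pods / scaled_pods for scaled_pods, rounding up."""
    return math.ceil(matching_pods / relation)


def is_active(matching_pods: int, activation_value: float = 0.0) -> bool:
    """Assumed activation rule: the pod count must exceed activationValue."""
    return matching_pods > activation_value


# Taking the formula literally, 10 matching pods and a relation of 0.5
# give 10 / 0.5 = 20 scaled workload pods.
print(workload_desired_replicas(10, 0.5))  # 20
# A relation of 2 gives one scaled pod for every two matching pods.
print(workload_desired_replicas(10, 2))    # 5
# With activationValue 3.1, three matching pods do not activate the scaler.
print(is_active(3, 3.1), is_active(4, 3.1))  # False True
```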

3.5. Understanding custom metrics autoscaler trigger authentications

A trigger authentication allows you to include authentication information in a scaled object or a scaled job that can be used by the associated containers. You can use trigger authentications to pass OpenShift Container Platform secrets, platform-native pod authentication mechanisms, environment variables, and so on.

You define a TriggerAuthentication object in the same namespace as the object that you want to scale. That trigger authentication can be used only by objects in that namespace.

Alternatively, to share credentials between objects in multiple namespaces, you can create a ClusterTriggerAuthentication object that can be used across all namespaces.

Trigger authentications and cluster trigger authentications use the same configuration. However, a cluster trigger authentication requires an additional kind parameter in the authentication reference of the scaled object.
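
The difference in the authentication reference can be sketched as follows. This is a minimal illustration that models the authenticationRef field of a trigger as a Python dictionary; the helper name authentication_ref is hypothetical, and the kind value ClusterTriggerAuthentication is assumed from the object type described above.

```python
def authentication_ref(name: str, cluster_scoped: bool = False) -> dict:
    """Build the authenticationRef mapping for a trigger."""
    ref = {"name": name}
    if cluster_scoped:
        # Only a cluster trigger authentication needs the extra kind parameter.
        ref["kind"] = "ClusterTriggerAuthentication"
    return ref


print(authentication_ref("keda-trigger-auth-prometheus"))
# {'name': 'keda-trigger-auth-prometheus'}
print(authentication_ref("keda-trigger-auth-prometheus", cluster_scoped=True))
# {'name': 'keda-trigger-auth-prometheus', 'kind': 'ClusterTriggerAuthentication'}
```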

Example trigger authentication that uses a bound service account token

kind: TriggerAuthentication
apiVersion: keda.sh/v1alpha1
metadata:
  name: secret-triggerauthentication
  namespace: my-namespace # <1>
spec:
  boundServiceAccountToken: # <2>
    - parameter: bearerToken
      serviceAccountName: thanos # <3>

1: Specifies the namespace of the object you want to scale.
2: Specifies that this trigger authentication uses a bound service account token for authorization when connecting to the metrics endpoint.
3: Specifies the name of the service account to use.

Example cluster trigger authentication that uses a bound service account token

kind: ClusterTriggerAuthentication
apiVersion: keda.sh/v1alpha1
metadata:
  name: bound-service-account-token-triggerauthentication # <1>
spec:
  boundServiceAccountToken: # <2>
    - parameter: bearerToken
      serviceAccountName: thanos # <3>

1: Specifies the name of the cluster trigger authentication. Because a cluster trigger authentication can be used across all namespaces, no namespace is specified.
2: Specifies that this cluster trigger authentication uses a bound service account token for authorization when connecting to the metrics endpoint.
3: Specifies the name of the service account to use.

Example trigger authentication that uses a secret for Basic authentication

kind: TriggerAuthentication
apiVersion: keda.sh/v1alpha1
metadata:
  name: secret-triggerauthentication
  namespace: my-namespace # <1>
spec:
  secretTargetRef: # <2>
  - parameter: username # <3>
    name: my-basic-secret # <4>
    key: username # <5>
  - parameter: password
    name: my-basic-secret
    key: password

1: Specifies the namespace of the object you want to scale.
2: Specifies that this trigger authentication uses a secret for authorization when connecting to the metrics endpoint.
3: Specifies the authentication parameter to supply by using the secret.
4: Specifies the name of the secret to use. See the following example secret for Basic authentication.
5: Specifies the key in the secret to use with the specified parameter.

Example secret for Basic authentication

apiVersion: v1
kind: Secret
metadata:
  name: my-basic-secret
  namespace: my-namespace
data:
  username: "dXNlcm5hbWU=" # <1>
  password: "cGFzc3dvcmQ="

1: User name and password to supply to the trigger authentication. The values in the data stanza must be base-64 encoded.
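
The base-64 values in the data stanza can be produced with any standard encoder. The following sketch uses the Python standard library; the helper name encode_secret_value is hypothetical, and the plain-text values username and password are inferred by decoding the strings in the example secret.

```python
import base64


def encode_secret_value(value: str) -> str:
    """Base-64 encode a plain-text value for the data stanza of a secret."""
    return base64.b64encode(value.encode("utf-8")).decode("ascii")


print(encode_secret_value("username"))  # dXNlcm5hbWU=
print(encode_secret_value("password"))  # cGFzc3dvcmQ=
```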

Example trigger authentication that uses a secret for CA details

kind: TriggerAuthentication
apiVersion: keda.sh/v1alpha1
metadata:
  name: secret-triggerauthentication
  namespace: my-namespace # <1>
spec:
  secretTargetRef: # <2>
    - parameter: key # <3>
      name: my-secret # <4>
      key: client-key.pem # <5>
    - parameter: ca # <6>
      name: my-secret # <7>
      key: ca-cert.pem # <8>

1: Specifies the namespace of the object you want to scale.
2: Specifies that this trigger authentication uses a secret for authorization when connecting to the metrics endpoint.
3: Specifies the type of authentication to use.
4: Specifies the name of the secret to use.
5: Specifies the key in the secret to use with the specified parameter.
6: Specifies the authentication parameter for a custom CA when connecting to the metrics endpoint.
7: Specifies the name of the secret to use. See the following example secret with certificate authority (CA) details.
8: Specifies the key in the secret to use with the specified parameter.

Example secret with certificate authority (CA) details

apiVersion: v1
kind: Secret
metadata:
  name: my-secret
  namespace: my-namespace
data:
  ca-cert.pem: LS0tLS1CRUdJTiBDRVJUSUZJQ0FURS0tLS0... # <1>
  client-cert.pem: LS0tLS1CRUdJTiBDRVJUSUZJQ0FURS0... # <2>
  client-key.pem: LS0tLS1CRUdJTiBQUklWQVRFIEtFWS0t...

1: Specifies the TLS CA Certificate for authentication of the metrics endpoint. The value must be base-64 encoded.
2: Specifies the TLS certificates and key for TLS client authentication. The values must be base-64 encoded.

Example trigger authentication that uses a bearer token

kind: TriggerAuthentication
apiVersion: keda.sh/v1alpha1
metadata:
  name: token-triggerauthentication
  namespace: my-namespace # <1>
spec:
  secretTargetRef: # <2>
  - parameter: bearerToken # <3>
    name: my-secret # <4>
    key: bearerToken # <5>

1: Specifies the namespace of the object you want to scale.
2: Specifies that this trigger authentication uses a secret for authorization when connecting to the metrics endpoint.
3: Specifies the type of authentication to use.
4: Specifies the name of the secret to use. See the following example secret for a bearer token.
5: Specifies the key in the secret to use with the specified parameter.

Example secret for a bearer token

apiVersion: v1
kind: Secret
metadata:
  name: my-secret
  namespace: my-namespace
data:
  bearerToken: "<bearer_token>" # <1>

1: Specifies a bearer token to use with bearer authentication. The value must be base-64 encoded.

Example trigger authentication that uses an environment variable

kind: TriggerAuthentication
apiVersion: keda.sh/v1alpha1
metadata:
  name: env-var-triggerauthentication
  namespace: my-namespace # <1>
spec:
  env: # <2>
  - parameter: access_key # <3>
    name: ACCESS_KEY # <4>
    containerName: my-container # <5>

1: Specifies the namespace of the object you want to scale.
2: Specifies that this trigger authentication uses environment variables for authorization when connecting to the metrics endpoint.
3: Specifies the parameter to set with this variable.
4: Specifies the name of the environment variable.
5: Optional: Specifies a container that requires authentication. The container must be in the same resource as referenced by scaleTargetRef in the scaled object.

Example trigger authentication that uses pod authentication providers

kind: TriggerAuthentication
apiVersion: keda.sh/v1alpha1
metadata:
  name: pod-id-triggerauthentication
  namespace: my-namespace # <1>
spec:
  podIdentity: # <2>
    provider: aws-eks # <3>

1: Specifies the namespace of the object you want to scale.
2: Specifies that this trigger authentication uses a platform-native pod authentication when connecting to the metrics endpoint.
3: Specifies a pod identity. Supported values are none, azure, gcp, aws-eks, or aws-kiam. The default is none.

Additional resources

3.5.1. Using trigger authentications

To use a trigger authentication or a cluster trigger authentication, create the authentication as a custom resource, then add a reference to it in a scaled object or scaled job.

Prerequisites

The Custom Metrics Autoscaler Operator must be installed.
If you are using a bound service account token, the service account must exist.

If you are using a bound service account token, a role-based access control (RBAC) object that enables the Custom Metrics Autoscaler Operator to request service account tokens from the service account must exist.

apiVersion: rbac.authorization.k8s.io/v1
kind: Role
metadata:
  name: keda-operator-token-creator
  namespace: <namespace_name> 
rules:
- apiGroups:
  - ""
  resources:
  - serviceaccounts/token
  verbs:
  - create
  resourceNames:
  - thanos 
---
apiVersion: rbac.authorization.k8s.io/v1
kind: RoleBinding
metadata:
  name: keda-operator-token-creator-binding
  namespace: <namespace_name> 
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: Role
  name: keda-operator-token-creator
subjects:
- kind: ServiceAccount
  name: keda-operator
  namespace: openshift-keda

1: Specifies the namespace of the service account.
2: Specifies the name of the service account.
3: Specifies the namespace of the service account.

If you are using a secret, the Secret object must exist.

Procedure

Create the TriggerAuthentication or ClusterTriggerAuthentication object.
1. Create a YAML file that defines the object:

  Example trigger authentication with a bound service account token

  kind: TriggerAuthentication
  apiVersion: keda.sh/v1alpha1
  metadata:
    name: prom-triggerauthentication
    namespace: my-namespace # <1>
  spec:
    boundServiceAccountToken: # <2>
    - parameter: token
      serviceAccountName: thanos # <3>

  1: Specifies the namespace of the object you want to scale.
  2: Specifies that this trigger authentication uses a bound service account token for authorization when connecting to the metrics endpoint.
  3: Specifies the name of the service account to use.
2. Create the TriggerAuthentication object:
  $ oc create -f <filename>.yaml

Create or edit a ScaledObject YAML file that uses the trigger authentication:

Create a YAML file that defines the object:

Example scaled object with a trigger authentication

apiVersion: keda.sh/v1alpha1
kind: ScaledObject
metadata:
  name: scaledobject
  namespace: my-namespace
spec:
  scaleTargetRef:
    name: example-deployment
  maxReplicaCount: 100
  minReplicaCount: 0
  pollingInterval: 30
  triggers:
  - type: prometheus
    metadata:
      serverAddress: https://thanos-querier.openshift-monitoring.svc.cluster.local:9092
      namespace: kedatest # replace <NAMESPACE>
      metricName: http_requests_total
      threshold: '5'
      query: sum(rate(http_requests_total{job="test-app"}[1m]))
      authModes: "basic"
    authenticationRef:
      name: prom-triggerauthentication 
      kind: TriggerAuthentication

1: Specify the name of your trigger authentication object.
2: Specify TriggerAuthentication. TriggerAuthentication is the default.

Example scaled object with a cluster trigger authentication

apiVersion: keda.sh/v1alpha1
kind: ScaledObject
metadata:
  name: scaledobject
  namespace: my-namespace
spec:
  scaleTargetRef:
    name: example-deployment
  maxReplicaCount: 100
  minReplicaCount: 0
  pollingInterval: 30
  triggers:
  - type: prometheus
    metadata:
      serverAddress: https://thanos-querier.openshift-monitoring.svc.cluster.local:9092
      namespace: kedatest # replace <NAMESPACE>
      metricName: http_requests_total
      threshold: '5'
      query: sum(rate(http_requests_total{job="test-app"}[1m]))
      authModes: "basic"
    authenticationRef:
      name: prom-cluster-triggerauthentication 
      kind: ClusterTriggerAuthentication

1: Specify the name of your trigger authentication object.
2: Specify ClusterTriggerAuthentication.

Create the scaled object by running the following command:
```
oc apply -f <filename>
```

3.6. Understanding how to add custom metrics autoscalers

To add a custom metrics autoscaler, create a ScaledObject custom resource for a deployment, stateful set, or custom resource. Create a ScaledJob custom resource for a job.

You can create only one scaled object for each workload that you want to scale. Also, you cannot use a scaled object and the horizontal pod autoscaler (HPA) on the same workload.

3.6.1. Adding a custom metrics autoscaler to a workload

You can create a custom metrics autoscaler for a workload that is created by a Deployment, StatefulSet, or custom resource object.

Prerequisites

The Custom Metrics Autoscaler Operator must be installed.

If you use a custom metrics autoscaler for scaling based on CPU or memory:

Your cluster administrator must have properly configured cluster metrics. You can use the oc describe PodMetrics <pod-name> command to determine if metrics are configured. If metrics are configured, the output appears similar to the following, with CPU and Memory displayed under Usage.

oc describe PodMetrics openshift-kube-scheduler-ip-10-0-135-131.ec2.internal

Example output

Name:         openshift-kube-scheduler-ip-10-0-135-131.ec2.internal
Namespace:    openshift-kube-scheduler
Labels:       <none>
Annotations:  <none>
API Version:  metrics.k8s.io/v1beta1
Containers:
  Name:  wait-for-host-port
  Usage:
    Memory:  0
  Name:      scheduler
  Usage:
    Cpu:     8m
    Memory:  45440Ki
Kind:        PodMetrics
Metadata:
  Creation Timestamp:  2019-05-23T18:47:56Z
  Self Link:           /apis/metrics.k8s.io/v1beta1/namespaces/openshift-kube-scheduler/pods/openshift-kube-scheduler-ip-10-0-135-131.ec2.internal
Timestamp:             2019-05-23T18:47:56Z
Window:                1m0s
Events:                <none>

The pods associated with the object you want to scale must include specified memory and CPU limits. For example:

Example pod spec

apiVersion: v1
kind: Pod
# ...
spec:
  containers:
  - name: app
    image: images.my-company.example/app:v4
    resources:
      limits:
        memory: "128Mi"
        cpu: "500m"
# ...

Procedure

Create a YAML file similar to the following. Only the name <2>, object name <4>, and object kind <5> are required:
Example scaled object
```
apiVersion: keda.sh/v1alpha1
kind: ScaledObject
metadata:
  annotations:
    autoscaling.keda.sh/paused-replicas: "0" # <1>
  name: scaledobject # <2>
  namespace: my-namespace
spec:
  scaleTargetRef:
    apiVersion: apps/v1 # <3>
    name: example-deployment # <4>
    kind: Deployment # <5>
    envSourceContainerName: .spec.template.spec.containers[0] # <6>
  cooldownPeriod: 200 # <7>
  maxReplicaCount: 100 # <8>
  minReplicaCount: 0 # <9>
  metricsServer: # <10>
    auditConfig:
      logFormat: "json"
      logOutputVolumeClaim: "persistentVolumeClaimName"
      policy:
        rules:
        - level: Metadata
        omitStages: "RequestReceived"
        omitManagedFields: false
      lifetime:
        maxAge: "2"
        maxBackup: "1"
        maxSize: "50"
  fallback: # <11>
    failureThreshold: 3
    replicas: 6
    behavior: static # <12>
  pollingInterval: 30 # <13>
  advanced:
    restoreToOriginalReplicaCount: false # <14>
    horizontalPodAutoscalerConfig:
      name: keda-hpa-scale-down # <15>
      behavior: # <16>
        scaleDown:
          stabilizationWindowSeconds: 300
          policies:
          - type: Percent
            value: 100
            periodSeconds: 15
  triggers:
  - type: prometheus # <17>
    metadata:
      serverAddress: https://thanos-querier.openshift-monitoring.svc.cluster.local:9092
      namespace: kedatest
      metricName: http_requests_total
      threshold: '5'
      query: sum(rate(http_requests_total{job="test-app"}[1m]))
      authModes: basic
    authenticationRef: # <18>
      name: prom-triggerauthentication
      kind: TriggerAuthentication
```
1: Optional: Specifies that the Custom Metrics Autoscaler Operator is to scale the replicas to the specified value and stop autoscaling, as described in the "Pausing the custom metrics autoscaler for a workload" section.
2: Specifies a name for this custom metrics autoscaler.
3: Optional: Specifies the API version of the target resource. The default is apps/v1.
4: Specifies the name of the object that you want to scale.
5: Specifies the kind as Deployment, StatefulSet, or CustomResource.
6: Optional: Specifies the name of the container in the target resource from which the custom metrics autoscaler gets environment variables, such as those that hold secrets. The default is .spec.template.spec.containers[0].
7: Optional: Specifies the period in seconds to wait after the last trigger is reported before scaling the deployment back to 0 if the minReplicaCount is set to 0. The default is 300.
8: Optional: Specifies the maximum number of replicas when scaling up. The default is 100.
9: Optional: Specifies the minimum number of replicas when scaling down.
10: Optional: Specifies the parameters for audit logs, as described in the "Configuring audit logging" section.
11: Optional: Specifies the number of replicas to fall back to if a scaler fails to get metrics from the source for the number of times defined by the failureThreshold parameter. For more information on fallback behavior, see the KEDA documentation.
12: Optional: Specifies the replica count to be used if a fallback occurs. Enter one of the following options or omit the parameter:
  - Enter static to use the number of replicas specified by the fallback.replicas parameter. This is the default.
  - Enter currentReplicas to maintain the current number of replicas.
  - Enter currentReplicasIfHigher to maintain the current number of replicas, if that number is higher than the fallback.replicas parameter. If the current number of replicas is lower than the fallback.replicas parameter, use the fallback.replicas value.
  - Enter currentReplicasIfLower to maintain the current number of replicas, if that number is lower than the fallback.replicas parameter. If the current number of replicas is higher than the fallback.replicas parameter, use the fallback.replicas value.
13: Optional: Specifies the interval, in seconds, at which to check each trigger. The default is 30.
14: Optional: Specifies whether to scale back the target resource to the original replica count after the scaled object is deleted. The default is false, which keeps the replica count as it is when the scaled object is deleted.
15: Optional: Specifies a name for the horizontal pod autoscaler. The default is keda-hpa-{scaled-object-name}.
16: Optional: Specifies a scaling policy to use to control the rate to scale pods up or down, as described in the "Scaling policies" section.
17: Specifies the trigger to use as the basis for scaling, as described in the "Understanding the custom metrics autoscaler triggers" section. This example uses OpenShift Container Platform monitoring.
18: Optional: Specifies a trigger authentication or a cluster trigger authentication. For more information, see Understanding the custom metrics autoscaler trigger authentication in the Additional resources section.
  - Enter TriggerAuthentication to use a trigger authentication. This is the default.
  - Enter ClusterTriggerAuthentication to use a cluster trigger authentication.
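
The four fallback behavior options can be easier to compare side by side. The following shell function is only an illustrative sketch of the selection rule as described for the fallback.behavior parameter; it is not how the Operator implements it, and the function name fallback_replicas is hypothetical.

```shell
# fallback_replicas BEHAVIOR CURRENT FALLBACK
# Prints the replica count that the named behavior would select, given the
# current replica count and the fallback.replicas value.
fallback_replicas() {
  behavior=$1
  current=$2
  fallback=$3
  case "$behavior" in
    static)
      echo "$fallback" ;;
    currentReplicas)
      echo "$current" ;;
    currentReplicasIfHigher)
      if [ "$current" -gt "$fallback" ]; then echo "$current"; else echo "$fallback"; fi ;;
    currentReplicasIfLower)
      if [ "$current" -lt "$fallback" ]; then echo "$current"; else echo "$fallback"; fi ;;
    *)
      echo "unknown behavior: $behavior" >&2
      return 1 ;;
  esac
}

# Using fallback.replicas: 6, as in the example scaled object.
fallback_replicas static 10 6                  # prints 6
fallback_replicas currentReplicas 10 6         # prints 10
fallback_replicas currentReplicasIfHigher 4 6  # prints 6
fallback_replicas currentReplicasIfLower 4 6   # prints 4
```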
Create the custom metrics autoscaler by running the following command:
```
oc create -f <filename>.yaml
```

Verification

View the command output to verify that the custom metrics autoscaler was created:
```
oc get scaledobject <scaled_object_name>
```
Example output
```
NAME            SCALETARGETKIND      SCALETARGETNAME        MIN   MAX   TRIGGERS     AUTHENTICATION               READY   ACTIVE   FALLBACK   AGE
scaledobject    apps/v1.Deployment   example-deployment     0     50    prometheus   prom-triggerauthentication   True    True     True       17s
```
Note the following fields in the output:
- TRIGGERS: Indicates the trigger, or scaler, that is being used.
- AUTHENTICATION: Indicates the name of any trigger authentication being used.
- READY: Indicates whether the scaled object is ready to start scaling:
  - If True, the scaled object is ready.
  - If False, the scaled object is not ready because of a problem in one or more of the objects you created.
- ACTIVE: Indicates whether scaling is taking place:
  - If True, scaling is taking place.
  - If False, scaling is not taking place because there are no metrics or there is a problem in one or more of the objects you created.
- FALLBACK: Indicates whether the custom metrics autoscaler is able to get metrics from the source:
  - If False, the custom metrics autoscaler is getting metrics.
  - If True, the custom metrics autoscaler is not getting metrics because there are no metrics or there is a problem in one or more of the objects you created.
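
As a rough aid for reading these columns, the following sketch flags the conditions that need attention. The function name summarize_scaledobject is hypothetical, and the sample text is copied from the example output. The sketch assumes that every column in the row has a value, because an empty column, such as AUTHENTICATION when no trigger authentication is used, would shift the whitespace-separated fields.

```shell
# Sample copied from the example output. In practice, pipe the output of
# `oc get scaledobject <scaled_object_name>` into summarize_scaledobject.
sample='NAME            SCALETARGETKIND      SCALETARGETNAME        MIN   MAX   TRIGGERS     AUTHENTICATION               READY   ACTIVE   FALLBACK   AGE
scaledobject    apps/v1.Deployment   example-deployment     0     50    prometheus   prom-triggerauthentication   True    True     True       17s'

summarize_scaledobject() {
  awk '
    # Map each header name to its column position.
    NR == 1 { for (i = 1; i <= NF; i++) col[$i] = i; next }
    {
      name = $1
      if ($(col["READY"]) != "True")    print name ": not ready"
      if ($(col["ACTIVE"]) != "True")   print name ": not scaling"
      if ($(col["FALLBACK"]) == "True") print name ": not getting metrics (fallback in effect)"
    }'
}

printf '%s\n' "$sample" | summarize_scaledobject
```

No output for a row means that the scaled object is ready, active, and getting metrics.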

3.6.2. Adding a custom metrics autoscaler to a job

You can create a custom metrics autoscaler for any Job object.

Important

For more information about the support scope of Red Hat Technology Preview features, see Technology Preview Features Support Scope.

Prerequisites

The Custom Metrics Autoscaler Operator must be installed.

Procedure

Create a YAML file similar to the following:

kind: ScaledJob
apiVersion: keda.sh/v1alpha1
metadata:
  name: scaledjob
  namespace: my-namespace
spec:
  jobTargetRef:
    activeDeadlineSeconds: 600 
    backoffLimit: 6 
    parallelism: 1 
    completions: 1 
    template:  
      metadata:
        name: pi
      spec:
        containers:
        - name: pi
          image: perl
          command: ["perl",  "-Mbignum=bpi", "-wle", "print bpi(2000)"]
  maxReplicaCount: 100 
  pollingInterval: 30 
  successfulJobsHistoryLimit: 5 
  failedJobsHistoryLimit: 5 
  envSourceContainerName: 
  rolloutStrategy: gradual 
  scalingStrategy: 
    strategy: "custom"
    customScalingQueueLengthDeduction: 1
    customScalingRunningJobPercentage: "0.5"
    pendingPodConditions:
      - "Ready"
      - "PodScheduled"
      - "AnyOtherCustomPodCondition"
    multipleScalersCalculation: "max"
  triggers:
  - type: prometheus 
    metadata:
      serverAddress: https://thanos-querier.openshift-monitoring.svc.cluster.local:9092
      namespace: kedatest
      metricName: http_requests_total
      threshold: '5'
      query: sum(rate(http_requests_total{job="test-app"}[1m]))
      authModes: "bearer"
    authenticationRef: 
      name: prom-cluster-triggerauthentication

Specifies the maximum duration the job can run.

Specifies the number of retries for a job. The default is 6.

Optional: Specifies how many pod replicas a job should run in parallel; defaults to 1.

For non-parallel jobs, leave unset. When unset, the default is 1.