ホーム
製品
OpenShift Container Platform
4.2
Serverless applications
第8章 Configuring Knative Serving autoscaling

第8章 Configuring Knative Serving autoscaling

重要

You are viewing documentation for a release of Red Hat OpenShift Serverless that is no longer supported. Red Hat OpenShift Serverless is currently supported on OpenShift Container Platform 4.3 and newer.

OpenShift Serverless provides capabilities for automatic Pod scaling, including scaling inactive Pods to zero, by enabling the Knative Serving autoscaling system in an OpenShift Container Platform cluster.

To enable autoscaling for Knative Serving, you must configure concurrency and scale bounds in the revision template.

注記

Any limits or targets set in the revision template are measured against a single instance of your application. For example, setting the target annotation to 50 will configure the autoscaler to scale the application so that each instance of it will handle 50 requests at a time.

8.1. Configuring concurrent requests for Knative Serving autoscaling
リンクのコピー

You can specify the number of concurrent requests that should be handled by each instance of an application (revision container) by adding the target annotation or the containerConcurrency field in the revision template.

Here is an example of target being used in a revision template:

apiVersion: serving.knative.dev/v1alpha1
kind: Service
metadata:
  name: myapp
spec:
  template:
    metadata:
      annotations:
        autoscaling.knative.dev/target: 50
    spec:
      containers:
      - image: myimage

apiVersion: serving.knative.dev/v1alpha1
kind: Service
metadata:
  name: myapp
spec:
  template:
    metadata:
      annotations:
        autoscaling.knative.dev/target: 50
    spec:
      containers:
      - image: myimage

Copy to Clipboard

Toggle word wrap

Here is an example of containerConcurrency being used in a revision template:

apiVersion: serving.knative.dev/v1alpha1
kind: Service
metadata:
  name: myapp
spec:
  template:
    metadata:
      annotations:
    spec:
      containerConcurrency: 100
      containers:
      - image: myimage

apiVersion: serving.knative.dev/v1alpha1
kind: Service
metadata:
  name: myapp
spec:
  template:
    metadata:
      annotations:
    spec:
      containerConcurrency: 100
      containers:
      - image: myimage

Copy to Clipboard

Toggle word wrap

Adding a value for both target and containerConcurrency will target the target number of concurrent requests, but impose a hard limit of the containerConcurrency number of requests.

For example, if the target value is 50 and the containerConcurrency value is 100, the targeted number of requests will be 50, but the hard limit will be 100.

If the containerConcurrency value is less than the target value, the target value will be tuned down, since there is no need to target more requests than the number that can actually be handled.

注記

containerConcurrency should only be used if there is a clear need to limit how many requests reach the application at a given time. Using containerConcurrency is only advised if the application needs to have an enforced constraint of concurrency.

8.1.1. Configuring concurrent requests using the target annotation
リンクのコピー

The default target for the number of concurrent requests is 100, but you can override this value by adding or modifying the autoscaling.knative.dev/target annotation value in the revision template.

Here is an example of how this annotation is used in the revision template to set the target to 50.

autoscaling.knative.dev/target: 50

autoscaling.knative.dev/target: 50

Copy to Clipboard

Toggle word wrap

8.1.2. Configuring concurrent requests using the containerConcurrency field
リンクのコピー

containerConcurrency sets a hard limit on the number of concurrent requests handled.

containerConcurrency: 0 | 1 | 2-N

containerConcurrency: 0 | 1 | 2-N

Copy to Clipboard

Toggle word wrap

0: allows unlimited concurrent requests.
1: guarantees that only one request is handled at a time by a given instance of the revision container.
2 or more: will limit request concurrency to that value.

注記

If there is no target annotation, autoscaling is configured as if target is equal to the value of containerConcurrency.

第8章 Configuring Knative Serving autoscaling

8.1. Configuring concurrent requests for Knative Serving autoscaling
リンクのコピー

8.1.1. Configuring concurrent requests using the target annotation
リンクのコピー

8.1.2. Configuring concurrent requests using the containerConcurrency field
リンクのコピー

詳細情報

試用、購入および販売

コミュニティー

Red Hat ドキュメントについて

多様性を受け入れるオープンソースの強化

会社概要

Theme

Red Hat legal and privacy links

Red Hat legal and privacy links

第8章 Configuring Knative Serving autoscaling

8.1. Configuring concurrent requests for Knative Serving autoscalingリンクのコピーリンクがクリップボードにコピーされました!

8.1.1. Configuring concurrent requests using the target annotationリンクのコピーリンクがクリップボードにコピーされました!

8.1.2. Configuring concurrent requests using the containerConcurrency fieldリンクのコピーリンクがクリップボードにコピーされました!

詳細情報

試用、購入および販売

コミュニティー

Red Hat ドキュメントについて

多様性を受け入れるオープンソースの強化

会社概要

Theme

Red Hat legal and privacy links

Red Hat legal and privacy links

8.1. Configuring concurrent requests for Knative Serving autoscaling
リンクのコピー

8.1.1. Configuring concurrent requests using the target annotation
リンクのコピー

8.1.2. Configuring concurrent requests using the containerConcurrency field
リンクのコピー