Chapter 4. Configuring OpenShift Serverless applications
4.1. Multi-container support for Serving Copy linkLink copied to clipboard!
You can deploy a multi-container pod by using a single Knative service. This method is useful for separating application responsibilities into smaller, specialized parts.
4.1.1. Configuring a multi-container service Copy linkLink copied to clipboard!
Multi-container support is enabled by default. You can create a multi-container pod by specifiying multiple containers in the service.
Procedure
Modify your service to include additional containers. Only one container can handle requests, so specify
portsfor exactly one container. Here is an example configuration with two containers:Multiple containers configuration
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
4.1.2. Probing a multi-container service Copy linkLink copied to clipboard!
You can specify readiness and liveness probes for multiple containers. This feature is not enabled by default and you must configure it using the KnativeServing custom resource (CR).
Procedure
Configure multi-container probing for your service by enabling the
multi-container-probingfeature in theKnativeServingCR.Multi-container probing configuration
Copy to Clipboard Copied! Toggle word wrap Toggle overflow - 1
- Enabled multi-container-probing feature
Apply the updated
KnativeServingCR.oc apply -f <filename>
$ oc apply -f <filename>Copy to Clipboard Copied! Toggle word wrap Toggle overflow Modify your multi-container service to include the specified probes.
Multi-container probing
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
4.1.2.1. Additional resources Copy linkLink copied to clipboard!
4.2. EmptyDir volumes Copy linkLink copied to clipboard!
emptyDir volumes are empty volumes that are created when a pod is created, and are used to provide temporary working disk space. emptyDir volumes are deleted when the pod they were created for is deleted.
4.2.1. Configuring the EmptyDir extension Copy linkLink copied to clipboard!
The kubernetes.podspec-volumes-emptydir extension controls whether emptyDir volumes can be used with Knative Serving. To enable using emptyDir volumes, you must modify the KnativeServing custom resource (CR) to include the following YAML:
Example KnativeServing CR
4.3. Persistent Volume Claims for Serving Copy linkLink copied to clipboard!
Some serverless applications require permanent data storage. By configuring different volume types, you can provide data storage for Knative services. Serving supports mounting of the volume types such as secret, configMap, projected, and emptyDir.
You can configure persistent volume claims (PVCs) for your Knative services. The Persistent volume types are implemented as plugins. To determine if there are any persistent volume types available, you can check the available or installed storage classes in your cluster. Persistent volumes are supported, but require a feature flag to be enabled.
The mounting of large volumes can lead to a considerable delay in the start time of the application.
4.3.1. Enabling PVC support Copy linkLink copied to clipboard!
Procedure
To enable Knative Serving to use PVCs and write to them, modify the
KnativeServingcustom resource (CR) to include the following YAML:Enabling PVCs with write access
Copy to Clipboard Copied! Toggle word wrap Toggle overflow -
The
kubernetes.podspec-persistent-volume-claimextension controls whether persistent volumes (PVs) can be used with Knative Serving. -
The
kubernetes.podspec-persistent-volume-writeextension controls whether PVs are available to Knative Serving with the write access.
-
The
To claim a PV, modify your service to include the PV configuration. For example, you might have a persistent volume claim with the following configuration:
NoteUse the storage class that supports the access mode you are requesting. For example, you can use the
ocs-storagecluster-cephfsstorage class for theReadWriteManyaccess mode.The
ocs-storagecluster-cephfsstorage class is supported and comes from Red Hat OpenShift Data Foundation.PersistentVolumeClaim configuration
Copy to Clipboard Copied! Toggle word wrap Toggle overflow In this case, to claim a PV with write access, modify your service as follows:
Knative service PVC configuration
Copy to Clipboard Copied! Toggle word wrap Toggle overflow NoteTo successfully use persistent storage in Knative services, you need additional configuration, such as the user permissions for the Knative container user.
4.4. Init containers Copy linkLink copied to clipboard!
Init containers are specialized containers that are run before application containers in a pod. They are generally used to implement initialization logic for an application, which may include running setup scripts or downloading required configurations. You can enable the use of init containers for Knative services by modifying the KnativeServing custom resource (CR).
Init containers may cause longer application start-up times and should be used with caution for serverless applications, which are expected to scale up and down frequently.
4.4.1. Enabling init containers Copy linkLink copied to clipboard!
Prerequisites
- You have installed OpenShift Serverless Operator and Knative Serving on your cluster.
- You have cluster administrator permissions on OpenShift Container Platform, or you have cluster or dedicated administrator permissions on Red Hat OpenShift Service on AWS or OpenShift Dedicated.
Procedure
Enable the use of init containers by adding the
kubernetes.podspec-init-containersflag to theKnativeServingCR:Example KnativeServing CR
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
4.5. Startup probes Copy linkLink copied to clipboard!
Startup probes verify whether a service has started successfully, helping to reduce cold start times for containers with slow startup processes. Startup probes run only during the container’s initialization phase and do not execute periodically. If a startup probe fails, the container adheres to the defined restartPolicy.
4.5.1. Progress deadline Copy linkLink copied to clipboard!
By default, services have a progress deadline that defines the time limit for a service to complete its initial startup. When using startup probes, ensure that the progress deadline is set to exceed the maximum time required by the startup probes. If the progress deadline is set too low, the startup probes might not finish before the deadline is reached, which can prevent the service from starting.
Consider increasing the progress deadline if you encounter any of these conditions in your deployment:
- The service image takes a long time to pull due to its size.
-
The service takes a long time to become
READYbecause of initial cache priming. - The cluster relies on autoscaling to allocate resources for new pods.
4.5.2. Configuring startup probing Copy linkLink copied to clipboard!
For OpenShift Serverless Serving, startup probes are not defined by default. You can define startup probes for your containers in your deployment configuration.
Procedure
Define startup probes for your service by modifying your deployment configuration. The following example shows a configuration with two containers:
Example of defined starup probes
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
4.5.3. Configuring the progress deadline Copy linkLink copied to clipboard!
You can configure progress deadline settings to specify the maximum time allowed for your deployment to progress before the system reports a failure for the Knative Revision. This time limit can be specified in seconds or minutes.
To configure the progress deadline effectively, consider the following parameters:
-
initialDelaySeconds -
failureThreshold -
periodSeconds -
timeoutSeconds
If the initial scale is not achieved within the specified time limit, the Knative Autoscaler component scales the revision to 0, and the Knative service enters a terminal Failed state.
By default, the progress deadline is set to 600 seconds. This value is specified as a Golang time.Duration string and must be rounded to the nearest second.
Procedure
To configure the progress deadline setting, use an annotation in your deployment configuration.
Example of progress deadline set to 60 seconds
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
4.6. Resolving image tags to digests Copy linkLink copied to clipboard!
If the Knative Serving controller has access to the container registry, Knative Serving resolves image tags to a digest when you create a revision of a service. This is known as tag-to-digest resolution, and helps to provide consistency for deployments.
4.6.1. Tag-to-digest resolution Copy linkLink copied to clipboard!
To give the controller access to the container registry on OpenShift Container Platform, you must create a secret and then configure controller custom certificates. You can configure controller custom certificates by modifying the controller-custom-certs spec in the KnativeServing custom resource (CR). The secret must reside in the same namespace as the KnativeServing CR.
If a secret is not included in the KnativeServing CR, this setting defaults to using public key infrastructure (PKI). When using PKI, the cluster-wide certificates are automatically injected into the Knative Serving controller by using the config-service-sa config map. The OpenShift Serverless Operator populates the config-service-sa config map with cluster-wide certificates and mounts the config map as a volume to the controller.
4.6.1.1. Configuring tag-to-digest resolution by using a secret Copy linkLink copied to clipboard!
If the controller-custom-certs spec uses the Secret type, the secret is mounted as a secret volume. Knative components consume the secret directly, assuming that the secret has the required certificates.
Prerequisites
- You have cluster administrator permissions on OpenShift Container Platform, or you have cluster or dedicated administrator permissions on Red Hat OpenShift Service on AWS or OpenShift Dedicated.
- You have installed the OpenShift Serverless Operator and Knative Serving on your cluster.
Procedure
Create a secret:
Example command
$ oc -n knative-serving create secret generic custom-secret --from-file=<secret_name>.crt=<path_to_certificate>
$ oc -n knative-serving create secret generic custom-secret --from-file=<secret_name>.crt=<path_to_certificate>Copy to Clipboard Copied! Toggle word wrap Toggle overflow Configure the
controller-custom-certsspec in theKnativeServingcustom resource (CR) to use theSecrettype:Example KnativeServing CR
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
4.7. Configuring deployment resources Copy linkLink copied to clipboard!
In Knative Serving, the config-deployment config map contains settings that determine how Kubernetes Deployment resources are configured for Knative services. In OpenShift Serverless Serving, you can configure these settings in the deployment section of your KnativeServing custom resource (CR).
You can use the deployment section to configure the following:
- Tag resolution
- Runtime environments
- Progress deadlines
4.7.1. Skipping tag resolution Copy linkLink copied to clipboard!
Skipping tag resolution in OpenShift Serverless Serving can speed up deployments by avoiding unnecessary queries to the container registry, reducing latency and dependency on registry availability.
You can configure Serving to skip tag resolution by modifying the registriesSkippingTagResolving setting in your KnativeServing custom resource (CR).
Procedure
In your
KnativeServingCR, modify theregistriesSkippingTagResolvingsetting with the list of registries for which tag resoution will be skipped:Example of configured tag resolution skipping
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
4.7.2. Configuring selectable RuntimeClassName Copy linkLink copied to clipboard!
You can configure OpenShift Serverless Serving to set a specific RuntimeClassName resource for Deployments by updating the runtime-class-name setting in your KnativeServing custom resource (CR).
This setting interacts with service labels, applying either the default RuntimeClassName or the one that matches the most labels associated with the service.
Procedure
In your
KnativeServingCR, configure theruntime-class-namesetting:Example of configured
runtime-class-namesettingCopy to Clipboard Copied! Toggle word wrap Toggle overflow
4.7.3. Progress deadline Copy linkLink copied to clipboard!
By default, services have a progress deadline that defines the time limit for a service to complete its initial startup.
Consider increasing the progress deadline if you encounter any of these conditions in your deployment:
- The service image takes a long time to pull due to its size.
-
The service takes a long time to become
READYbecause of initial cache priming. - The cluster relies on autoscaling to allocate resources for new pods.
If the initial scale is not achieved within the specified time limit, the Knative Autoscaler component scales the revision to 0, and the service enters a terminal Failed state.
4.7.3.1. Configuring the progress deadline Copy linkLink copied to clipboard!
Configure progress deadline settings to set the maximum time allowed in seconds or minutes for deployment progress before the system reports a Knative Revision failure.
By default, the progress deadline is set to 600 seconds. This value is specified as a Go time.Duration string and must be rounded to the nearest second.
Procedure
Configure progress deadline by modifying your KnativeServing custom resource (CR).
In your
KnativeServingCR, set the value ofprogressDeadline:Example of progress deadline set to 60 seconds
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
4.8. Configuring Kourier Copy linkLink copied to clipboard!
Kourier is a lightweight Kubernetes-native Ingress for Knative Serving. Kourier acts as a gateway for Knative, routing HTTP traffic to Knative services.
4.8.1. Accessing the current Envoy bootstrap configuration Copy linkLink copied to clipboard!
The Envoy proxy component in Kourier handles inbound and outbound HTTP traffic for the Knative services. By default, Kourier contains an Envoy bootstrap configuration in the kourier-bootstrap configuration map in the knative-serving-ingress namespace.
Procedure
To get the current Envoy bootstrap configuration, run the following command:
Example command
oc get cm kourier-bootstrap -n knative-serving-ingress -o yaml
$ oc get cm kourier-bootstrap -n knative-serving-ingress -o yamlCopy to Clipboard Copied! Toggle word wrap Toggle overflow For example, with the default configuration, the example command produces the output that contains the following excerpts:
Example output
Copy to Clipboard Copied! Toggle word wrap Toggle overflow Example
DataoutputCopy to Clipboard Copied! Toggle word wrap Toggle overflow Example
BinaryDataoutputEvents: <none>
Events: <none>Copy to Clipboard Copied! Toggle word wrap Toggle overflow
4.8.2. Customizing kourier-bootstrap for Kourier getaways Copy linkLink copied to clipboard!
The Envoy proxy component in Kourier handles inbound and outbound HTTP traffic for the Knative services. By default, Kourier contains an Envoy bootstrap configuration in the kourier-bootstrap configuration map in the knative-serving-ingress namespace. You can change this configuration map to a custom one.
Prerequisites
- You have installed the OpenShift Serverless Operator and Knative Serving.
- You have cluster administrator permissions on OpenShift Container Platform, or you have cluster or dedicated administrator permissions on Red Hat OpenShift Service on AWS or OpenShift Dedicated.
Procedure
Specify a custom bootstrapping configuration map by changing the
spec.ingress.kourier.bootstrap-configmapfield in theKnativeServingcustom resource (CR):Example KnativeServing CR
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
4.8.3. Enabling administrator interface access Copy linkLink copied to clipboard!
You can change the envoy bootstrap configuration to enable access to the administrator interface.
This procedure assumes sufficient knowledge of Knative, as changing envoy bootstrap configuration might result in Knative failure. Red Hat does not support custom configurations that are not tested or shipped with the product.
Prerequisites
- You have installed the OpenShift Serverless Operator and Knative Serving.
- You have cluster administrator permissions on OpenShift Container Platform, or you have cluster or dedicated administrator permissions on Red Hat OpenShift Service on AWS or OpenShift Dedicated.
Procedure
To enable administrator interface access, locate this configuration in your bootstrapping configuration map:
pipe: path: /tmp/envoy.admin
pipe: path: /tmp/envoy.adminCopy to Clipboard Copied! Toggle word wrap Toggle overflow Substitute it with the following configuration:
socket_address: address: 127.0.0.1 port_value: 9901
socket_address:1 address: 127.0.0.1 port_value: 9901Copy to Clipboard Copied! Toggle word wrap Toggle overflow - 1
- This configuration enables access to the Envoy admin interface on the loopback address (127.0.0.1) and port 9901.
Apply the
socket_addressconfiguration in theservice_statscluster configuration and in theadminconfiguration:The first is in the
service_statscluster configuration:Copy to Clipboard Copied! Toggle word wrap Toggle overflow The second is in the
adminconfiguration:Copy to Clipboard Copied! Toggle word wrap Toggle overflow
4.9. Restrictive network policies Copy linkLink copied to clipboard!
4.9.1. Clusters with restrictive network policies Copy linkLink copied to clipboard!
If you are using a cluster that multiple users have access to, your cluster might use network policies to control which pods, services, and namespaces can communicate with each other over the network. If your cluster uses restrictive network policies, it is possible that Knative system pods are not able to access your Knative application. For example, if your namespace has the following network policy, which denies all requests, Knative system pods cannot access your Knative application:
Example NetworkPolicy object that denies all requests to the namespace
4.9.2. Enabling communication with Knative applications on a cluster with restrictive network policies Copy linkLink copied to clipboard!
To allow access to your applications from Knative system pods, you must add a label to each of the Knative system namespaces, and then create a NetworkPolicy object in your application namespace that allows access to the namespace for other namespaces that have this label.
A network policy that denies requests to non-Knative services on your cluster still prevents access to these services. However, by allowing access from Knative system namespaces to your Knative application, you are allowing access to your Knative application from all namespaces in the cluster.
If you do not want to allow access to your Knative application from all namespaces on the cluster, you might want to use JSON Web Token authentication for Knative services instead. JSON Web Token authentication for Knative services requires Service Mesh.
Prerequisites
-
Install the OpenShift CLI (
oc). - OpenShift Serverless Operator and Knative Serving are installed on your cluster.
Procedure
Add the
knative.openshift.io/system-namespace=truelabel to each Knative system namespace that requires access to your application:Label the
knative-servingnamespace:oc label namespace knative-serving knative.openshift.io/system-namespace=true
$ oc label namespace knative-serving knative.openshift.io/system-namespace=trueCopy to Clipboard Copied! Toggle word wrap Toggle overflow Label the
knative-serving-ingressnamespace:oc label namespace knative-serving-ingress knative.openshift.io/system-namespace=true
$ oc label namespace knative-serving-ingress knative.openshift.io/system-namespace=trueCopy to Clipboard Copied! Toggle word wrap Toggle overflow Label the
knative-eventingnamespace:oc label namespace knative-eventing knative.openshift.io/system-namespace=true
$ oc label namespace knative-eventing knative.openshift.io/system-namespace=trueCopy to Clipboard Copied! Toggle word wrap Toggle overflow Label the
knative-kafkanamespace:oc label namespace knative-kafka knative.openshift.io/system-namespace=true
$ oc label namespace knative-kafka knative.openshift.io/system-namespace=trueCopy to Clipboard Copied! Toggle word wrap Toggle overflow
Create a
NetworkPolicyobject in your application namespace to allow access from namespaces with theknative.openshift.io/system-namespacelabel:Example
NetworkPolicyobjectCopy to Clipboard Copied! Toggle word wrap Toggle overflow
4.10. Configuring revision timeouts Copy linkLink copied to clipboard!
You can configure timeout durations for revisions globally or individually to control the time spent on requests.
4.10.1. Configuring revision timeout Copy linkLink copied to clipboard!
You can configure the default number of seconds for the revision timeout based on the request.
Prerequisites
- You have installed the OpenShift Serverless Operator and Knative Serving.
You have the required permissions for your cluster:
- Cluster administrator permissions on OpenShift Container Platform
- Cluster administrator or dedicated administrator permissions on Red Hat OpenShift Service on AWS
- Cluster administrator or dedicated administrator permissions on OpenShift Dedicated
Procedure
Choose the appropriate method to configure the revision timeout:
To configure the revision timeout globally, set the
revision-timeout-secondsfield in theKnativeServingcustom resource (CR):Copy to Clipboard Copied! Toggle word wrap Toggle overflow To configure the timeout per revision by setting the
timeoutSecondsfield in your service definition:Copy to Clipboard Copied! Toggle word wrap Toggle overflow
NoteTo set the revision timeout to a value over 600 seconds (10 minutes), you must increase the default OpenShift Container Platform route timeout and the maximum revision timeout.
For instructions on how to configure timeouts for requests exceeding the default 600 seconds (10 minutes), see "Long-running requests".
4.10.2. Configuring maximum revision timeout Copy linkLink copied to clipboard!
By seting the maximum revision timeout, you can ensure that no revision can exceed a specific limit. The value of your maximum revision timeout must not exceed the terminationGracePeriodSeconds value of the activator to prevent in-flight requests being disrupted.
Prerequisites
- You have installed the OpenShift Serverless Operator and Knative Serving.
You have the required permissions for your cluster:
- Cluster administrator permissions on OpenShift Container Platform
- Cluster administrator or dedicated administrator permissions on Red Hat OpenShift Service on AWS
- Cluster administrator or dedicated administrator permissions on OpenShift Dedicated
Procedure
To configure the maximum revision timeout, set the
max-revision-timeout-secondsfield in theKnativeServingcustom resource (CR):Copy to Clipboard Copied! Toggle word wrap Toggle overflow NoteTo set the maximum revision timeout to a value over 600 seconds (10 minutes), you must increase the default OpenShift Container Platform route timeout.
For instructions on how to configure timeouts for requests exceeding the default 600 seconds (10 minutes), see "Long-running requests".
4.10.3. Configuring revision response start timeout Copy linkLink copied to clipboard!
By setting the revision response start timeout, you can specify the maximum duration in seconds that Serving waits for a revision to start sending network traffic after a request has been routed to it. The revision response start timeout must not exceed the revision timeout. The default duration is 300 seconds (5 minutes).
Prerequisites
- You have installed the OpenShift Serverless Operator and Knative Serving.
You have the required permissions for your cluster:
- Cluster administrator permissions on OpenShift Container Platform
- Cluster administrator or dedicated administrator permissions on Red Hat OpenShift Service on AWS
- Cluster administrator or dedicated administrator permissions on OpenShift Dedicated
Procedure
Choose the appropriate method to configure the revision response start timeout:
To configure the timeout globally, set the
revision-response-start-timeout-secondsfield in yourKnativeServingcustom resource (CR). If your required response start timeout exceeds the revision timeout, also adjust therevision-timeout-secondsfield accordingly:Example of revision response start timeout globally set to 300 seconds (5 minutes)
Copy to Clipboard Copied! Toggle word wrap Toggle overflow To configure the timeout per revision, set the
responseStartTimeoutSecondsfield in your service definition. If your required response start timeout exceeds the revision timeout, also adjust thetimeoutSecondsfield accordingly:Example of a service definition with revision response start timeout set to 300 seconds (5 minutes)
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
NoteTo set the revision response start timeout and the revision timeout to a value over 600 seconds (10 minutes), you must increase the default OpenShift Container Platform route timeout and the maximum revision timeout.
For instructions on how to configure timeouts for requests exceeding the default 600 seconds (10 minutes), see "Long-running requests".
4.10.4. Configuring revision idle timeout Copy linkLink copied to clipboard!
By setting the revision idle timeout, you can specify the maximum duration in seconds a request is allowed to stay open without receiving data from the application. The default duration is 0 (infinite).
Prerequisites
- You have installed the OpenShift Serverless Operator and Knative Serving.
You have the required permissions for your cluster:
- Cluster administrator permissions on OpenShift Container Platform
- Cluster administrator or dedicated administrator permissions on Red Hat OpenShift Service on AWS
- Cluster administrator or dedicated administrator permissions on OpenShift Dedicated
Procedure
Choose the appropriate method to configure the revision idle timeout:
To configure the timeout globally, set the
revision-idle-timeout-secondsfield in yourKnativeServingcustom resource (CR):Example of revision idle timeout globally set to 300 seconds (5 minutes)
Copy to Clipboard Copied! Toggle word wrap Toggle overflow To configure the timeout per revision, set the
idleTimeoutSecondsfield in your service definition:Example of a service definition with revision idle timeout set to 300 seconds (5 minutes)
Copy to Clipboard Copied! Toggle word wrap Toggle overflow
4.10.5. Long-running requests Copy linkLink copied to clipboard!
To ensure that requests exceeding the default 600 second timeout set by Knative are not prematurely terminated, you need to adjust the timeouts in the following components:
- OpenShift Container Platform route
- OpenShift Serverless Serving
- Load balancer, depending on the cloud provider
You can configure the timeouts globally or per revision. You can configure the timeouts globally if you have requests across all Knative services that need extended durations, or per revision for specific workloads that require different timeout values, such as AI deployments.
4.10.5.1. Configuring the default route timeouts globally Copy linkLink copied to clipboard!
By configuring the route timeouts globally, you can ensure consistent timeout settings across all services, simplifying management for workloads that have similar timeout needs and reducing the need for individual adjustments.
You can configure the route timeouts globally by updating the ROUTE_HAPROXY_TIMEOUT environment value in your serverless-operator subscription and updating the max-revision-timeout-seconds field in your KnativeServing custom resource (CR). This applies the timeout changes across all Knative services, and you can deploy services with specific timeouts up to the maximum value set.
ROUTE_HAPROXY_TIMEOUT is an environment variable managed by the Serverless Operator and by default is set to 600.
Procedure
Set the value of
ROUTE_HAPROXY_TIMEOUTin your subscription to your required timeout in seconds by running the following command. Note that this causes pods in theopenshift-serverlessnamespace to be redeployed.Setting the
ROUTE_HAPROXY_TIMEOUTvalue to 900 secondsoc patch subscription.operators.coreos.com serverless-operator -n openshift-serverless --type='merge' -p '{"spec": {"config": {"env": [{"name": "ROUTE_HAPROXY_TIMEOUT", "value": "900"}]}}}'$ oc patch subscription.operators.coreos.com serverless-operator -n openshift-serverless --type='merge' -p '{"spec": {"config": {"env": [{"name": "ROUTE_HAPROXY_TIMEOUT", "value": "900"}]}}}'Copy to Clipboard Copied! Toggle word wrap Toggle overflow Alternatively, you can set the value of
ROUTE_HAPROXY_TIMEOUTin your subscription directly:A subscription definition with
ROUTE_HAPROXY_TIMEOUTset to 900 secondsCopy to Clipboard Copied! Toggle word wrap Toggle overflow NoteIf you created your routes manually and disabled auto-generation with the
serving.knative.openshift.io/disableRouteannotation, you can configure the timeouts directly in the route definitions.Set the maximum revision timeout in your
KnativeServingCR:KnativeServingCR withmax-revision-timeout-secondsset to 900 secondsCopy to Clipboard Copied! Toggle word wrap Toggle overflow The Serverless Operator automatically adjusts the
terminationGracePeriodvalue of the activator to the set maximum revision timeout value to avoid request termination in cases where activator pods are being terminated themselves.Optional: Verify that the timeout has been set by running the following command:
oc get deployment activator -n knative-serving -o jsonpath="{.spec.template.spec.terminationGracePeriodSeconds}"$ oc get deployment activator -n knative-serving -o jsonpath="{.spec.template.spec.terminationGracePeriodSeconds}"Copy to Clipboard Copied! Toggle word wrap Toggle overflow If necessary for your cloud provider, adjust the load balancer timeout by running the following command:
Load balancer timeout adjustment for AWS Classic LB
Copy to Clipboard Copied! Toggle word wrap Toggle overflow Deploy a Knative service with the desired timeouts less or equal to the
max-revision-timeout-secondsvariable:A Service definition with timeouts set to 800 seconds
Copy to Clipboard Copied! Toggle word wrap Toggle overflow ImportantWhen using Service Mesh, if the activator pod is stopped while a long-running request is in-flight, the request is interrupted. To avoid request interruptions, you must adjust the value of the
terminationDrainDurationfield in theServiceMeshControlPlaneCR:Copy to Clipboard Copied! Toggle word wrap Toggle overflow - 1
- Ensure that the value exceeds the request duration to avoid the Istio proxy shutdown, which would interrupt the request.
Verification
If you are using Kourier, you can verify the current value of the timeout at the OpenShift Container Platform route by running the following command:
oc get route <route_name> -n knative-serving-ingress ess -o jsonpath="{.metadata.annotations.haproxy\.router\.openshift\.io/timeout}"$ oc get route <route_name> -n knative-serving-ingress ess -o jsonpath="{.metadata.annotations.haproxy\.router\.openshift\.io/timeout}" 800sCopy to Clipboard Copied! Toggle word wrap Toggle overflow
4.10.5.2. Configuring the default route timeouts per revision Copy linkLink copied to clipboard!
By configuring the route timeouts per revision, you can fine-tune timeouts for workloads with unique requirements, such as AI or data processing applications, without impacting the global timeout settings for other services. You can configure the timeouts of a specific revision by updating your KnativeServing custom resource (CR), the Service definition, and using the serving.knative.openshift.io/setRouteTimeout annotation to adjust the OpenShift Container Platform route timeout.
Procedure
Set the
max-revision-timeoutannotation in yourKnativeServingCR as you require:Copy to Clipboard Copied! Toggle word wrap Toggle overflow Optional: Verify the termination grace period of an activator by running the following command:
oc get deployment activator -n knative-serving -o jsonpath="{.spec.template.spec.terminationGracePeriodSeconds}"$ oc get deployment activator -n knative-serving -o jsonpath="{.spec.template.spec.terminationGracePeriodSeconds}" 900Copy to Clipboard Copied! Toggle word wrap Toggle overflow If necessary for your cloud provider, adjust the load balancer timeout by running the following command:
Load balancer timeout adjustment for AWS Classic LB
Copy to Clipboard Copied! Toggle word wrap Toggle overflow Set the timeout for your specific service:
Copy to Clipboard Copied! Toggle word wrap Toggle overflow - 1
- This annotation sets the timeout for the OpenShift Container Platform route. You can fine-tune this for each service instead of setting a global maximum.
- 2
- This ensures that the request does not exceed the specific value.
- 3
- This ensures that the response start timeout does not trigger before the max threshold is reached. The default value is
300.
ImportantWhen using Service Mesh, if the activator pod is stopped while a long-running request is in-flight, the request is interrupted. To avoid request interruptions, you must adjust the value of the
terminationDrainDurationfield in theServiceMeshControlPlaneCR:Copy to Clipboard Copied! Toggle word wrap Toggle overflow - 1
- Ensure that the value exceeds the request duration to avoid the Istio proxy shutdown, which would interrupt the request.
Verification
If you are using Kourier, you can verify the current value of the timeout at the OpenShift Container Platform route by running the following command:
oc get route <route-name> -n knative-serving-ingress ess -o jsonpath="{.metadata.annotations.haproxy\.router\.openshift\.io/timeout}"$ oc get route <route-name> -n knative-serving-ingress ess -o jsonpath="{.metadata.annotations.haproxy\.router\.openshift\.io/timeout}" 800sCopy to Clipboard Copied! Toggle word wrap Toggle overflow