Chapter 2. Mirroring the Operator images for AI Inference Server and its dependencies
Once you have created a mirror registry for the disconnected environment, you are ready to mirror AI Inference Server container images.
Prerequisites
-
You have installed the OpenShift CLI (
oc). -
You have logged in as a user with
cluster-adminprivileges. - You have installed a mirror registry on the bastion host
Procedure
Find the version of the following images that match your environment and pull the images with
podman:Create an image set configuration custom resource (CR) that includes the NFD Operator, NVIDIA GPU Operator, and AI Inference Server images that you pulled in the previous step. For example, the following
ImageSetConfigurationCR includes the image specs for AMD CPU architecture and CUDA accelerators:Copy to Clipboard Copied! Toggle word wrap Toggle overflow Alternatively, if you have already installed the NFD and NVIDIA GPU Operators in the cluster, create an
ImageSetConfigurationCR that configures AI Inference Server only:apiVersion: mirror.openshift.io/v2alpha1 kind: ImageSetConfiguration mirror: additionalImages: - name: registry.redhat.io/rhaiis/vllm-cuda-rhel9:latest
apiVersion: mirror.openshift.io/v2alpha1 kind: ImageSetConfiguration mirror: additionalImages: - name: registry.redhat.io/rhaiis/vllm-cuda-rhel9:latestCopy to Clipboard Copied! Toggle word wrap Toggle overflow - Mirror the image set in the disconnected environment.
- Configure the disconnected cluster to use the updated image set.