Chapter 3. Mirroring the required images for model inference
Once you have created a mirror registry for the disconnected environment, you are ready to mirror the required AI Inference Server image, AI accelerator Operator images, and OCI model container image.
Prerequisites
-
You have installed the OpenShift CLI (
oc). -
You have logged in as a user with
cluster-adminprivileges. - You have installed a mirror registry on the bastion host.
Procedure
Find the version of the following images that match your environment and model inference use case:
Select an OCI model container image, for example
registry.redhat.io/rhelai1/granite-3-1-8b-instruct-quantized-w8a8:1.5NoteYou can select any OCI model container image from the validated models list that matches your requirements. See Validated models for AI Inference Server for available options.
Create an image set configuration custom resource (CR) that includes the NFD Operator, NVIDIA GPU Operator, AI Inference Server image, and the OCI model image. For example, save the following
ImageSetConfigurationCR as the fileimageset-config.yaml:Copy to Clipboard Copied! Toggle word wrap Toggle overflow Mirror the required images into the mirror registry using a valid pull secret. Run the following command:
oc mirror --config imageset-config.yaml docker://<TARGET_MIRROR_REGISTRY_URL> --registry-config <PATH_TO_PULL_SECRET_JSON>
$ oc mirror --config imageset-config.yaml docker://<TARGET_MIRROR_REGISTRY_URL> --registry-config <PATH_TO_PULL_SECRET_JSON>Copy to Clipboard Copied! Toggle word wrap Toggle overflow Alternatively, if you have already installed the NFD and NVIDIA GPU Operators in the cluster, create an
ImageSetConfigurationCR that configures the AI Inference Server and OCI model container images only:Copy to Clipboard Copied! Toggle word wrap Toggle overflow - Mirror the image set in the disconnected environment.
- Configure the cluster for the mirror registry.