Chapter 1. Version 3.4 release notes


Red Hat Enterprise Linux AI is a generative AI inference platform for Linux environments that uses Red Hat AI Inference for running and optimizing models, and includes Red Hat AI Model Optimization Toolkit for model quantization, sparsity, and general compression for supported AI accelerators. Red Hat AI Model Optimization Toolkit has native Hugging Face and vLLM support. You can seamlessly integrate optimized models with deployment pipelines for faster, cost-saving inference at scale, powered by the compressed-tensors model format.

Red Hat Enterprise Linux AI is packaged as bootc container images for easy deployment on a Linux server appliances with NVIDIA CUDA, AMD ROCm, or Intel Gaudi AI accelerators installed. Download the images from registry.redhat.io.

The following bootc and disk-image container images are Generally Available (GA):

  • registry.redhat.io/rhelai3/bootc-cuda-rhel9:3.4.0
  • registry.redhat.io/rhelai3/bootc-aws-cuda-rhel9:3.4.0
  • registry.redhat.io/rhelai3/bootc-azure-cuda-rhel9:3.4.0
  • registry.redhat.io/rhelai3/bootc-gcp-cuda-rhel9:3.4.0
  • registry.redhat.io/rhelai3/bootc-rocm-rhel9:3.4.0
  • registry.redhat.io/rhelai3/bootc-azure-rocm-rhel9:3.4.0
  • registry.redhat.io/rhelai3/disk-image-cuda-rhel9:3.4.0

The following bootc container images are Technology Preview:

  • registry.redhat.io/rhelai3/bootc-gaudi-rhel9:3.4.0
Important

There is no direct upgrade path from Red Hat Enterprise Linux AI 1.5 to Red Hat Enterprise Linux AI 3.0. You can upgrade from Red Hat Enterprise Linux AI 3.0 to 3.4 and all versions in-between.

Important

The registry.redhat.io/rhelai3/bootc-rocm-rhel9:3.4.0 image does not include Red Hat AI Model Optimization Toolkit, which is not supported for AMD ROCm AI accelerators.

1.1. New features

Red Hat Enterprise Linux AI 3.4 packages Red Hat AI Inference 3.4, which includes the following highlights:

Upgraded vLLM to v0.18.0
Red Hat AI Inference 3.4 packages the upstream vLLM v0.18.0 release. You can review the complete list of updates in the upstream vLLM v0.18.0 release notes.
Red Hat AI Model Optimization Toolkit updated to v0.10.0.1
Red Hat AI Model Optimization Toolkit 3.4 packages the upstream LLM Compressor v0.10.0.1 release. You can review the complete list of updates in the upstream LLM Compressor v0.10.0.1 release notes.
Systemd Quadlet service renamed from rhaiis to rhaii
The systemd Quadlet service has been renamed from rhaiis to rhaii to align with the Red Hat AI Inference product name change. The service file is now located at /etc/containers/systemd/rhaii.container. Symbolic links from the previous rhaiis paths are maintained for backward compatibility with existing installations.

For the complete list of new features, enhancements, and known issues, see the Red Hat AI Inference 3.4 release notes.

1.2. Known issues

NVIDIA HGX B200 and B300 servers require additional NVLink packages

The Red Hat Enterprise Linux AI bootc image does not currently include the OFED or MOFED InfiniBand drivers that NVIDIA Fabric Manager requires on HGX B200 and B300 NVSwitch-based systems. Without these drivers, Fabric Manager fails to start and GPU initialization fails with a cudaGetDeviceCount() error (Error 802: system not yet initialized).

As a workaround, you can disable NVLink by running the following command and rebooting the system:

# echo "options nvidia NVreg_NvLinkDisable=1" >> /etc/modprobe.d/nvidia.conf
Important

Disabling NVLink forces GPU-to-GPU communication over PCIe, which significantly reduces multi-GPU bandwidth. Expect higher latency and lower throughput for models that span multiple GPUs.

Red Hat logoGithubredditYoutubeTwitter

Learn

Try, buy, & sell

Communities

About Red Hat

We deliver hardened solutions that make it easier for enterprises to work across platforms and environments, from the core datacenter to the network edge.

Making open source more inclusive

Red Hat is committed to replacing problematic language in our code, documentation, and web properties. For more details, see the Red Hat Blog.

About Red Hat Documentation

Legal Notice

Theme

© 2026 Red Hat
Back to top