此内容没有您所选择的语言版本。

Chapter 1. Version 3.4 release notes


Red Hat Enterprise Linux AI is a generative AI inference platform for Linux environments that uses Red Hat AI Inference for running and optimizing models, and includes Red Hat AI Model Optimization Toolkit for model quantization, sparsity, and general compression for supported AI accelerators. Red Hat AI Model Optimization Toolkit has native Hugging Face and vLLM support. You can seamlessly integrate optimized models with deployment pipelines for faster, cost-saving inference at scale, powered by the compressed-tensors model format.

Red Hat Enterprise Linux AI is packaged as bootc container images for easy deployment on a Linux server appliances with NVIDIA CUDA, AMD ROCm, or Intel Gaudi AI accelerators installed. Download the images from registry.redhat.io.

The following bootc and disk-image container images are Generally Available (GA):

  • registry.redhat.io/rhelai3/bootc-cuda-rhel9:3.4.0
  • registry.redhat.io/rhelai3/bootc-aws-cuda-rhel9:3.4.0
  • registry.redhat.io/rhelai3/bootc-azure-cuda-rhel9:3.4.0
  • registry.redhat.io/rhelai3/bootc-gcp-cuda-rhel9:3.4.0
  • registry.redhat.io/rhelai3/bootc-rocm-rhel9:3.4.0
  • registry.redhat.io/rhelai3/bootc-azure-rocm-rhel9:3.4.0
  • registry.redhat.io/rhelai3/disk-image-cuda-rhel9:3.4.0

The following bootc container images are Technology Preview:

  • registry.redhat.io/rhelai3/bootc-gaudi-rhel9:3.4.0
Important

There is no direct upgrade path from Red Hat Enterprise Linux AI 1.5 to Red Hat Enterprise Linux AI 3.0. You can upgrade from Red Hat Enterprise Linux AI 3.0 to 3.4 and all versions in-between.

Important

The registry.redhat.io/rhelai3/bootc-rocm-rhel9:3.4.0 image does not include Red Hat AI Model Optimization Toolkit, which is not supported for AMD ROCm AI accelerators.

1.1. New features

Red Hat Enterprise Linux AI 3.4 packages Red Hat AI Inference 3.4, which includes the following highlights:

Upgraded vLLM to v0.18.0
Red Hat AI Inference 3.4 packages the upstream vLLM v0.18.0 release. You can review the complete list of updates in the upstream vLLM v0.18.0 release notes.
Red Hat AI Model Optimization Toolkit updated to v0.10.0.1
Red Hat AI Model Optimization Toolkit 3.4 packages the upstream LLM Compressor v0.10.0.1 release. You can review the complete list of updates in the upstream LLM Compressor v0.10.0.1 release notes.
Systemd Quadlet service renamed from rhaiis to rhaii
The systemd Quadlet service has been renamed from rhaiis to rhaii to align with the Red Hat AI Inference product name change. The service file is now located at /etc/containers/systemd/rhaii.container. Symbolic links from the previous rhaiis paths are maintained for backward compatibility with existing installations.

For the complete list of new features, enhancements, and known issues, see the Red Hat AI Inference 3.4 release notes.

1.2. Known issues

NVIDIA HGX B200 and B300 servers require additional NVLink packages

The Red Hat Enterprise Linux AI bootc image does not currently include the OFED or MOFED InfiniBand drivers that NVIDIA Fabric Manager requires on HGX B200 and B300 NVSwitch-based systems. Without these drivers, Fabric Manager fails to start and GPU initialization fails with a cudaGetDeviceCount() error (Error 802: system not yet initialized).

As a workaround, you can disable NVLink by running the following command and rebooting the system:

# echo "options nvidia NVreg_NvLinkDisable=1" >> /etc/modprobe.d/nvidia.conf
Important

Disabling NVLink forces GPU-to-GPU communication over PCIe, which significantly reduces multi-GPU bandwidth. Expect higher latency and lower throughput for models that span multiple GPUs.

Red Hat logoGithubredditYoutubeTwitter

学习

尝试、购买和销售

社区

關於紅帽

我们提供强化的解决方案,使企业能够更轻松地跨平台和环境(从核心数据中心到网络边缘)工作。

让开源更具包容性

红帽致力于替换我们的代码、文档和 Web 属性中存在问题的语言。欲了解更多详情,请参阅红帽博客.

关于红帽文档

Legal Notice

Theme

© 2026 Red Hat
返回顶部