Chapter 1. Version 3.4 release notes

Red Hat Enterprise Linux AI is a generative AI inference platform for Linux environments. It uses Red Hat AI Inference to run and optimize models, and includes Red Hat AI Model Optimization Toolkit for model quantization, sparsity, and general compression on supported AI accelerators. Red Hat AI Model Optimization Toolkit has native Hugging Face and vLLM support. You can integrate optimized models into deployment pipelines for faster, lower-cost inference at scale, using the compressed-tensors model format.

Red Hat Enterprise Linux AI is packaged as bootc container images for easy deployment on Linux server appliances with NVIDIA CUDA, AMD ROCm, or Intel Gaudi AI accelerators installed. Download the images from registry.redhat.io.

The following bootc and disk-image container images are Generally Available (GA):

registry.redhat.io/rhelai3/bootc-cuda-rhel9:3.4.0
registry.redhat.io/rhelai3/bootc-aws-cuda-rhel9:3.4.0
registry.redhat.io/rhelai3/bootc-azure-cuda-rhel9:3.4.0
registry.redhat.io/rhelai3/bootc-gcp-cuda-rhel9:3.4.0
registry.redhat.io/rhelai3/bootc-rocm-rhel9:3.4.0
registry.redhat.io/rhelai3/bootc-azure-rocm-rhel9:3.4.0
registry.redhat.io/rhelai3/disk-image-cuda-rhel9:3.4.0

The following bootc container image is Technology Preview:

registry.redhat.io/rhelai3/bootc-gaudi-rhel9:3.4.0
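
The bootc image names listed above share one naming pattern, so pulling an image can be scripted. The following is a minimal sketch, assuming podman is installed and you have credentials for registry.redhat.io. The rhelai_bootc_image helper is hypothetical and is not part of the product. The login and pull commands are left commented out because they need network access and a Red Hat account.

```shell
# Hypothetical helper: build a bootc image reference from a variant name.
# The variant is the middle part of the image names listed above,
# for example "cuda", "aws-cuda", "rocm", or "gaudi".
rhelai_bootc_image() {
    variant="$1"
    echo "registry.redhat.io/rhelai3/bootc-${variant}-rhel9:3.4.0"
}

# Print the reference for the NVIDIA CUDA image.
rhelai_bootc_image cuda

# On a real system, authenticate and then pull the image:
# podman login registry.redhat.io
# podman pull "$(rhelai_bootc_image cuda)"
```

The disk-image container image does not follow the bootc-<variant> pattern, so reference it by its full name as listed.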

Important

There is no direct upgrade path from Red Hat Enterprise Linux AI 1.5 to Red Hat Enterprise Linux AI 3.0. You can upgrade from Red Hat Enterprise Linux AI 3.0 to 3.4 and to all versions in between.

Important

The registry.redhat.io/rhelai3/bootc-rocm-rhel9:3.4.0 image does not include Red Hat AI Model Optimization Toolkit, which is not supported for AMD ROCm AI accelerators.

1.1. New features

Red Hat Enterprise Linux AI 3.4 packages Red Hat AI Inference 3.4, which includes the following highlights:

Upgraded vLLM to v0.18.0: Red Hat AI Inference 3.4 packages the upstream vLLM v0.18.0 release. You can review the complete list of updates in the upstream vLLM v0.18.0 release notes.
Red Hat AI Model Optimization Toolkit updated to v0.10.0.1: Red Hat AI Model Optimization Toolkit 3.4 packages the upstream LLM Compressor v0.10.0.1 release. You can review the complete list of updates in the upstream LLM Compressor v0.10.0.1 release notes.
Systemd Quadlet service renamed from rhaiis to rhaii: The systemd Quadlet service has been renamed from rhaiis to rhaii to align with the Red Hat AI Inference product name change. The service file is now located at /etc/containers/systemd/rhaii.container. Symbolic links from the previous rhaiis paths are maintained for backward compatibility with existing installations.
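
Quadlet derives the systemd service name from the unit file name, so the renamed file corresponds to a service called rhaii.service. The following sketch shows that mapping. The quadlet_service_name function is a hypothetical helper for illustration, and the systemctl call is commented out because it only works on a host where the service is installed.

```shell
# Hypothetical helper: map a Quadlet .container file to the name of
# the systemd service that Quadlet generates from it.
quadlet_service_name() {
    # basename strips the directory and the .container suffix
    base=$(basename "$1" .container)
    echo "${base}.service"
}

# Print the service name for the renamed unit file.
quadlet_service_name /etc/containers/systemd/rhaii.container

# On a real system, inspect the unit file and the service:
# ls -l /etc/containers/systemd/
# systemctl status "$(quadlet_service_name /etc/containers/systemd/rhaii.container)"
```

Scripts that still refer to the rhaiis name should keep working through the symbolic links, but updating them to rhaii avoids relying on the compatibility paths.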

For the complete list of new features, enhancements, and known issues, see the Red Hat AI Inference 3.4 release notes.

1.2. Known issues

NVIDIA HGX B200 and B300 servers require additional NVLink packages

The Red Hat Enterprise Linux AI bootc image does not currently include the OFED or MOFED InfiniBand drivers that NVIDIA Fabric Manager requires on HGX B200 and B300 NVSwitch-based systems. Without these drivers, Fabric Manager fails to start and GPU initialization fails with a cudaGetDeviceCount() error (Error 802: system not yet initialized).

As a workaround, you can disable NVLink by running the following command and rebooting the system:

# echo "options nvidia NVreg_NvLinkDisable=1" >> /etc/modprobe.d/nvidia.conf

Important

Disabling NVLink forces GPU-to-GPU communication over PCIe, which significantly reduces multi-GPU bandwidth. Expect higher latency and lower throughput for models that span multiple GPUs.
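
The workaround command appends to the configuration file each time it runs, so running it twice leaves a duplicate line. The following is a minimal sketch of an idempotent variant. The disable_nvlink_option function name is hypothetical, and the optional path argument exists only so that you can try it on a scratch file first.

```shell
# Append the NVLink-disable option only if it is not already present.
# With no argument, the function targets /etc/modprobe.d/nvidia.conf.
disable_nvlink_option() {
    conf="${1:-/etc/modprobe.d/nvidia.conf}"
    line="options nvidia NVreg_NvLinkDisable=1"
    # -x matches the whole line, -F treats the pattern as a fixed string.
    # A missing file counts as "not present", so the line is appended.
    if ! grep -qxF "$line" "$conf" 2>/dev/null; then
        echo "$line" >> "$conf"
    fi
}
```

Run disable_nvlink_option as root with no argument to apply the workaround, then reboot the system as described above.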
