Chapter 1. Version 3.4.0-ea.2 release notes
Red Hat Enterprise Linux AI 3.4.0-ea.2 is the second Early Access release of Red Hat Enterprise Linux AI 3.4. This release adds Intel Gaudi 3 AI accelerator support to the RHEL AI bootc container image.
Red Hat Enterprise Linux AI 3.4.0-ea.2 is an Early Access release. Early Access releases are not supported by Red Hat in any way and are not functionally complete or production-ready. Do not use Early Access releases for production or business-critical workloads. Use Early Access releases to test upcoming product features in advance of their possible inclusion in a Red Hat product offering, and to test functionality and provide feedback during the development process. These features might not have any documentation, are subject to change or removal at any time, and testing is limited. Red Hat might provide ways to submit feedback on Early Access features without an associated SLA.
The following container images are available as early access releases from registry.redhat.io:
- registry.redhat.io/rhelai-early-access/bootc-cuda-rhel9:3.4.0-ea.2
- registry.redhat.io/rhelai-early-access/bootc-rocm-rhel9:3.4.0-ea.2
- registry.redhat.io/rhelai-early-access/bootc-gaudi-rhel9:3.4.0-ea.2
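The image references above share one registry, namespace, and tag, and differ only in the accelerator name. The following Python sketch shows one way to compose a reference and a matching podman pull command line for a given accelerator. The helper names image_reference and pull_command are hypothetical and are not part of any Red Hat tooling. The sketch only builds strings and does not run podman. It also assumes that you have already authenticated to registry.redhat.io, for example with podman login.

```python
import shlex

# Values taken from the image list in these release notes.
REGISTRY = "registry.redhat.io"
NAMESPACE = "rhelai-early-access"
TAG = "3.4.0-ea.2"

# Accelerator key -> bootc image name. The keys are an illustrative
# convention for this sketch, not an official naming scheme.
BOOTC_IMAGES = {
    "cuda": "bootc-cuda-rhel9",
    "rocm": "bootc-rocm-rhel9",
    "gaudi": "bootc-gaudi-rhel9",
}


def image_reference(accelerator: str) -> str:
    """Return the full image reference for one accelerator key (hypothetical helper)."""
    try:
        name = BOOTC_IMAGES[accelerator]
    except KeyError:
        known = ", ".join(sorted(BOOTC_IMAGES))
        raise ValueError(
            f"unknown accelerator {accelerator!r}; expected one of: {known}"
        ) from None
    return f"{REGISTRY}/{NAMESPACE}/{name}:{TAG}"


def pull_command(accelerator: str) -> str:
    """Return a shell-safe 'podman pull' command line as a string (hypothetical helper)."""
    return shlex.join(["podman", "pull", image_reference(accelerator)])


if __name__ == "__main__":
    # Print one pull command per available early access image.
    for accelerator in BOOTC_IMAGES:
        print(pull_command(accelerator))
```

Keeping the registry, namespace, and tag in one place means that only the TAG value needs to change when a later Early Access release is published, assuming the image names stay the same.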
1.1. New features
Red Hat Enterprise Linux AI 3.4.0-ea.2 packages Red Hat AI Inference Server 3.4.0-ea.2, which includes the following highlights:
- Upgraded vLLM to v0.16.0
- Red Hat AI Inference Server 3.4.0-ea.2 packages the upstream vLLM v0.16.0. You can review the complete list of updates in the upstream vLLM v0.16.0 release notes.
- Introducing Speculators library (Technology Preview)
- Speculators is an end-to-end training framework for creating EAGLE3 draft models that accelerate inference through speculative decoding, reducing model latency by 1.5x to 3x while maintaining output quality. For more information, see the Red Hat AI Inference Server version 3.4.0-ea.2 release notes.
- Intel Gaudi 3 AI accelerator support (Technology Preview)
- Red Hat Enterprise Linux AI 3.4.0-ea.2 adds Intel Gaudi 3 AI accelerator support as a Technology Preview feature through the vllm-gaudi hardware plugin version 0.16.0. The vllm-gaudi-rhel9 container image is available from registry.redhat.io/rhaii-early-access/vllm-gaudi-rhel9:3.4.0-ea.2. Intel Gaudi 3 support includes paged KV cache optimized for Gaudi HPU devices, tensor parallel inference across multiple HPU devices, and FP8 quantization with Intel Neural Compressor (INC).
Important: Intel Gaudi 3 AI accelerator support is a Technology Preview feature only. Technology Preview features are not supported with Red Hat production service level agreements (SLAs) and might not be functionally complete. Red Hat does not recommend using them in production. These features provide early access to upcoming product features, enabling customers to test functionality and provide feedback during the development process.
For more information about the support scope of Red Hat Technology Preview features, see Technology Preview Features Support Scope.
1.2. Known issues
There are no known issues for Red Hat Enterprise Linux AI 3.4.0-ea.2.