Red Hat AI Inference | 3.4 | Red Hat Documentation

Get started

Release notes

Highlights of what is new and what has changed with this Red Hat AI Inference release

Getting started

Getting started with Red Hat AI Inference

Distributed Inference with llm-d

Architecture, components, and deployment of Distributed Inference with llm-d for scalable LLM serving on Kubernetes

Plan

Supported product and hardware configurations

Supported product and hardware configurations for deploying Red Hat AI software

Validated models

Red Hat AI validated models

Inference Operations

Deploy Distributed Inference with llm-d on Openshift Container Platform

Deploy and serve large language models at scale on Openshift Container Platform

Deploy Distributed Inference with llm-d on Azure or CoreWeave Kubernetes Service

Deploy the standalone Red Hat AI Inference container in OpenShift Container Platform

Deploy the standalone Red Hat AI Inference container in OpenShift Container Platform clusters that have supported AI accelerators installed

Monitor and troubleshoot Distributed Inference with llm-d deployments

Deploy the standalone Red Hat AI Inference container in a disconnected environment

Deploy Red Hat AI Inference in a disconnected environment using OpenShift Container Platform and a disconnected mirror image registry

Inference serving language models in OCI-compliant model containers

Inferencing OCI-compliant models in Red Hat AI Inference

Speculative decoding

Speculative decoding with Red Hat AI Inference

Inference serving Mistral 3 models

Inference serving Mistral 3 models with Red Hat AI Inference

Inference serving geospatial foundation models

Inference serving geospatial foundation models with Red Hat AI Inference

Red Hat AI Model Optimization Toolkit

Compressing large language models with the LLM Compressor library

vLLM server arguments

Server arguments for running Red Hat AI Inference

Extending Red Hat AI Inference with tool calling capabilities

Configuring tool calling and chat templates for AI Inference

Red Hat Enterprise Linux AI

Switch to the Red Hat Enterprise Linux AI documentation

Red Hat OpenShift AI

Switch to the Red Hat OpenShift AI documentation

Red Hat AI Enterprise

Switch to the Red Hat AI Enterprise documentation

Additional Resources

Red Hat AI Inference Server 3.3

Switch to the Red Hat AI Inference Server 3.3 documentation

Certified partner AI accelerator containers

Use Red Hat-certified container images to extend Red Hat AI Inference to run on specialized AI accelerators.

Red Hat AI Foundations

Explore no-cost courses to boost your AI knowledge and get hands-on experience with Red Hat AI products while earning a certificate

Red Hat AI learning hub

Explore the curated set of third‑party models validated for Red Hat AI products, ready for fast, reliable deployment

Red Hat AI Inference 3.4

このコンテンツは選択した言語では利用できません。

Get started

Release notes

Getting started

Distributed Inference with llm-d

Plan

Supported product and hardware configurations

Validated models

Inference Operations

Deploy Distributed Inference with llm-d on Openshift Container Platform

Deploy Distributed Inference with llm-d on Azure or CoreWeave Kubernetes Service

Deploy the standalone Red Hat AI Inference container in OpenShift Container Platform

Monitor and troubleshoot Distributed Inference with llm-d deployments

Deploy the standalone Red Hat AI Inference container in a disconnected environment

Inference serving language models in OCI-compliant model containers

Speculative decoding

Inference serving Mistral 3 models

Inference serving geospatial foundation models

Red Hat AI Model Optimization Toolkit

vLLM server arguments

Extending Red Hat AI Inference with tool calling capabilities

Additional Resources

Red Hat AI Inference Server 3.3

Certified partner AI accelerator containers

Red Hat AI Foundations

Red Hat AI learning hub

詳細情報

試用、購入および販売

コミュニティー

会社概要

多様性を受け入れるオープンソースの強化

Red Hat ドキュメントについて

Theme

Red Hat legal and privacy links

Red Hat legal and privacy links

Red Hat AI Inference 3.4

このコンテンツは選択した言語では利用できません。

Get started

Plan

Inference Operations

Related Products

Additional Resources

詳細情報

試用、購入および販売

コミュニティー

会社概要

多様性を受け入れるオープンソースの強化

Red Hat ドキュメントについて

Theme

Red Hat legal and privacy links

Red Hat legal and privacy links