Red Hat AI Inference 3.4

このコンテンツは選択した言語では利用できません。

Get started

Release notes

Highlights of what is new and what has changed with this Red Hat AI Inference release

Getting started

Getting started with Red Hat AI Inference

Distributed Inference with llm-d

Architecture, components, and deployment of Distributed Inference with llm-d for scalable LLM serving on Kubernetes

Plan

Supported product and hardware configurations

Supported product and hardware configurations for deploying Red Hat AI software

Validated models

Red Hat AI validated models

Inference Operations

Deploy Distributed Inference with llm-d on Openshift Container Platform

Deploy and serve large language models at scale on Openshift Container Platform

Deploy Distributed Inference with llm-d on Azure or CoreWeave Kubernetes Service

Deploy Distributed Inference with llm-d on Azure or CoreWeave Kubernetes Service

Deploy the standalone Red Hat AI Inference container in OpenShift Container Platform

Deploy the standalone Red Hat AI Inference container in OpenShift Container Platform clusters that have supported AI accelerators installed

Monitor and troubleshoot Distributed Inference with llm-d deployments

Monitor and troubleshoot Distributed Inference with llm-d deployments

Deploy the standalone Red Hat AI Inference container in a disconnected environment

Deploy Red Hat AI Inference in a disconnected environment using OpenShift Container Platform and a disconnected mirror image registry

Inference serving language models in OCI-compliant model containers

Inferencing OCI-compliant models in Red Hat AI Inference

Speculative decoding

Speculative decoding with Red Hat AI Inference

Inference serving Mistral 3 models

Inference serving Mistral 3 models with Red Hat AI Inference

Inference serving geospatial foundation models

Inference serving geospatial foundation models with Red Hat AI Inference

Red Hat AI Model Optimization Toolkit

Compressing large language models with the LLM Compressor library

vLLM server arguments

Server arguments for running Red Hat AI Inference

Extending Red Hat AI Inference with tool calling capabilities

Configuring tool calling and chat templates for AI Inference

Additional Resources

Red Hat AI Inference Server 3.3

Switch to the Red Hat AI Inference Server 3.3 documentation

Red Hat AI Foundations

Explore no-cost courses to boost your AI knowledge and get hands-on experience with Red Hat AI products while earning a certificate

Red Hat AI learning hub

Explore the curated set of third‑party models validated for Red Hat AI products, ready for fast, reliable deployment

Red Hat logoGithubredditYoutubeTwitter

詳細情報

試用、購入および販売

コミュニティー

会社概要

Red Hat は、企業がコアとなるデータセンターからネットワークエッジに至るまで、各種プラットフォームや環境全体で作業を簡素化できるように、強化されたソリューションを提供しています。

多様性を受け入れるオープンソースの強化

Red Hat では、コード、ドキュメント、Web プロパティーにおける配慮に欠ける用語の置き換えに取り組んでいます。このような変更は、段階的に実施される予定です。詳細情報: Red Hat ブログ.

Red Hat ドキュメントについて

Legal Notice

Theme

© 2026 Red Hat
トップに戻る