Chapter 6. Deploying Distributed Inference with llm-d with Helm


The Distributed Inference with llm-d Helm chart deploys a complete inference stack on OpenShift Container Platform or managed Kubernetes. On OpenShift Container Platform, the chart uses Operator Lifecycle Manager (OLM) to install and configure the required Operators automatically. On managed Kubernetes, the chart installs all dependencies directly.

The Helm chart uses a three-tier deployment model:

Red Hat OpenShift AI Operator
The chart installs the Red Hat OpenShift AI Operator through an OLM Subscription. The chart controls the DataScienceCluster and DSCInitialization custom resources that manage the inference stack lifecycle.
Helm chart components
The Helm chart includes the rhaii profile, which provides an inference-focused deployment. For distributed inference, the primary component is KServe, which provides the LLMInferenceService custom resource (CR) for deploying and managing inference services.
Operator dependencies
Each component declares the Operators it requires. KServe depends on cert-manager, LeaderWorkerSet, and Red Hat Connectivity Link. The Helm chart resolves these dependencies, including transitive dependencies, and installs each Operator through OLM automatically.
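
The dependency resolution described above can be sketched as a depth-first walk over a dependency graph. This is a minimal illustration, not the chart's actual implementation. The KServe entry mirrors the text, while the edge from Red Hat Connectivity Link to cert-manager, the lowercase component keys, and the function name resolve_install_order are assumptions made only to show how a transitive dependency would be handled.

```python
from typing import Dict, List, Set

# Direct dependencies per component. The "kserve" entry mirrors the text.
# The "red-hat-connectivity-link" -> "cert-manager" edge is HYPOTHETICAL and
# exists only to exercise the transitive case.
DEPENDENCIES: Dict[str, List[str]] = {
    "kserve": ["cert-manager", "leader-worker-set", "red-hat-connectivity-link"],
    "red-hat-connectivity-link": ["cert-manager"],
    "cert-manager": [],
    "leader-worker-set": [],
}


def resolve_install_order(component: str, graph: Dict[str, List[str]]) -> List[str]:
    """Return every Operator that `component` needs, dependencies first."""
    order: List[str] = []       # components in a safe install order
    visiting: Set[str] = set()  # components on the current walk, for cycle detection

    def visit(name: str) -> None:
        if name in order:
            return  # already scheduled through another path
        if name in visiting:
            raise ValueError(f"dependency cycle detected at {name!r}")
        visiting.add(name)
        for dep in graph.get(name, []):
            visit(dep)
        visiting.discard(name)
        order.append(name)

    visit(component)
    # The requested component is always appended last; return only its dependencies.
    return order[:-1]


if __name__ == "__main__":
    for operator in resolve_install_order("kserve", DEPENDENCIES):
        print(operator)
```

Each Operator appears once in the result even when more than one component requires it, and a dependency always precedes the component that needs it.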

Figure 6.1. Deploying Distributed Inference with llm-d with Helm charts

Helm chart deployment architecture showing the RHAI Operator and RHAI cloud controller manager components

1 The Red Hat AI Inference (RHAII) Helm chart packages, deploys, and configures the Red Hat AI (RHAI) Operator and the RHAI cloud controller manager. The RHAI Operator handles KServe and model serving, while the RHAI cloud controller manager handles the underlying cluster infrastructure.

2 The RHAI Operator manages the KServe Controller and reconciles custom resource definitions (CRDs).

3 The RHAI cloud controller manager manages Helm-based infrastructure components, including cert-manager, Gateway API, Istio, and LeaderWorkerSet.

4 The RHAI cloud controller manager configures the managed Kubernetes or OpenShift Container Platform cluster.
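
The split of responsibilities in the figure callouts can be summarized as a small lookup table. This is only an illustrative model of the descriptions above. The names MANAGED_BY and manager_for are hypothetical and do not correspond to identifiers in the Helm chart or the Operators.

```python
from typing import Dict, Optional

# Which manager handles which component, as stated in the figure callouts.
# The dictionary and function names are illustrative, not chart identifiers.
MANAGED_BY: Dict[str, str] = {
    "KServe Controller": "RHAI Operator",
    "cert-manager": "RHAI cloud controller manager",
    "Gateway API": "RHAI cloud controller manager",
    "Istio": "RHAI cloud controller manager",
    "LeaderWorkerSet": "RHAI cloud controller manager",
}


def manager_for(component: str) -> Optional[str]:
    """Return the manager responsible for `component`, or None if it is not listed."""
    return MANAGED_BY.get(component)


if __name__ == "__main__":
    for name in ("KServe Controller", "Istio"):
        print(f"{name}: {manager_for(name)}")
```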

© 2026 Red Hat