Chapter 5. Supported AI accelerators for RHEL AI


The following AI accelerators are supported for inference serving with Red Hat AI Inference Server on RHEL AI.

Important

Bare metal deployments of RHEL AI are supported for all NVIDIA CUDA and AMD ROCm AI accelerators listed in Supported AI accelerators for Red Hat AI Inference Server.

Actual requirements vary based on the specific models you deploy, quantization methods, context lengths, and concurrent request loads. Aggregate GPU memory refers to the total GPU memory available across all GPUs in the system that can be used for tensor parallelism or pipeline parallelism.

For more information about inference serving on bare metal or cloud platforms, see Red Hat Enterprise Linux AI.
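The aggregate GPU memory figure described above is simple arithmetic: the number of GPUs multiplied by the memory of each GPU. For example, a system with four 24 GB L4 GPUs provides 96 GB. The following Python sketch shows that calculation, together with a rough weights-only estimate (parameter count multiplied by bytes per parameter). The function names and the example model sizes are hypothetical and are not part of any Red Hat tooling. The estimate ignores KV cache, activations, and runtime overhead, all of which grow with context length and concurrent request load, so treat a "fits" result as a lower bound rather than a sizing guarantee.

```python
"""Rough sizing arithmetic for aggregate GPU memory (illustrative sketch only)."""


def aggregate_gpu_memory_gb(gpu_count: int, memory_per_gpu_gb: float) -> float:
    """Return the total memory across all GPUs in one system, in GB."""
    if gpu_count < 1 or memory_per_gpu_gb <= 0:
        raise ValueError("gpu_count must be >= 1 and memory_per_gpu_gb must be > 0")
    return gpu_count * memory_per_gpu_gb


def estimate_weight_memory_gb(params_billions: float, bytes_per_param: float) -> float:
    """Estimate memory for the model weights alone, in GB.

    Assumption: 1 GB = 1e9 bytes, so (params_billions * 1e9 * bytes) / 1e9
    simplifies to params_billions * bytes_per_param.
    KV cache, activations, and runtime overhead are NOT included.
    """
    if params_billions <= 0 or bytes_per_param <= 0:
        raise ValueError("params_billions and bytes_per_param must be > 0")
    return params_billions * bytes_per_param


def weights_fit(params_billions: float, bytes_per_param: float,
                aggregate_gb: float) -> bool:
    """Return True if the weights alone fit in the aggregate GPU memory."""
    return estimate_weight_memory_gb(params_billions, bytes_per_param) <= aggregate_gb


if __name__ == "__main__":
    # Four L4 GPUs at 24 GB each.
    total = aggregate_gpu_memory_gb(gpu_count=4, memory_per_gpu_gb=24)
    print(f"Aggregate GPU memory: {total} GB")

    # Hypothetical 70-billion-parameter model at 16-bit (2 bytes) and
    # 4-bit (0.5 bytes) precision.
    for label, bytes_per_param in (("16-bit", 2.0), ("4-bit", 0.5)):
        fits = weights_fit(70, bytes_per_param, total)
        print(f"70B weights at {label} fit in {total} GB: {fits}")
```

Splitting a model across GPUs with tensor or pipeline parallelism is what makes the aggregate figure usable, so a model that does not fit on a single accelerator may still fit on a multi-GPU instance.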

Important

The recommended minimum additional disk storage for all platforms is 1 TB.

Table 5.1. Supported AI accelerators for Amazon Web Services (AWS) deployments
NVIDIA AI accelerator                   Aggregate GPU memory   AWS instance family
GB200                                   384 GB                 P6e series
B200                                    192 GB                 P6 series
RTX PRO 6000 Blackwell Server Edition   96 GB                  G7e series
H100                                    80 GB                  P5 series
L40S                                    48 GB                  G6e series
A100                                    40 GB                  P4d series
L4                                      24 GB                  G6 series

Table 5.2. Supported AI accelerators for IBM Cloud deployments
NVIDIA AI accelerator   Aggregate GPU memory   IBM Cloud instance family
H200                    141 GB                 gx3 series
H100                    80 GB                  gx3 series
A100                    80 GB                  gx3 series
L40S                    48 GB                  gx3 series
L4                      24 GB                  gx3 series

Table 5.3. Supported AI accelerators for Microsoft Azure deployments
AI accelerator        Aggregate GPU memory   Azure instance family
NVIDIA GB200          384 GB                 ND series
AMD Instinct MI300X   192 GB                 ND series
NVIDIA H100           80 GB                  ND series
NVIDIA A100           80 GB                  ND series
AMD Instinct MI210    64 GB                  ND series

Table 5.4. Supported AI accelerators for Google Cloud deployments
NVIDIA AI accelerator   Aggregate GPU memory   Google Cloud instance family
GB200                   384 GB                 A4X series
B200                    192 GB                 A4 series
4xL4                    96 GB                  G2 series
H100                    80 GB                  A3 series
A100                    40 GB                  A2 series

© 2026 Red Hat