Chapter 1. Red Hat Enterprise Linux AI hardware requirements


Hardware requirements on Red Hat Enterprise Linux AI vary by accelerator and by task: serving and inferencing a model, or installing, generating data for, and training the Granite starter model.

1.1. Hardware requirements for end-to-end workflow of Granite models

The following tables list the hardware requirements for running the full InstructLab end-to-end workflow to customize the Granite student model. The workflow includes synthetic data generation (SDG), multi-phase training, and evaluation of a custom Granite model.

1.1.1. Bare metal

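| Hardware vendor | Supported accelerators (GPUs) | Aggregate GPU memory | Recommended additional disk storage |
|---|---|---|---|
| NVIDIA | 2xA100 | 160 GB | 3 TB |
| NVIDIA | 4xA100 | 320 GB | 3 TB |
| NVIDIA | 8xA100 | 640 GB | 3 TB |
| NVIDIA | 2xH100 | 160 GB | 3 TB |
| NVIDIA | 4xH100 | 320 GB | 3 TB |
| NVIDIA | 8xH100 | 640 GB | 3 TB |
| NVIDIA | 2xH200 | 282 GB | 3 TB |
| NVIDIA | 4xH200 | 564 GB | 3 TB |
| NVIDIA | 8xH200 | 1128 GB | 3 TB |
| NVIDIA | 4xL40S | 192 GB | 3 TB |
| NVIDIA | 8xL40S | 384 GB | 3 TB |
| AMD | 2xMI300X | 384 GB | 3 TB |
| AMD | 4xMI300X | 768 GB | 3 TB |
| AMD | 8xMI300X | 1536 GB | 3 TB |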
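
The aggregate figures in the bare metal table are the accelerator count multiplied by the memory of a single accelerator. The following Python sketch reproduces that arithmetic. The per-accelerator values are inferred by dividing the table's aggregates by the counts, and the names `PER_GPU_MEMORY_GB` and `aggregate_gpu_memory_gb` are illustrative, not part of any RHEL AI tooling. Cloud instances can use other memory variants: the AWS `p4d.24xlarge` row later in this chapter lists 8xA100 at 320 GB, which implies 40 GB per accelerator.

```python
# Per-accelerator memory in GB, inferred from the bare metal table
# (aggregate memory divided by accelerator count). These names are
# hypothetical helpers for illustration only.
PER_GPU_MEMORY_GB = {
    "A100": 80,
    "H100": 80,
    "H200": 141,
    "L40S": 48,
    "MI300X": 192,
}


def aggregate_gpu_memory_gb(config: str) -> int:
    """Return aggregate GPU memory for a string such as '8xH200'."""
    # Split on the first lowercase "x": "8xMI300X" -> ("8", "x", "MI300X").
    count_text, _, model = config.partition("x")
    if not count_text.isdigit() or model not in PER_GPU_MEMORY_GB:
        raise ValueError(f"unrecognized accelerator configuration: {config!r}")
    return int(count_text) * PER_GPU_MEMORY_GB[model]


if __name__ == "__main__":
    for config in ("2xA100", "8xH200", "4xL40S", "8xMI300X"):
        print(f"{config}: {aggregate_gpu_memory_gb(config)} GB")
```

For example, `aggregate_gpu_memory_gb("8xH200")` evaluates 8 × 141 and returns 1128, matching the table.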

1.1.2. IBM Cloud

| Hardware vendor | Supported accelerators (GPUs) | Aggregate GPU memory | IBM Cloud instances | Recommended additional disk storage |
|---|---|---|---|---|
| NVIDIA | 2xA100 | 160 GB | gx3d-48x240x2a100p | 3 TB |
| NVIDIA | 8xH100 | 640 GB | gx3d-160x1792x8h100 | 3 TB |
| NVIDIA | 8xH200 | 1128 GB | gx3d-160x1792x8h200 | 3 TB |
| AMD | 8xMI300X | 1536 GB | gx3d-208x1792x8mi300x | 3 TB |

1.1.3. Amazon Web Services (AWS)

| Hardware vendor | Supported accelerators (GPUs) | Aggregate GPU memory | AWS instances | Recommended additional disk storage |
|---|---|---|---|---|
| NVIDIA | 8xA100 | 320 GB | p4d.24xlarge | 3 TB |
| NVIDIA | 8xA100 | 640 GB | p4de.24xlarge | 3 TB |
| NVIDIA | 8xH100 | 640 GB | p5.48xlarge | 3 TB |
| NVIDIA | 8xL40S | 384 GB | g6e.48xlarge | 3 TB |

1.1.4. Azure

| Hardware vendor | Supported accelerators (GPUs) | Aggregate GPU memory | Azure instances | Recommended additional disk storage |
|---|---|---|---|---|
| NVIDIA | 8xA100 | 640 GB | Standard_ND96amsr_A100_v4 | 3 TB |
| NVIDIA | 4xA100 | 320 GB | Standard_ND96asr_A100_v4 | 3 TB |
| NVIDIA | 8xH100 | 640 GB | Standard_ND96isr_H100_v5 | 3 TB |
| AMD | 8xMI300X | 1536 GB | Standard_ND96is_MI300X_v5 | 3 TB |

1.1.5. Google Cloud Platform (GCP)

| Hardware vendor | Supported accelerators (GPUs) | Aggregate GPU memory | GCP instances | Recommended additional disk storage |
|---|---|---|---|---|
| NVIDIA | 8xA100 | 640 GB | a2-highgpu-8g | 3 TB |
| NVIDIA | 8xH100 | 640 GB | a3-highgpu-8g, a3-megagpu-8g | 3 TB |
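
Every end-to-end configuration in this section recommends 3 TB of additional disk storage. As a rough pre-flight check, the following Python sketch compares the free space on a filesystem against that figure by using the standard library's `shutil.disk_usage`. It rests on several assumptions: that "3 TB" means decimal terabytes, that free space is the relevant measure, and that the path you pass is the mount that will hold models and generated data. The function name `has_recommended_storage` is hypothetical.

```python
import shutil

# Recommended additional disk storage for the end-to-end workflow, taken
# from the tables in this section. Decimal terabytes are an assumption;
# the source does not say whether TB means 10**12 or 2**40 bytes.
RECOMMENDED_BYTES = 3 * 10**12


def has_recommended_storage(path: str, required_bytes: int = RECOMMENDED_BYTES) -> bool:
    """Return True if the filesystem containing `path` has enough free space."""
    usage = shutil.disk_usage(path)
    return usage.free >= required_bytes


if __name__ == "__main__":
    # "/" is a placeholder; point this at the mount that will hold models
    # and generated data on your system.
    ok = has_recommended_storage("/")
    print("meets 3 TB recommendation" if ok else "below 3 TB recommendation")
```

For the inference serving tables later in this chapter, the same check could be called with `required_bytes=1 * 10**12` to reflect their 1 TB recommendation.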

1.2. Hardware requirements for inference serving Granite models

The following tables list the minimum hardware requirements for serving a model for inference on Red Hat Enterprise Linux AI.

1.2.1. Bare metal

| Hardware vendor | Supported accelerators (GPUs) | Minimum aggregate GPU memory | Recommended additional disk storage |
|---|---|---|---|
| NVIDIA | A100 | 80 GB | 1 TB |
| NVIDIA | H100 | 80 GB | 1 TB |
| NVIDIA | H200 | 141 GB | 1 TB |
| NVIDIA | GH200 (Technology Preview) | 192 GB | 1 TB |
| NVIDIA | L40S | 48 GB | 1 TB |
| NVIDIA | L4 | 24 GB | 1 TB |
| AMD | MI300X | 192 GB | 1 TB |
| Intel | Gaudi 3 (Technology Preview) | 128 GB | 1 TB |
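
The bare metal minimums above can be encoded as a simple lookup to validate a planned inference host. The following Python sketch is one way to do that. The dictionary values are copied from the table, while the names `MIN_INFERENCE_MEMORY_GB`, `TECHNOLOGY_PREVIEW`, and `check_inference_host` are illustrative and not part of any RHEL AI tooling. The cloud tables that follow list their own minimums (for example, A100 at 40 GB on AWS), so this lookup applies to bare metal only.

```python
# Minimum aggregate GPU memory (GB) for inference serving on bare metal,
# copied from the table above. All names here are hypothetical.
MIN_INFERENCE_MEMORY_GB = {
    "A100": 80,
    "H100": 80,
    "H200": 141,
    "GH200": 192,
    "L40S": 48,
    "L4": 24,
    "MI300X": 192,
    "Gaudi 3": 128,
}

# Accelerators the table marks as Technology Preview.
TECHNOLOGY_PREVIEW = {"GH200", "Gaudi 3"}


def check_inference_host(accelerator: str, aggregate_memory_gb: float) -> str:
    """Describe whether a host meets the bare metal inference minimum."""
    minimum = MIN_INFERENCE_MEMORY_GB.get(accelerator)
    if minimum is None:
        return "not listed"
    if aggregate_memory_gb < minimum:
        return f"below minimum ({minimum} GB required)"
    if accelerator in TECHNOLOGY_PREVIEW:
        return "meets minimum (Technology Preview)"
    return "meets minimum"


if __name__ == "__main__":
    for name, memory in (("L4", 24), ("A100", 40), ("GH200", 192), ("T4", 16)):
        print(f"{name} with {memory} GB: {check_inference_host(name, memory)}")
```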

1.2.2. Amazon Web Services (AWS)

| Hardware vendor | Supported accelerators (GPUs) | Minimum aggregate GPU memory | AWS instance family | Recommended additional disk storage |
|---|---|---|---|---|
| NVIDIA | A100 | 40 GB | P4d series | 1 TB |
| NVIDIA | H100 | 80 GB | P5 series | 1 TB |
| NVIDIA | L40S | 48 GB | G6e series | 1 TB |
| NVIDIA | L4 | 24 GB | G6 series | 1 TB |

1.2.3. IBM Cloud

| Hardware vendor | Supported accelerators (GPUs) | Minimum aggregate GPU memory | IBM Cloud instance family | Recommended additional disk storage |
|---|---|---|---|---|
| NVIDIA | L4 | 24 GB | gx3 series | 1 TB |
| NVIDIA | L40S | 48 GB | gx3 series | 1 TB |
| NVIDIA | A100 | 80 GB | gx3 series | 1 TB |
| NVIDIA | H100 | 80 GB | gx3 series | 1 TB |
| NVIDIA | H200 | 141 GB | gx3 series | 1 TB |
| AMD | MI300X | 192 GB | gx3 series | 1 TB |
| Intel | Gaudi 3 (Technology Preview) | 128 GB | gx3 series | 1 TB |

1.2.4. Azure

| Hardware vendor | Supported accelerators (GPUs) | Minimum aggregate GPU memory | Azure instance family | Recommended additional disk storage |
|---|---|---|---|---|
| NVIDIA | A100 | 80 GB | ND series | 1 TB |
| NVIDIA | H100 | 80 GB | ND series | 1 TB |
| AMD | MI300X | 192 GB | ND series | 1 TB |

1.2.5. Google Cloud Platform (GCP)

| Hardware vendor | Supported accelerators (GPUs) | Minimum aggregate GPU memory | GCP instance family | Recommended additional disk storage |
|---|---|---|---|---|
| NVIDIA | A100 | 40 GB | A2 series | 1 TB |
| NVIDIA | H100 | 80 GB | A3 series | 1 TB |
| NVIDIA | 4xL4 | 96 GB | G2 series | 1 TB |

© 2025 Red Hat