Chapter 1. Red Hat AI validated models

1.1. Red Hat AI validated models - October 2025 collection
复制链接

The following models, available from RedHat AI on Hugging Face, are validated for use with Red Hat AI Inference Server.

Expand

Table 1.1. Red Hat AI validated models - October 2025 collection
Model	Quantized variants	Hugging Face model cards	Validated on
gpt-oss-120b	None	Baseline	RHAIIS 3.2.2 RHOAI 2.25
gpt-oss-20b	None	Baseline	RHAIIS 3.2.2 RHOAI 2.25
NVIDIA-Nemotron-Nano-9B-v2	INT4, FP8	INT4 FP8	RHAIIS 3.2.2 RHOAI 2.25
Qwen3-Coder-480B-A35B-Instruct	FP8	FP8	RHAIIS 3.2.2 RHOAI 2.25
Voxtral-Mini-3B-2507	FP8	FP8	RHAIIS 3.2.2 RHOAI 2.25
whisper-large-v3-turbo	INT4	INT4	RHAIIS 3.2.2 RHOAI 2.25

1.2. Validated models on Hugging Face - September 2025 collection
复制链接

The following models, available from RedHat AI on Hugging Face, are validated for use with Red Hat AI Inference Server.

Expand

Table 1.2. Red Hat AI validated models - September 2025 collection
Model	Quantized variants	Hugging Face model cards	Validated on
DeepSeek-R1-0528	INT4	INT4	RHAIIS 3.2.1 RHOAI 2.24
gemma-3n-E4B-it	FP8	FP8	RHAIIS 3.2.1 RHOAI 2.24
Kimi-K2-Instruct	INT4	INT4	RHAIIS 3.2.1 RHOAI 2.24
Qwen3-8B	FP8	FP8	RHAIIS 3.2.1 RHOAI 2.24

1.3. Validated models on Hugging Face - May 2025 collection
复制链接

The following models, available from RedHat AI on Hugging Face, are validated for use with Red Hat AI Inference Server.

Expand

Table 1.3. Red Hat AI validated models - May 2025 collection
Model	Quantized variants	Hugging Face model cards	Validated on
gemma-2-9b-it	FP8	Baseline FP8	RHAIIS 3.0 RHELAI 1.5 RHOAI 2.20
granite-3.1-8b-base	INT4	INT4	RHAIIS 3.0 RHELAI 1.5 RHOAI 2.20
granite-3.1-8b-instruct	INT4, INT8, FP8	Baseline INT4 INT8 FP8	RHAIIS 3.0 RHELAI 1.5 RHOAI 2.20
Llama-3.1-8B-Instruct	None	Baseline	RHAIIS 3.0 RHELAI 1.5 RHOAI 2.20
Llama-3.1-Nemotron-70B-Instruct-HF	FP8	Baseline FP8	RHAIIS 3.0 RHELAI 1.5 RHOAI 2.20
Llama-3.3-70B-Instruct	INT4, INT8, FP8	Baseline INT4 INT8 FP8	RHAIIS 3.0 RHELAI 1.5 RHOAI 2.20
Llama-4-Maverick-17B-128E-Instruct	FP8	Baseline FP8	RHAIIS 3.0 RHELAI 1.5 RHOAI 2.20
Llama-4-Scout-17B-16E-Instruct	INT4, FP8	Baseline INT4 FP8	RHAIIS 3.0 RHELAI 1.5 RHOAI 2.20
Meta-Llama-3.1-8B-Instruct	INT4, INT8, FP8	INT4 INT8 FP8	RHAIIS 3.0 RHELAI 1.5 RHOAI 2.20
Mistral-Small-24B-Instruct-2501	INT4, INT8, FP8	Baseline INT4 INT8 FP8	RHAIIS 3.0 RHELAI 1.5 RHOAI 2.20
Mistral-Small-3.1-24B-Instruct-2503	INT4, INT8, FP8	Baseline INT4 INT8 FP8	RHAIIS 3.0 RHELAI 1.5 RHOAI 2.20
Mixtral-8x7B-Instruct-v0.1	None	Baseline	RHAIIS 3.0 RHELAI 1.5 RHOAI 2.20
phi-4	INT4, INT8, FP8	Baseline INT4 INT8 FP8	RHAIIS 3.0 RHELAI 1.5 RHOAI 2.20
Qwen2.5-7B-Instruct	INT4, INT8, FP8	Baseline INT4 INT8 FP8	RHAIIS 3.0 RHELAI 1.5 RHOAI 2.20

1.4. Validated OCI artifact model container images
复制链接

Expand

Table 1.4. Validated OCI artifact model container images
Model	Quantized variants	ModelCar images
llama-4-scout-17b-16e-instruct	INT4, FP8	Baseline: `registry.redhat.io/rhelai1/llama-4-scout-17b-16e-instruct:1.5` INT4: `registry.redhat.io/rhelai1/llama-4-scout-17b-16e-instruct-quantized-w4a16:1.5` FP8: `registry.redhat.io/rhelai1/llama-4-scout-17b-16e-instruct-fp8-dynamic:1.5`
llama-4-maverick-17b-128e-instruct	FP8	Baseline: `registry.redhat.io/rhelai1/llama-4-maverick-17b-128e-instruct:1.5` FP8: `registry.redhat.io/rhelai1/llama-4-maverick-17b-128e-instruct-fp8:1.5`
mistral-small-3-1-24b-instruct-2503	INT4, INT8, FP8	Baseline: `registry.redhat.io/rhelai1/mistral-small-3-1-24b-instruct-2503:1.5` INT4: `registry.redhat.io/rhelai1/mistral-small-3-1-24b-instruct-2503-quantized-w4a16:1.5` INT8: `registry.redhat.io/rhelai1/mistral-small-3-1-24b-instruct-2503-quantized-w8a8:1.5` FP8: `registry.redhat.io/rhelai1/mistral-small-3-1-24b-instruct-2503-fp8-dynamic:1.5`
llama-3-3-70b-instruct	INT4, INT8, FP8	Baseline: `registry.redhat.io/rhelai1/llama-3-3-70b-instruct:1.5` INT4: `registry.redhat.io/rhelai1/llama-3-3-70b-instruct-quantized-w4a16:1.5` INT8: `registry.redhat.io/rhelai1/llama-3-3-70b-instruct-quantized-w8a8:1.5` FP8: `registry.redhat.io/rhelai1/llama-3-3-70b-instruct-fp8-dynamic:1.5`
llama-3-1-8b-instruct	INT4, INT8, FP8	Baseline: `registry.redhat.io/rhelai1/llama-3-1-8b-instruct:1.5` INT4: `registry.redhat.io/rhelai1/llama-3-1-8b-instruct-quantized-w4a16:1.5` INT8: `registry.redhat.io/rhelai1/llama-3-1-8b-instruct-quantized-w8a8:1.5` FP8: `registry.redhat.io/rhelai1/llama-3-1-8b-instruct-fp8-dynamic:1.5`
granite-3-1-8b-instruct	INT4, INT8, FP8	Baseline: `registry.redhat.io/rhelai1/granite-3-1-8b-instruct:1.5` INT4: `registry.redhat.io/rhelai1/granite-3-1-8b-instruct-quantized-w4a16:1.5` INT8: `registry.redhat.io/rhelai1/granite-3-1-8b-instruct-quantized-w8a8:1.5` FP8: `registry.redhat.io/rhelai1/granite-3-1-8b-instruct-fp8-dynamic:1.5`
phi-4	INT4, INT8, FP8	Baseline: `registry.redhat.io/rhelai1/phi-4:1.5` INT4: `registry.redhat.io/rhelai1/phi-4-quantized-w4a16:1.5` INT8: `registry.redhat.io/rhelai1/phi-4-quantized-w8a8:1.5` FP8: `registry.redhat.io/rhelai1/phi-4-fp8-dynamic:1.5`
qwen2-5-7b-instruct	INT4, INT8, FP8	Baseline: `registry.redhat.io/rhelai1/qwen2-5-7b-instruct:1.5` INT4: `registry.redhat.io/rhelai1/qwen2-5-7b-instruct-quantized-w4a16:1.5` INT8: `registry.redhat.io/rhelai1/qwen2-5-7b-instruct-quantized-w8a8:1.5` FP8: `registry.redhat.io/rhelai1/qwen2-5-7b-instruct-fp8-dynamic:1.5`
mistral-small-24b-instruct-2501	INT4, INT8, FP8	Baseline: `registry.redhat.io/rhelai1/mistral-small-24b-instruct-2501:1.5` INT4: `registry.redhat.io/rhelai1/mistral-small-24b-instruct-2501-quantized-w4a16:1.5` INT8: `registry.redhat.io/rhelai1/mistral-small-24b-instruct-2501-quantized-w8a8:1.5` FP8: `registry.redhat.io/rhelai1/mistral-small-24b-instruct-2501-fp8-dynamic:1.5`
mixtral-8x7b-instruct-v0-1	None	Baseline: `registry.redhat.io/rhelai1/mixtral-8x7b-instruct-v0-1:1.4`
granite-3-1-8b-base	INT4 (baseline currently unavailable)	INT4: `registry.redhat.io/rhelai1/granite-3-1-8b-base-quantized-w4a16:1.5`
granite-3.1-8b-starter-v2	None	Baseline: `registry.redhat.io/rhelai1/granite-3.1-8b-starter-v2:1.5`
llama-3-1-nemotron-70b-instruct-hf	FP8	Baseline: `registry.redhat.io/rhelai1/llama-3-1-nemotron-70b-instruct-hf:1.5` FP8: `registry.redhat.io/rhelai1/llama-3-1-nemotron-70b-instruct-hf-fp8-dynamic:1.5`
gemma-2-9b-it	FP8	Baseline: `registry.redhat.io/rhelai1/gemma-2-9b-it:1.5` FP8: `registry.redhat.io/rhelai1/gemma-2-9b-it-fp8:1.5`
deepseek-r1-0528	INT4 (baseline currently unavailable)	INT4: `registry.redhat.io/rhelai1/deepseek-r1-0528-quantized-w4a16:1.5`
qwen3-8b	FP8 (baseline currently unavailable)	FP8: `registry.redhat.io/rhelai1/qwen3-8b-fp8-dynamic:1.5`
kimi-k2-instruct	INT4 (baseline currently unavailable)	INT4: `registry.redhat.io/rhelai1/kimi-k2-instruct-quantized-w4a16:1.5`
gemma-3n-e4b-it	FP8 (baseline currently unavailable)	FP8: `registry.redhat.io/rhelai1/gemma-3n-e4b-it-fp8-dynamic:1.5`
gpt-oss-120b	None	Baseline: `registry.redhat.io/rhelai1/gpt-oss-120b:1.5`
gpt-oss-20b	None	Baseline: `registry.redhat.io/rhelai1/gpt-oss-20b:1.5`
qwen3-coder-480b-a35b-instruct	FP8 (baseline currently unavailable)	FP8: `registry.redhat.io/rhelai1/qwen3-coder-480b-a35b-instruct-fp8:1.5`
whisper-large-v3-turbo	INT4 (baseline currently unavailable)	INT4: `registry.redhat.io/rhelai1/whisper-large-v3-turbo-quantized-w4a16:1.5`
voxtral-mini-3b-2507	FP8 (baseline currently unavailable)	FP8: `registry.redhat.io/rhelai1/voxtral-mini-3b-2507-fp8-dynamic:1.5`
nvidia-nemotron-nano-9b-v2	FP8 (baseline currently unavailable)	FP8: `registry.redhat.io/rhelai1/nvidia-nemotron-nano-9b-v2-fp8-dynamic:1.5`

1.5. ModelCar container images
复制链接

Expand

Table 1.5. ModelCar container images
Model	Quantized variants	ModelCar images
llama-4-scout-17b-16e-instruct	INT4, FP8	Baseline: `registry.redhat.io/rhelai1/modelcar-llama-4-scout-17b-16e-instruct:1.5` INT4: `registry.redhat.io/rhelai1/modelcar-llama-4-scout-17b-16e-instruct-quantized-w4a16:1.5` FP8: `registry.redhat.io/rhelai1/modelcar-llama-4-scout-17b-16e-instruct-fp8-dynamic:1.5`
llama-4-maverick-17b-128e-instruct	FP8	Baseline: `registry.redhat.io/rhelai1/modelcar-llama-4-maverick-17b-128e-instruct:1.5` FP8: `registry.redhat.io/rhelai1/modelcar-llama-4-maverick-17b-128e-instruct-fp8:1.5`
mistral-small-3-1-24b-instruct-2503	INT4, INT8, FP8	Baseline: `registry.redhat.io/rhelai1/modelcar-mistral-small-3-1-24b-instruct-2503:1.5` INT4: `registry.redhat.io/rhelai1/modelcar-mistral-small-3-1-24b-instruct-2503-quantized-w4a16:1.5` INT8: `registry.redhat.io/rhelai1/modelcar-mistral-small-3-1-24b-instruct-2503-quantized-w8a8:1.5` FP8: `registry.redhat.io/rhelai1/modelcar-mistral-small-3-1-24b-instruct-2503-fp8-dynamic:1.5`
llama-3-3-70b-instruct	INT4, INT8, FP8	Baseline: `registry.redhat.io/rhelai1/modelcar-llama-3-3-70b-instruct:1.5` INT4: `registry.redhat.io/rhelai1/modelcar-llama-3-3-70b-instruct-quantized-w4a16:1.5` INT8: `registry.redhat.io/rhelai1/modelcar-llama-3-3-70b-instruct-quantized-w8a8:1.5` FP8: `registry.redhat.io/rhelai1/modelcar-llama-3-3-70b-instruct-fp8-dynamic:1.5`
llama-3-1-8b-instruct	INT4, INT8, FP8	Baseline: `registry.redhat.io/rhelai1/modelcar-llama-3-1-8b-instruct:1.5` INT4: `registry.redhat.io/rhelai1/modelcar-llama-3-1-8b-instruct-quantized-w4a16:1.5` INT8: `registry.redhat.io/rhelai1/modelcar-llama-3-1-8b-instruct-quantized-w8a8:1.5` FP8: `registry.redhat.io/rhelai1/modelcar-llama-3-1-8b-instruct-fp8-dynamic:1.5`
granite-3-1-8b-instruct	INT4, INT8, FP8	Baseline: `registry.redhat.io/rhelai1/modelcar-granite-3-1-8b-instruct:1.5` INT4: `registry.redhat.io/rhelai1/modelcar-granite-3-1-8b-instruct-quantized-w4a16:1.5` INT8: `registry.redhat.io/rhelai1/modelcar-granite-3-1-8b-instruct-quantized-w8a8:1.5` FP8: `registry.redhat.io/rhelai1/modelcar-granite-3-1-8b-instruct-fp8-dynamic:1.5`
phi-4	INT4, INT8, FP8	Baseline: `registry.redhat.io/rhelai1/modelcar-phi-4:1.5` INT4: `registry.redhat.io/rhelai1/modelcar-phi-4-quantized-w4a16:1.5` INT8: `registry.redhat.io/rhelai1/modelcar-phi-4-quantized-w8a8:1.5` FP8: `registry.redhat.io/rhelai1/modelcar-phi-4-fp8-dynamic:1.5`
qwen2-5-7b-instruct	INT4, INT8, FP8	Baseline: `registry.redhat.io/rhelai1/modelcar-qwen2-5-7b-instruct:1.5` INT4: `registry.redhat.io/rhelai1/modelcar-qwen2-5-7b-instruct-quantized-w4a16:1.5` INT8: `registry.redhat.io/rhelai1/modelcar-qwen2-5-7b-instruct-quantized-w8a8:1.5` FP8: `registry.redhat.io/rhelai1/modelcar-qwen2-5-7b-instruct-fp8-dynamic:1.5`
mistral-small-24b-instruct-2501	INT4, INT8, FP8	Baseline: `registry.redhat.io/rhelai1/modelcar-mistral-small-24b-instruct-2501:1.5` INT4: `registry.redhat.io/rhelai1/modelcar-mistral-small-24b-instruct-2501-quantized-w4a16:1.5` INT8: `registry.redhat.io/rhelai1/modelcar-mistral-small-24b-instruct-2501-quantized-w8a8:1.5` FP8: `registry.redhat.io/rhelai1/modelcar-mistral-small-24b-instruct-2501-fp8-dynamic:1.5`
mixtral-8x7b-instruct-v0-1	None	Baseline: `registry.redhat.io/rhelai1/modelcar-mixtral-8x7b-instruct-v0-1:1.4`
granite-3-1-8b-base	INT4 (baseline currently unavailable)	INT4: `registry.redhat.io/rhelai1/modelcar-granite-3-1-8b-base-quantized-w4a16:1.5`
granite-3-1-8b-starter-v2	None	Baseline: `registry.redhat.io/rhelai1/modelcar-granite-3-1-8b-starter-v2:1.5`
llama-3-1-nemotron-70b-instruct-hf	FP8	Baseline: `registry.redhat.io/rhelai1/modelcar-llama-3-1-nemotron-70b-instruct-hf:1.5` FP8: `registry.redhat.io/rhelai1/modelcar-llama-3-1-nemotron-70b-instruct-hf-fp8-dynamic:1.5`
gemma-2-9b-it	FP8	Baseline: `registry.redhat.io/rhelai1/modelcar-gemma-2-9b-it:1.5` FP8: `registry.redhat.io/rhelai1/modelcar-gemma-2-9b-it-fp8:1.5`
deepseek-r1-0528	INT4 (baseline currently unavailable)	INT4: `registry.redhat.io/rhelai1/modelcar-deepseek-r1-0528-quantized-w4a16:1.5`
qwen3-8b	FP8 (baseline currently unavailable)	FP8: `registry.redhat.io/rhelai1/modelcar-qwen3-8b-fp8-dynamic:1.5`
kimi-k2-instruct	INT4 (baseline currently unavailable)	INT4: `registry.redhat.io/rhelai1/modelcar-kimi-k2-instruct-quantized-w4a16:1.5`
gemma-3n-e4b-it	FP8 (baseline currently unavailable)	FP8: `registry.redhat.io/rhelai1/modelcar-gemma-3n-e4b-it-fp8-dynamic:1.5`
gpt-oss-120b	None	Baseline: `registry.redhat.io/rhelai1/modelcar-gpt-oss-120b:1.5`
gpt-oss-20b	None	Baseline: `registry.redhat.io/rhelai1/modelcar-gpt-oss-20b:1.5`
qwen3-coder-480b-a35b-instruct	FP8 (baseline currently unavailable)	FP8: `registry.redhat.io/rhelai1/modelcar-qwen3-coder-480b-a35b-instruct-fp8:1.5`
whisper-large-v3-turbo	INT4 (baseline currently unavailable)	INT4: `registry.redhat.io/rhelai1/modelcar-whisper-large-v3-turbo-quantized-w4a16:1.5`
voxtral-mini-3b-2507	FP8 (baseline currently unavailable)	FP8: `registry.redhat.io/rhelai1/modelcar-voxtral-mini-3b-2507-fp8-dynamic:1.5`
nvidia-nemotron-nano-9b-v2	FP8 (baseline currently unavailable)	FP8: `registry.redhat.io/rhelai1/modelcar-nvidia-nemotron-nano-9b-v2-fp8-dynamic:1.5`

此内容没有您所选择的语言版本。

1.1. Red Hat AI validated models - October 2025 collection
复制链接

1.2. Validated models on Hugging Face - September 2025 collection
复制链接

1.3. Validated models on Hugging Face - May 2025 collection
复制链接

1.4. Validated OCI artifact model container images
复制链接

1.5. ModelCar container images
复制链接

学习

尝试、购买和销售

社区

关于红帽文档

让开源更具包容性

關於紅帽

Theme

Red Hat legal and privacy links

Red Hat legal and privacy links

此内容没有您所选择的语言版本。

Chapter 1. Red Hat AI validated models

1.1. Red Hat AI validated models - October 2025 collection复制链接链接已复制到粘贴板!

1.2. Validated models on Hugging Face - September 2025 collection复制链接链接已复制到粘贴板!

1.3. Validated models on Hugging Face - May 2025 collection复制链接链接已复制到粘贴板!

1.4. Validated OCI artifact model container images复制链接链接已复制到粘贴板!

1.5. ModelCar container images复制链接链接已复制到粘贴板!

学习

尝试、购买和销售

社区

关于红帽文档

让开源更具包容性

關於紅帽

Theme

Red Hat legal and privacy links

Red Hat legal and privacy links

1.1. Red Hat AI validated models - October 2025 collection
复制链接

1.2. Validated models on Hugging Face - September 2025 collection
复制链接

1.3. Validated models on Hugging Face - May 2025 collection
复制链接

1.4. Validated OCI artifact model container images
复制链接

1.5. ModelCar container images
复制链接