Llama-4-Scout-17B-16E-Instruct
|
INT4, FP8
|
|
Baseline:
registry.redhat.io/rhelai1/llama-4-scout-17b-16e-instruct:1.5
INT4:
registry.redhat.io/rhelai1/llama-4-scout-17b-16e-instruct-quantized-w4a16:1.5
FP8:
registry.redhat.io/rhelai1/llama-4-scout-17b-16e-instruct-fp8-dynamic:1.5
|
Baseline:
registry.redhat.io/rhelai1/modelcar-llama-4-scout-17b-16e-instruct:1.5
INT4:
registry.redhat.io/rhelai1/modelcar-llama-4-scout-17b-16e-instruct-quantized-w4a16:1.5
FP8:
registry.redhat.io/rhelai1/modelcar-llama-4-scout-17b-16e-instruct-fp8-dynamic:1.5
|
Llama-4-Maverick-17B-128E-Instruct
|
FP8
|
|
|
|
Mistral-Small-3.1-24B-Instruct-2503
|
INT4, INT8, FP8
|
|
Baseline:
registry.redhat.io/rhelai1/mistral-small-3-1-24b-instruct-2503:1.5
INT4:
registry.redhat.io/rhelai1/mistral-small-3-1-24b-instruct-2503-quantized-w4a16:1.5
INT8:
registry.redhat.io/rhelai1/mistral-small-3-1-24b-instruct-2503-quantized-w8a8:1.5
FP8:
registry.redhat.io/rhelai1/mistral-small-3-1-24b-instruct-2503-fp8-dynamic:1.5
|
Baseline:
registry.redhat.io/rhelai1/modelcar-mistral-small-3-1-24b-instruct-2503:1.5
INT4:
registry.redhat.io/rhelai1/modelcar-mistral-small-3-1-24b-instruct-2503-quantized-w4a16:1.5
INT8:
registry.redhat.io/rhelai1/modelcar-mistral-small-3-1-24b-instruct-2503-quantized-w8a8:1.5
FP8:
registry.redhat.io/rhelai1/modelcar-mistral-small-3-1-24b-instruct-2503-fp8-dynamic:1.5
|
Llama-3.3-70B-Instruct
|
INT4, INT8, FP8
|
|
Baseline:
registry.redhat.io/rhelai1/llama-3-3-70b-instruct:1.5
INT4:
registry.redhat.io/rhelai1/llama-3-3-70b-instruct-quantized-w4a16:1.5
INT8:
registry.redhat.io/rhelai1/llama-3-3-70b-instruct-quantized-w8a8:1.5
FP8:
registry.redhat.io/rhelai1/llama-3-3-70b-instruct-fp8-dynamic:1.5
|
Baseline:
registry.redhat.io/rhelai1/modelcar-llama-3-3-70b-instruct:1.5
INT4:
registry.redhat.io/rhelai1/modelcar-llama-3-3-70b-instruct-quantized-w4a16:1.5
INT8:
registry.redhat.io/rhelai1/modelcar-llama-3-3-70b-instruct-quantized-w8a8:1.5
FP8:
registry.redhat.io/rhelai1/modelcar-llama-3-3-70b-instruct-fp8-dynamic:1.5
|
Llama-3.1-8B-Instruct
|
INT4, INT8, FP8
|
|
Baseline:
registry.redhat.io/rhelai1/llama-3-1-8b-instruct:1.5
INT4:
registry.redhat.io/rhelai1/llama-3-1-8b-instruct-quantized-w4a16:1.5
INT8:
registry.redhat.io/rhelai1/llama-3-1-8b-instruct-quantized-w8a8:1.5
FP8:
registry.redhat.io/rhelai1/llama-3-1-8b-instruct-fp8-dynamic:1.5
|
Baseline:
registry.redhat.io/rhelai1/modelcar-llama-3-1-8b-instruct:1.5
INT4:
registry.redhat.io/rhelai1/modelcar-llama-3-1-8b-instruct-quantized-w4a16:1.5
INT8:
registry.redhat.io/rhelai1/modelcar-llama-3-1-8b-instruct-quantized-w8a8:1.5
FP8:
registry.redhat.io/rhelai1/modelcar-llama-3-1-8b-instruct-fp8-dynamic:1.5
|
granite-3.1-8b-instruct
|
INT4, INT8, FP8
|
|
Baseline:
registry.redhat.io/rhelai1/granite-3-1-8b-instruct:1.5
INT4:
registry.redhat.io/rhelai1/granite-3-1-8b-instruct-quantized-w4a16:1.5
INT8:
registry.redhat.io/rhelai1/granite-3-1-8b-instruct-quantized-w8a8:1.5
FP8:
registry.redhat.io/rhelai1/granite-3-1-8b-instruct-fp8-dynamic:1.5
|
Baseline:
registry.redhat.io/rhelai1/modelcar-granite-3-1-8b-instruct:1.5
INT4:
registry.redhat.io/rhelai1/modelcar-granite-3-1-8b-instruct-quantized-w4a16:1.5
INT8:
registry.redhat.io/rhelai1/modelcar-granite-3-1-8b-instruct-quantized-w8a8:1.5
FP8:
registry.redhat.io/rhelai1/modelcar-granite-3-1-8b-instruct-fp8-dynamic:1.5
|
phi-4
|
INT4, INT8, FP8
|
|
Baseline:
registry.redhat.io/rhelai1/phi-4:1.5
INT4:
registry.redhat.io/rhelai1/phi-4-quantized-w4a16:1.5
INT8:
registry.redhat.io/rhelai1/phi-4-quantized-w8a8:1.5
FP8:
registry.redhat.io/rhelai1/phi-4-fp8-dynamic:1.5
|
Baseline:
registry.redhat.io/rhelai1/modelcar-phi-4:1.5
INT4:
registry.redhat.io/rhelai1/modelcar-phi-4-quantized-w4a16:1.5
INT8:
registry.redhat.io/rhelai1/modelcar-phi-4-quantized-w8a8:1.5
FP8:
registry.redhat.io/rhelai1/modelcar-phi-4-fp8-dynamic:1.5
|
Qwen2.5-7B-Instruct
|
INT4, INT8, FP8
|
|
Baseline:
registry.redhat.io/rhelai1/qwen2-5-7b-instruct:1.5
INT4:
registry.redhat.io/rhelai1/qwen2-5-7b-instruct-quantized-w4a16:1.5
INT8:
registry.redhat.io/rhelai1/qwen2-5-7b-instruct-quantized-w8a8:1.5
FP8:
registry.redhat.io/rhelai1/qwen2-5-7b-instruct-fp8-dynamic:1.5
|
Baseline:
registry.redhat.io/rhelai1/modelcar-qwen2-5-7b-instruct:1.5
INT4:
registry.redhat.io/rhelai1/modelcar-qwen2-5-7b-instruct-quantized-w4a16:1.5
INT8:
registry.redhat.io/rhelai1/modelcar-qwen2-5-7b-instruct-quantized-w8a8:1.5
FP8:
registry.redhat.io/rhelai1/modelcar-qwen2-5-7b-instruct-fp8-dynamic:1.5
|
Mistral-Small-24B-Instruct-2501
|
INT4, INT8, FP8
|
|
Baseline:
registry.redhat.io/rhelai1/mistral-small-24b-instruct-2501:1.5
INT4:
registry.redhat.io/rhelai1/mistral-small-24b-instruct-2501-quantized-w4a16:1.5
INT8:
registry.redhat.io/rhelai1/mistral-small-24b-instruct-2501-quantized-w8a8:1.5
FP8:
registry.redhat.io/rhelai1/mistral-small-24b-instruct-2501-fp8-dynamic:1.5
|
Baseline:
registry.redhat.io/rhelai1/modelcar-mistral-small-24b-instruct-2501:1.5
INT4:
registry.redhat.io/rhelai1/modelcar-mistral-small-24b-instruct-2501-quantized-w4a16:1.5
INT8:
registry.redhat.io/rhelai1/modelcar-mistral-small-24b-instruct-2501-quantized-w8a8:1.5
FP8:
registry.redhat.io/rhelai1/modelcar-mistral-small-24b-instruct-2501-fp8-dynamic:1.5
|
Mixtral-8x7B-Instruct-v0.1
|
None
|
|
|
|
granite-3.1-8b-base
|
INT4 (baseline currently unavailable)
|
|
|
|
granite-3.1-8b-starter-v2
|
None
| -
Unavailable on Hugging Face
|
|
|
Llama-3.1-Nemotron-70B-Instruct-HF
|
FP8
|
|
|
|
gemma-2-9b-it
|
FP8
|
|
|
|