Chapter 12. Configuring API key authentication
Configure an API key to authenticate requests to Red Hat AI Inference.
| Endpoint | Authentication required |
|---|---|
|  | Yes |
|  | Yes |
|  | Yes |
|  | No |
|  | No |
Prerequisites
- You have installed Podman or Docker.
- You have installed the NVIDIA Container Toolkit to enable Podman access to GPUs.
- You are logged in as a user with sudo access.
- You have access to registry.redhat.io and have logged in.
- You have a Hugging Face account and have generated a Hugging Face access token.
Procedure
Start the inference server with API key authentication:
$ podman run --rm -it \
  --device nvidia.com/gpu=all \
  --security-opt=label=disable \
  --shm-size=4GB -p 8000:8000 \
  -v ./rhaii-cache:/opt/app-root/src/.cache \
  registry.redhat.io/rhaii/vllm-cuda-rhel9:3.4.0 \
  --model RedHatAI/Llama-3.2-1B-Instruct-FP8 \
  --api-key "<your_api_key>"

Send a request with the API key in the Authorization header:
$ curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer <your_api_key>" \
  -d '{
    "model": "RedHatAI/Llama-3.2-1B-Instruct-FP8",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
Verification
- A successful request returns a 200 OK response with the model output.
- A request without the API key returns 401 Unauthorized.
- A request with an invalid API key returns 403 Forbidden.
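
The checks above can be scripted. The following is a minimal sketch, assuming the server from the procedure is listening on localhost:8000. The helper names http_status and describe_status and the BASE_URL variable are hypothetical, introduced only for illustration, and the status descriptions simply restate the list above.

```shell
# Hypothetical helper: print only the HTTP status code of a chat completion request.
# $1 is the API key to send; pass an empty string to omit the Authorization header.
http_status() {
  hs_url="${BASE_URL:-http://localhost:8000}/v1/chat/completions"
  hs_body='{"model": "RedHatAI/Llama-3.2-1B-Instruct-FP8", "messages": [{"role": "user", "content": "Hello!"}]}'
  if [ -n "$1" ]; then
    curl -s -o /dev/null -w '%{http_code}' \
      -H "Content-Type: application/json" \
      -H "Authorization: Bearer $1" \
      -d "$hs_body" "$hs_url"
  else
    curl -s -o /dev/null -w '%{http_code}' \
      -H "Content-Type: application/json" \
      -d "$hs_body" "$hs_url"
  fi
}

# Hypothetical helper: map a status code to the outcome described in the list above.
describe_status() {
  case "$1" in
    200) echo "OK: request authenticated" ;;
    401) echo "Unauthorized: API key missing" ;;
    403) echo "Forbidden: API key invalid" ;;
    *)   echo "Unexpected status: $1" ;;
  esac
}

# Example usage (requires the running server):
#   describe_status "$(http_status "<your_api_key>")"
#   describe_status "$(http_status "")"
#   describe_status "$(http_status "not-the-real-key")"
```

Printing only the status code with curl's -w '%{http_code}' option keeps the check independent of the model output, which varies between requests.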
API key authentication provides basic protection for your API endpoints, but it is not a comprehensive security solution. For production deployments, consider enabling SSL/TLS encryption with the --ssl-keyfile, --ssl-certfile, and --ssl-ca-certs server arguments. Do not hard-code API keys in source code; use environment variables or a secrets management solution instead.
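
One way to keep the key out of scripts is to read it from an environment variable at request time. The following is a minimal sketch under that assumption. The variable name RHAII_API_KEY and the helper auth_header are hypothetical, chosen for illustration, and are not defined by the product.

```shell
# Hypothetical helper: build the Authorization header from RHAII_API_KEY.
# Prints a message on stderr and returns 1 if the variable is unset or empty.
auth_header() {
  if [ -z "${RHAII_API_KEY:-}" ]; then
    echo "RHAII_API_KEY is not set" >&2
    return 1
  fi
  printf 'Authorization: Bearer %s' "$RHAII_API_KEY"
}

# Example usage (assumes the key is exported beforehand, for example by a secrets manager):
#   export RHAII_API_KEY="$(openssl rand -hex 32)"   # one way to generate a random key
#   curl http://localhost:8000/v1/chat/completions \
#     -H "Content-Type: application/json" \
#     -H "$(auth_header)" \
#     -d '{"model": "RedHatAI/Llama-3.2-1B-Instruct-FP8", "messages": [{"role": "user", "content": "Hello!"}]}'
```

The same variable can be passed to the server as --api-key "$RHAII_API_KEY" so that the literal key appears in neither the launch script nor the client script.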