Este contenido no está disponible en el idioma seleccionado.

Chapter 12. Configuring API key authentication


Configure API key for authentication with Red Hat AI Inference.

Expand
Table 12.1. API endpoint authentication
EndpointAuthentication required

/v1/chat/completions

Yes

/v1/completions

Yes

/v1/embeddings

Yes

/health

No

/ping

No

Prerequisites

  • You have installed Podman or Docker.
  • You have installed the NVIDIA Container Toolkit to enable Podman access to GPUs.
  • You are logged in as a user with sudo access.
  • You have access to registry.redhat.io and have logged in.
  • You have a Hugging Face account and have generated a Hugging Face access token.

Procedure

  1. Start the inference server with API key authentication:

    $ podman run --rm -it \
      --device nvidia.com/gpu=all \
      --security-opt=label=disable \
      --shm-size=4GB -p 8000:8000 \
      -v ./rhaii-cache:/opt/app-root/src/.cache \
      registry.redhat.io/rhaii/vllm-cuda-rhel9:3.4.0 \
      --model RedHatAI/Llama-3.2-1B-Instruct-FP8 \
      --api-key "<your_api_key>"
  2. Send a request with the API key in the Authorization header:

    $ curl http://localhost:8000/v1/chat/completions \
      -H "Content-Type: application/json" \
      -H "Authorization: Bearer <your_api_key>" \
      -d '{
        "model": "RedHatAI/Llama-3.2-1B-Instruct-FP8",
        "messages": [{"role": "user", "content": "Hello!"}]
      }'

Verification

  • A successful request returns a 200 OK response with the model output.
  • A request without the API key returns 401 Unauthorized.
  • A request with an invalid API key returns 403 Forbidden.
Important

API key authentication provides basic protection for your API endpoints. It is not a comprehensive security solution. For production deployments, consider enabling SSL/TLS encryption using the --ssl-keyfile, --ssl-certfile, and --ssl-ca-certs server arguments. Do not hard code API keys in source code. Use environment variables or a secrets management solution.

Red Hat logoGithubredditYoutubeTwitter

Aprender

Pruebe, compre y venda

Comunidades

Acerca de Red Hat

Ofrecemos soluciones reforzadas que facilitan a las empresas trabajar en plataformas y entornos, desde el centro de datos central hasta el perímetro de la red.

Hacer que el código abierto sea más inclusivo

Red Hat se compromete a reemplazar el lenguaje problemático en nuestro código, documentación y propiedades web. Para más detalles, consulte el Blog de Red Hat.

Acerca de la documentación de Red Hat

Legal Notice

Theme

© 2026 Red Hat
Volver arriba