Build an AI-powered virtual agent

Build and deploy a conversational AI virtual agent on Red Hat OpenShift AI to automate customer interactions and provide instant support.


This content is authored by Red Hat experts, but has not yet been tested on every supported configuration.

Detailed description

This platform provides the tools to build and deploy conversational AI agents that can:

  • Access knowledge bases - Upload documents and create searchable knowledge bases for RAG (Retrieval-Augmented Generation)
  • Use tools - Integrate web search, databases, and custom tools through the Model Context Protocol (MCP)
  • Apply guardrails - Enforce built-in safety measures and content filtering
  • Scale in production - Run on a Kubernetes-ready architecture

Key Features

  • 🤖 Agent Management - Create and configure AI agents with different capabilities
  • 📚 Knowledge Integration - Document search and question answering via RAG
  • 💬 Real-time Chat - Streaming conversations with session history
  • 🔧 Tool Ecosystem - Built-in tools plus extensible MCP server support
  • 🛡️ Safety Controls - Configurable guardrails and content filtering

Architecture Overview

The platform integrates several components:

  • React Frontend - Web interface for agent and chat management
  • FastAPI Backend - API server handling business logic and data persistence
  • LlamaStack - AI platform managing models, agents, and inference
  • PostgreSQL + pgvector - Data storage with vector search capabilities (see the sketch after this list)
  • Kubernetes Pipeline - Document processing and knowledge base ingestion
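
To make the pgvector layer concrete, here is a minimal sketch of a nearest-neighbour query over stored document embeddings using pgvector's cosine-distance operator. The table and column names are illustrative assumptions, not the platform's actual schema; the connection string mirrors the local DATABASE_URL shown later in this guide.

import psycopg2

# Connection string matches the local-development DATABASE_URL below;
# adjust credentials and host for your environment
conn = psycopg2.connect("postgresql://admin:password@localhost:5432/ai_virtual_agent")

# all-MiniLM-L6-v2 (the embedding model used later) produces 384-dim vectors
query_embedding = [0.1] * 384

with conn, conn.cursor() as cur:
    # 'documents' and 'embedding' are hypothetical names; '<=>' is
    # pgvector's cosine-distance operator
    cur.execute(
        "SELECT content FROM documents ORDER BY embedding <=> %s::vector LIMIT 5",
        ("[" + ",".join(map(str, query_embedding)) + "]",),
    )
    for (content,) in cur.fetchall():
        print(content)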

📖 Detailed Architecture →

Requirements

Minimum hardware requirements

For a full working version with local inference:

  • GPU - Required for running inference locally
  • Alternatively, you can deploy without a GPU by using:
    • Remote vLLM deployment
    • Vertex AI

Minimum software requirements

  • Red Hat OpenShift - Container orchestration platform
  • Red Hat OpenShift AI - AI/ML platform for model serving and management
  • oc CLI - OpenShift command-line tool
  • make - Build automation tool
  • Hugging Face token - With access to models (some models require authorization)
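
Because some Llama models are gated, it is worth confirming your token has access before deploying. A minimal sketch using the huggingface_hub client (one way among several; the placeholder token is an assumption):

from huggingface_hub import HfApi

# "hf_xxx" is a placeholder -- substitute your own token; gated Llama
# models raise an error here if access has not been granted
HfApi(token="hf_xxx").model_info("meta-llama/Llama-3.2-3B-Instruct")
print("Token has access to the model")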

Required user permissions

  • Cluster admin access - Required for installing ClusterRole resources for OAuth authentication

Deploy

Cluster Deployment

For production installation on Kubernetes/OpenShift:

# Clone the repository
git clone https://github.com/rh-ai-quickstart/ai-virtual-agent.git

# Navigate to cluster deployment directory
cd deploy/cluster

# Install with interactive prompts for configuration
make install NAMESPACE=your-namespace
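
After the install completes, confirm the pods come up. Plain `oc get pods -n your-namespace` does the job; the same check scripted in Python, for consistency with the API example later in this guide:

import subprocess

# Assumes the oc CLI is on PATH and you are logged in to the cluster
namespace = "your-namespace"
pods = subprocess.run(
    ["oc", "get", "pods", "-n", namespace],
    capture_output=True, text=True, check=True,
)
print(pods.stdout)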

🧭 Advanced instructions →

📖 Full Installation Guide →

Delete

To remove the application and all associated resources:

cd deploy/cluster
make uninstall NAMESPACE=your-namespace

This will automatically clean up the Helm chart, deployed resources, and PVCs.

Example Use Case

Creating a Customer Support Agent with Knowledge Base

import requests

BASE_URL = "http://localhost:8000/api/v1"

# 1. Create a knowledge base
kb_response = requests.post(
    f"{BASE_URL}/knowledge_bases",
    json={
        "vector_store_name": "support-docs-v1",
        "name": "Support Documentation",
        "version": "v1",
        "embedding_model": "sentence-transformers/all-MiniLM-L6-v2",
        "provider_id": "ollama",
        "source": "S3"
    }
)
print(f"Knowledge base created: {kb_response.status_code}")

# 2. Create a support agent
agent_response = requests.post(
    f"{BASE_URL}/virtual_agents",
    headers={
        "X-Forwarded-User": "admin",
        "X-Forwarded-Email": "admin@change.me"
    },
    json={
        "name": "Support Agent",
        "model_name": "meta-llama/Llama-3.2-3B-Instruct",
        "prompt": "You are a helpful customer support agent",
        "knowledge_base_ids": ["support-docs-v1"],
        "tools": [{"toolgroup_id": "builtin::web_search"}],
        "temperature": 0.7,
        "top_p": 0.9,
        "max_tokens": 2048
    }
)
agent_id = agent_response.json()["id"]
print(f"Agent created: {agent_id}")

# 3. Create a chat session
session_response = requests.post(
    f"{BASE_URL}/chat_sessions",
    json={
        "agent_id": agent_id,
        "session_name": "Customer Support Session"
    }
)
session_id = session_response.json()["id"]
print(f"Chat session created: {session_id}")

# 4. Send a chat message
chat_response = requests.post(
    f"{BASE_URL}/chat",
    json={
        "virtualAgentId": agent_id,
        "sessionId": session_id,
        "message": {
            "role": "user",
            "content": [
                {
                    "type": "input_text",
                    "text": "Who is the first president of the United States?"
                }
            ]
        },
        "stream": False
    }
)
answer = chat_response.json()
print(f"Agent response: {answer}")
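
The request above sets "stream": False and returns a complete answer. The chat endpoint also supports streaming; the sketch below reuses agent_id and session_id from the script above and assumes the server emits incremental chunks line by line (adjust the parsing to the actual wire format):

# Streaming variant of step 4 -- wire format is an assumption
with requests.post(
    f"{BASE_URL}/chat",
    json={
        "virtualAgentId": agent_id,
        "sessionId": session_id,
        "message": {
            "role": "user",
            "content": [{"type": "input_text", "text": "Summarize your capabilities."}],
        },
        "stream": True,
    },
    stream=True,
) as resp:
    for line in resp.iter_lines():
        if line:
            print(line.decode("utf-8"))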

Advanced instructions

Getting Started Guides

👩‍💻 For Developers

🚀 For Deployment

🔧 For Integration

Project Structure

ai-virtual-agent/
├── frontend/           # React UI with PatternFly components
├── backend/            # FastAPI server with PostgreSQL
├── docs/               # Architecture and API documentation
├── deploy/
│   ├── cluster/        # Kubernetes/Helm cluster deployment
│   │   ├── helm/       # Helm chart files
│   │   ├── scripts/    # Cluster deployment scripts
│   │   ├── Containerfile # Cluster container image
│   │   └── Makefile    # Cluster deployment commands
│   └── local/          # Local development deployment
│       ├── compose.dev.yaml # Docker Compose for local dev
│       ├── dev/        # Local development configs
│       └── Makefile    # Local development commands
└── tests/              # Integration test suite

Local Development

For local containerized development (without cluster):

📖 Local Development Guide →

Note: Local setup has limited functionality compared to OpenShift AI deployment:

  • No authentication/authorization
  • Knowledge bases not available
  • MCP servers not tested

These features are only available with the full OpenShift AI deployment.

cd deploy/local

# Start all services
make compose-up

# Other available commands
make compose-down        # Stop all services
make compose-logs        # View logs
make compose-restart     # Restart services
make compose-status      # Show status
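
Once the services are up, a quick way to confirm the backend is reachable: FastAPI serves its interactive API docs at /docs by default. The port below is an assumption based on the BASE_URL in the example above; adjust it if your compose file maps a different one.

import requests

# Port 8000 matches the BASE_URL in the example above; adjust if needed
resp = requests.get("http://localhost:8000/docs", timeout=5)
print("Backend reachable:", resp.status_code == 200)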

Cluster Development

cd deploy/cluster

# Install on cluster
make install NAMESPACE=your-namespace

# Other available commands
make uninstall NAMESPACE=your-namespace    # Remove application
make install-status NAMESPACE=your-namespace    # Check status
make list-mcps                              # List available MCP servers

Note: All Makefile targets automatically load environment variables from a .env file in the repository root if it exists.

Environment setup (.env)

Create a .env file in the repository root to configure your local environment:

cp .env.example .env
# then edit `.env` as needed

At minimum, set:

DATABASE_URL=postgresql+asyncpg://admin:password@localhost:5432/ai_virtual_agent
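
The asyncpg driver named in the URL is PostgreSQL's async driver, the form SQLAlchemy's async engine expects. A minimal connectivity check, assuming sqlalchemy (1.4+) and asyncpg are installed:

import asyncio

from sqlalchemy import text
from sqlalchemy.ext.asyncio import create_async_engine

# Same URL as in .env above
engine = create_async_engine(
    "postgresql+asyncpg://admin:password@localhost:5432/ai_virtual_agent"
)

async def main() -> None:
    async with engine.connect() as conn:
        result = await conn.execute(text("SELECT 1"))
        print("Database reachable:", result.scalar() == 1)

asyncio.run(main())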

Optional toggles:

# Skip attachments bucket initialization/access during local dev
DISABLE_ATTACHMENTS=true

# Provide admin bootstrap for Alembic seeding (optional)
# ADMIN_USERNAME=admin
# ADMIN_EMAIL=admin@change.me

Note: If you're not using attachments in local dev, you can set DISABLE_ATTACHMENTS=true in .env to skip attachment-related initialization.

Community & Support

License

MIT License - Built with ❤️ by the Red Hat Ecosystem App Engineering team
