This repository contains the necessary configuration to build a container image for `ansible-chatbot-stack`.

`ansible-chatbot-stack` builds on top of `lightspeed-stack`, which wraps Meta's `llama-stack` AI framework. `ansible-chatbot-stack` includes various customisations for:
- A remote vLLM inference provider (RHOSAI vLLM compatible)
- The inline sentence transformers (Meta)
- AAP RAG database files and configuration
- Lightspeed external providers
- System Prompt injection
Build/Run overview:
```mermaid
flowchart TB
    %% Nodes
    LLAMA_STACK([fa:fa-layer-group llama-stack:x.y.z])
    LIGHTSPEED_STACK([fa:fa-layer-group lightspeed-stack:x.y.z])
    LIGHTSPEED_RUN_CONFIG{{fa:fa-wrench lightspeed-stack.yaml}}
    ANSIBLE_CHATBOT_STACK([fa:fa-layer-group ansible-chatbot-stack:x.y.z])
    ANSIBLE_CHATBOT_RUN_CONFIG{{fa:fa-wrench ansible-chatbot-run.yaml}}
    ANSIBLE_CHATBOT_DOCKERFILE{{fa:fa-wrench Containerfile}}
    ANSIBLE_LIGHTSPEED([fa:fa-layer-group ansible-ai-connect-service:x.y.z])
    LIGHTSPEED_PROVIDERS("fa:fa-code-branch lightspeed-providers:x.y.z")
    PYPI("fa:fa-database PyPI")

    %% Edge connections between nodes
    ANSIBLE_LIGHTSPEED -- Uses --> ANSIBLE_CHATBOT_STACK
    ANSIBLE_CHATBOT_STACK -- Consumes --> PYPI
    LIGHTSPEED_PROVIDERS -- Publishes --> PYPI
    ANSIBLE_CHATBOT_STACK -- Built from --> ANSIBLE_CHATBOT_DOCKERFILE
    ANSIBLE_CHATBOT_STACK -- Inherits from --> LIGHTSPEED_STACK
    ANSIBLE_CHATBOT_STACK -- Includes --> LIGHTSPEED_RUN_CONFIG
    ANSIBLE_CHATBOT_STACK -- Includes --> ANSIBLE_CHATBOT_RUN_CONFIG
    LIGHTSPEED_STACK -- Embeds --> LLAMA_STACK
    LIGHTSPEED_STACK -- Uses --> LIGHTSPEED_RUN_CONFIG
    LLAMA_STACK -- Uses --> ANSIBLE_CHATBOT_RUN_CONFIG
```
- External Providers YAML manifests must be present in `providers.d/` of your host's `llama-stack` directory.
- The Vector Database is copied from the latest `aap-rag-content` image to `./vector_db`.
- The embeddings model files are copied from the latest `aap-rag-content` image to `./embeddings_model`.
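The `make setup` target below is expected to take care of this preparation. For reference, a manual equivalent might look roughly like the following sketch; the registry reference and the in-image source paths are assumptions and must be adjusted to your environment:

```shell
# Hypothetical manual preparation; the image reference and in-image paths are assumptions.
AAP_RAG_CONTENT_IMAGE=<YOUR_REGISTRY>/aap-rag-content:latest

mkdir -p ./vector_db ./embeddings_model
podman pull "${AAP_RAG_CONTENT_IMAGE}"
cid=$(podman create "${AAP_RAG_CONTENT_IMAGE}")
podman cp "${cid}:/rag/vector_db/." ./vector_db/                # assumed in-image location
podman cp "${cid}:/rag/embeddings_model/." ./embeddings_model/  # assumed in-image location
podman rm "${cid}"
```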
```shell
make setup
```
Builds the image `ansible-chatbot-stack:$ANSIBLE_CHATBOT_VERSION`.
Change the `ANSIBLE_CHATBOT_VERSION` and the inference parameters below accordingly.
```shell
export ANSIBLE_CHATBOT_VERSION=0.0.1
make build
```
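A quick way to confirm the build produced the expected tag (assuming Podman is the container engine in use):

```shell
# List the freshly built image; the tag follows ANSIBLE_CHATBOT_VERSION.
podman images --filter "reference=ansible-chatbot-stack:${ANSIBLE_CHATBOT_VERSION}"
```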
```
└── app-root/
    ├── .venv/
    └── src/
        ├── <lightspeed-stack files>
        └── lightspeed_stack.py
```
These are stored in a `PersistentVolumeClaim` for resilience:
```
└── .llama/
    └── data/
        └── distributions/
            └── ansible-chatbot/
                ├── aap_faiss_store.db
                ├── agents_store.db
                ├── responses_store.db
                ├── localfs_datasetio.db
                ├── trace_store.db
                └── embeddings_model/
```
```
└── .llama/
    ├── distributions/
    │   └── ansible-chatbot/
    │       ├── lightspeed-stack.yaml
    │       ├── ansible-chatbot-run.yaml
    │       ├── ansible-chatbot-version-info.json
    │       └── system-prompts/
    │           └── default.txt
    └── providers.d
        └── <llama-stack external providers>
```
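To double-check that a freshly built image actually carries this layout, you can list the directory inside the container. This is only a sketch: it assumes Podman and that the `.llama` tree sits under the container user's home directory.

```shell
# Inspect the baked-in configuration without starting the full stack.
podman run --rm --entrypoint /bin/sh \
  "ansible-chatbot-stack:${ANSIBLE_CHATBOT_VERSION}" \
  -c 'ls -R ~/.llama/distributions/ansible-chatbot'
```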
Runs the image `ansible-chatbot-stack:$ANSIBLE_CHATBOT_VERSION` as a local container.
Change the `ANSIBLE_CHATBOT_VERSION` and the inference parameters below accordingly.
```shell
export ANSIBLE_CHATBOT_VERSION=0.0.1
export ANSIBLE_CHATBOT_VLLM_URL=<YOUR_MODEL_SERVING_URL>
export ANSIBLE_CHATBOT_VLLM_API_TOKEN=<YOUR_MODEL_SERVING_API_TOKEN>
export ANSIBLE_CHATBOT_INFERENCE_MODEL=<YOUR_INFERENCE_MODEL>
export ANSIBLE_CHATBOT_INFERENCE_MODEL_FILTER=<YOUR_INFERENCE_MODEL_TOOLS_FILTERING>
make run
```
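To verify that the local container came up cleanly, the following sketch (assuming Podman) lists the running container and follows its logs:

```shell
# Show the running container and tail its startup logs.
podman ps --filter "ancestor=ansible-chatbot-stack:${ANSIBLE_CHATBOT_VERSION}"
podman logs -f "$(podman ps -q --filter "ancestor=ansible-chatbot-stack:${ANSIBLE_CHATBOT_VERSION}")"
```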
Runs basic tests against the local container.
Change the `ANSIBLE_CHATBOT_VERSION` and the inference parameters below accordingly.
```shell
export ANSIBLE_CHATBOT_VERSION=0.0.1
export ANSIBLE_CHATBOT_VLLM_URL=<YOUR_MODEL_SERVING_URL>
export ANSIBLE_CHATBOT_VLLM_API_TOKEN=<YOUR_MODEL_SERVING_API_TOKEN>
export ANSIBLE_CHATBOT_INFERENCE_MODEL=<YOUR_INFERENCE_MODEL>
export ANSIBLE_CHATBOT_INFERENCE_MODEL_FILTER=<YOUR_INFERENCE_MODEL_TOOLS_FILTERING>
make run-test
```
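Beyond `make run-test`, you can also poke the service manually. The following is only a sketch: the published port and the `/v1/query` endpoint are assumptions about how the lightspeed-stack API is exposed, so adjust them to your actual configuration.

```shell
# Hypothetical smoke test against the locally running container.
curl -s -X POST http://localhost:8080/v1/query \
  -H "Content-Type: application/json" \
  -d '{"query": "How do I install nginx with an Ansible playbook?"}'
```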
```shell
kubectl kustomize . > my-chatbot-stack-deploy.yaml
kubectl apply -f my-chatbot-stack-deploy.yaml
```
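After applying the manifests, a rollout check along these lines can confirm the stack is up; the labels and resource names below are assumptions, so take the real ones from the generated `my-chatbot-stack-deploy.yaml`:

```shell
kubectl get pods -l app=ansible-chatbot-stack            # label selector is an assumption
kubectl rollout status deployment/ansible-chatbot-stack  # deployment name is an assumption
```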
If you need to rebuild the images, apply the following clean-up first:
```shell
make clean
```
```shell
# Obtain a container shell for the Ansible Chatbot Stack.
make shell
```
- Clone the lightspeed-core/lightspeed-stack repository to your development environment.
- In the ansible-chatbot-stack project root, create a `.env` file and define the following variables:

  ```
  PYTHONDONTWRITEBYTECODE=1
  PYTHONUNBUFFERED=1
  PYTHONCOERCECLOCALE=0
  PYTHONUTF8=1
  PYTHONIOENCODING=UTF-8
  LANG=en_US.UTF-8
  VLLM_URL=(VLLM URL Here)
  VLLM_API_TOKEN=(VLLM API Token Here)
  INFERENCE_MODEL=granite-3.3-8b-instruct
  LIBRARY_CLIENT_CONFIG_PATH=./ansible-chatbot-run.yaml
  SYSTEM_PROMPT_PATH=./ansible-chatbot-system-prompt.txt
  EMBEDDINGS_MODEL=./embeddings_model
  VECTOR_DB_DIR=./vector_db
  PROVIDERS_DB_DIR=./work
  EXTERNAL_PROVIDERS_DIR=./llama-stack/providers.d
  ```
- Create a Python run configuration with the following values:
  - script/module: `script`
  - script path: `(lightspeed-stack project root)/src/lightspeed_stack.py`
  - arguments: `--config ./lightspeed-stack_local.yaml`
  - working directory: `(ansible-chatbot-stack project root)`
  - path to ".env" files: `(ansible-chatbot-stack project root)/.env`
- Run the created configuration from the PyCharm main menu (an equivalent command-line invocation is sketched below).
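If you prefer to run outside PyCharm, roughly the same run configuration can be reproduced from a shell. This is a sketch only; it assumes the `.env` file created above and a working Python environment for the lightspeed-stack checkout:

```shell
# Load the variables from .env, then start lightspeed_stack.py with the local config.
set -a; source .env; set +a
python (lightspeed-stack project root)/src/lightspeed_stack.py \
  --config ./lightspeed-stack_local.yaml
```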
If you want to debug code in the `lightspeed-providers` project, you can add it as a local package dependency with:

```shell
uv add --editable (lightspeed-providers project root)
```

This updates the `pyproject.toml` and `uv.lock` files. Remember that these changes are for debugging purposes only; avoid checking them in.
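Since those `pyproject.toml` and `uv.lock` edits should not be committed, one way to drop them once you are done debugging is to restore both files from git and re-sync the environment (assuming you have no other intentional changes to those files):

```shell
git checkout -- pyproject.toml uv.lock
uv sync
```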