A comprehensive benchmarking framework for testing Open WebUI performance under various load conditions.
This benchmark suite is designed to:
- Measure concurrent user capacity - Test how many users can simultaneously use features like Channels
- Identify performance limits - Find the point where response times degrade
- Compare compute profiles - Test performance across different resource configurations
- Generate actionable reports - Provide detailed metrics and recommendations

Prerequisites:

- Python 3.11+
- Docker and Docker Compose
- A running Open WebUI instance (or use the provided Docker setup)

Quick start:

- Install the benchmark package:

  ```bash
  cd benchmark
  python -m venv .venv
  source .venv/bin/activate  # On Windows: .venv\Scripts\activate
  pip install -e .
  ```

- Copy the example environment file and configure your admin credentials:

  ```bash
  cp .env.example .env
  ```

  Edit .env with your Open WebUI admin credentials:
  ```env
  OPEN_WEBUI_URL=http://localhost:8080
  ADMIN_USER_EMAIL=admin@example.com
  ADMIN_USER_PASSWORD=your-password
  ```

- Start Open WebUI with the benchmark configuration:

  ```bash
  cd docker
  ./run.sh default  # Use the default compute profile (2 CPU, 8GB RAM)
  ```

- Run the benchmark:
  ```bash
  # Run all benchmarks
  owb run all

  # Run only the channel concurrency benchmark
  owb run channels -m 50  # Test up to 50 concurrent users

  # Run with a specific target URL
  owb run channels -u http://localhost:3000

  # Run with a specific compute profile
  owb run channels -p cloud_medium
  ```

- View results:
Results are saved to results/ in JSON, CSV, and text formats.
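
For quick comparisons across runs, the combined CSV can be inspected with pandas (already a listed dependency). A minimal sketch, assuming at least one results file exists; the exact columns depend on which benchmarks were run:

```python
import glob

import pandas as pd

# Load the most recent combined results file and summarize it.
# Column names are whatever the benchmark wrote; we only inspect them here.
latest = sorted(glob.glob("results/benchmark_results_*.csv"))[-1]
df = pd.read_csv(latest)
print(f"Loaded {latest} with columns: {list(df.columns)}")
print(df.describe(include="all"))
```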
Compute profiles define the resource constraints for the Open WebUI container:
| Profile | CPUs | Memory | Use Case |
|---|---|---|---|
| `default` | 2 | 8GB | Local MacBook testing |
| `minimal` | 1 | 4GB | Testing lower bounds |
| `cloud_small` | 2 | 4GB | Small cloud VM |
| `cloud_medium` | 4 | 8GB | Medium cloud VM |
| `cloud_large` | 8 | 16GB | Large cloud VM |
List available profiles:
```bash
owb profiles
```

The channel concurrency benchmark tests concurrent user capacity in Open WebUI Channels:
- Creates a test channel
- Progressively adds users (10, 20, 30, ... up to max)
- Each user sends messages at a configured rate
- Measures response times and error rates
- Identifies the maximum sustainable user count
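
Conceptually, the ramp-up works like the sketch below. This is an illustration only, not the actual implementation (which lives in benchmark/scenarios/channels.py); `measure_level` stands in for one sustain phase and fabricates numbers:

```python
import asyncio
import random


async def measure_level(users: int, duration: int) -> dict:
    """Stand-in for one sustain phase: the real benchmark runs `users`
    simulated clients for `duration` seconds and aggregates their timings."""
    await asyncio.sleep(0)  # placeholder for the real sustain phase
    return {
        "p95_ms": 80.0 * users + random.uniform(0, 100),
        "error_rate_percent": max(0.0, (users - 45) * 0.2),
    }


async def ramp_up(max_users: int = 100, step: int = 10, sustain_time: int = 30,
                  p95_threshold_ms: float = 3000.0, max_error_rate: float = 1.0) -> int:
    """Increase load in steps; return the last level that met both thresholds."""
    max_sustainable = 0
    for level in range(step, max_users + 1, step):
        stats = await measure_level(level, sustain_time)
        if (stats["p95_ms"] >= p95_threshold_ms
                or stats["error_rate_percent"] >= max_error_rate):
            break
        max_sustainable = level
    return max_sustainable


print(asyncio.run(ramp_up()))  # prints the highest passing level for the fake numbers
```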
Configuration options:
```yaml
channels:
  max_concurrent_users: 100  # Maximum users to test
  user_step_size: 10         # Increment users by this amount
  sustain_time: 30           # Seconds to run at each level
  message_frequency: 0.5     # Messages per second per user
```

The suite also tests WebSocket scalability for real-time message delivery.
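
A rough sketch of that kind of load, written with python-socketio (a listed dependency). The URL, auth payload, and event name below are assumptions made for illustration; the real connection logic lives in benchmark/clients/websocket_client.py:

```python
import asyncio

import socketio  # python-socketio


async def open_socket(url: str, token: str) -> socketio.AsyncClient:
    """Open one Socket.IO connection and listen for channel events."""
    sio = socketio.AsyncClient()

    @sio.on("channel-events")  # event name is an assumption
    async def on_channel_event(data):
        # The real benchmark would record delivery latency here.
        pass

    await sio.connect(url, auth={"token": token})
    return sio


async def main():
    # Hold a batch of concurrent sockets open and observe message delivery.
    clients = await asyncio.gather(
        *[open_socket("http://localhost:8080", "<jwt>") for _ in range(25)]
    )
    await asyncio.sleep(30)
    await asyncio.gather(*[client.disconnect() for client in clients])


asyncio.run(main())
```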
Configuration files are located in config/:
- `benchmark_config.yaml` - Main benchmark settings
- `compute_profiles.yaml` - Resource profiles for Docker containers
All configuration can be set via environment variables loaded from the .env file (a minimal loading sketch follows the table below):

| Variable | Description | Default |
|---|---|---|
| `OPEN_WEBUI_URL` | Open WebUI URL for benchmarking | http://localhost:8080 |
| `OLLAMA_BASE_URL` | Ollama API URL | http://host.docker.internal:11434 |
| `ENABLE_CHANNELS` | Enable Channels feature | true |
| `ADMIN_USER_EMAIL` | Admin email | - |
| `ADMIN_USER_PASSWORD` | Admin password | - |
| `MAX_CONCURRENT_USERS` | Max concurrent users | 50 |
| `USER_STEP_SIZE` | User increment step | 10 |
| `SUSTAIN_TIME_SECONDS` | Test duration per level (seconds) | 30 |
| `MESSAGE_FREQUENCY` | Messages/sec per user | 0.5 |
| `OPEN_WEBUI_PORT` | Container port | 8080 |
| `CPU_LIMIT` | CPU limit | 2.0 |
| `MEMORY_LIMIT` | Memory limit | 8g |
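
How these variables might be picked up at runtime, as a minimal standard-library sketch (the real logic lives in benchmark/core/config.py and may differ; the field names here simply mirror the table above):

```python
import os
from dataclasses import dataclass


@dataclass
class BenchmarkSettings:
    """Illustrative settings object; defaults match the table above."""
    open_webui_url: str = os.getenv("OPEN_WEBUI_URL", "http://localhost:8080")
    admin_user_email: str = os.getenv("ADMIN_USER_EMAIL", "")
    admin_user_password: str = os.getenv("ADMIN_USER_PASSWORD", "")
    max_concurrent_users: int = int(os.getenv("MAX_CONCURRENT_USERS", "50"))
    user_step_size: int = int(os.getenv("USER_STEP_SIZE", "10"))
    sustain_time_seconds: int = int(os.getenv("SUSTAIN_TIME_SECONDS", "30"))
    message_frequency: float = float(os.getenv("MESSAGE_FREQUENCY", "0.5"))


settings = BenchmarkSettings()
print(settings.open_webui_url, settings.max_concurrent_users)
```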
To add a new benchmark:

- Create a new file in `benchmark/scenarios/`:
  ```python
  from benchmark.core.base import BaseBenchmark
  from benchmark.core.metrics import BenchmarkResult


  class MyNewBenchmark(BaseBenchmark):
      name = "My New Benchmark"
      description = "Tests something new"
      version = "1.0.0"

      async def setup(self) -> None:
          # Set up test environment
          pass

      async def run(self) -> BenchmarkResult:
          # Execute the benchmark
          # Use self.metrics to record timings
          return self.metrics.get_result(self.name)

      async def teardown(self) -> None:
          # Clean up
          pass
  ```

- Register the benchmark in `benchmark/cli.py`
- Add configuration options if needed in `config/benchmark_config.yaml`

Benchmarks record timings through the `MetricsCollector`:

```python
from benchmark.core.metrics import MetricsCollector

metrics = MetricsCollector()
metrics.start()

# Time individual operations
with metrics.time_operation("my_operation"):
    await do_something()

# Or record manually
metrics.record_timing(
    operation="api_call",
    duration_ms=150.5,
    success=True,
)

metrics.stop()
result = metrics.get_result("My Benchmark")
```

Key metrics and the thresholds used to judge them:

| Metric | Description | Good Threshold |
|---|---|---|
| `avg_response_time_ms` | Average response time | < 2000ms |
| `p95_response_time_ms` | 95th percentile response time | < 3000ms |
| `error_rate_percent` | Percentage of failed requests | < 1% |
| `requests_per_second` | Throughput | > 10 |
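
These thresholds drive the per-level pass/fail decision. A minimal sketch of that check, assuming a result dict keyed by the metric names above (the real evaluation may differ):

```python
# Evaluate one load level against the thresholds in the table above.
THRESHOLDS = {
    "avg_response_time_ms": lambda v: v < 2000,
    "p95_response_time_ms": lambda v: v < 3000,
    "error_rate_percent": lambda v: v < 1.0,
    "requests_per_second": lambda v: v > 10,
}


def meets_thresholds(level_result: dict) -> bool:
    return all(check(level_result[name]) for name, check in THRESHOLDS.items())


example = {
    "avg_response_time_ms": 180.0,
    "p95_response_time_ms": 520.0,
    "error_rate_percent": 0.3,
    "requests_per_second": 42.0,
}
print(meets_thresholds(example))  # True
```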
Benchmark runs write the following files to results/:

- `*.json` - Detailed results for each benchmark run
- `benchmark_results_*.csv` - Combined results in CSV format
- `summary_*.txt` - Human-readable summary

The channel benchmark reports:
- max_sustainable_users: Maximum users where performance thresholds are met
- results_by_level: Performance at each user count level
- tested_levels: All user counts that were tested
Example result analysis:
```text
Users: 10 | P95: 150ms  | Errors: 0%   | ✓ PASS
Users: 20 | P95: 280ms  | Errors: 0.1% | ✓ PASS
Users: 30 | P95: 520ms  | Errors: 0.3% | ✓ PASS
Users: 40 | P95: 1200ms | Errors: 0.8% | ✓ PASS
Users: 50 | P95: 3500ms | Errors: 2.1% | ✗ FAIL

Maximum sustainable users: 40
```

Project layout:

```text
benchmark/
├── benchmark/
│   ├── core/                    # Core framework
│   │   ├── base.py              # Base benchmark class
│   │   ├── config.py            # Configuration management
│   │   ├── metrics.py           # Metrics collection
│   │   └── runner.py            # Benchmark orchestration
│   ├── clients/                 # API clients
│   │   ├── http_client.py       # HTTP/REST client
│   │   └── websocket_client.py  # WebSocket client
│   ├── scenarios/               # Benchmark implementations
│   │   └── channels.py          # Channel benchmarks
│   ├── utils/                   # Utilities
│   │   └── docker.py            # Docker management
│   └── cli.py                   # Command-line interface
├── config/                      # Configuration files
├── docker/                      # Docker Compose for benchmarking
└── results/                     # Benchmark output (gitignored)
```
The benchmark suite reuses Open WebUI dependencies where possible:
From Open WebUI:
- `httpx` - HTTP client
- `aiohttp` - Async HTTP
- `python-socketio` - WebSocket client
- `pydantic` - Data validation
- `pandas` - Data analysis

Benchmark-specific:
- `locust` - Load testing (optional, for advanced scenarios)
- `rich` - Terminal output
- `docker` - Docker SDK
- `matplotlib` - Plotting results

Common issues:

- Connection refused: Ensure Open WebUI is running and accessible
- Authentication errors: Check admin credentials in config
- Docker resource errors: Ensure Docker has enough resources allocated
- WebSocket timeout: Increase `websocket_timeout` in config

Set logging level to DEBUG:
```bash
export BENCHMARK_LOG_LEVEL=DEBUG
owb run channels
```

When adding new benchmarks:
- Follow the `BaseBenchmark` interface
- Add tests for the new benchmark
- Update configuration schema if needed
- Add documentation to this README
MIT License - See LICENSE file