chore: Move Benchmark code to top-level #1559

tgasser-nv · 2025-12-31T19:43:38Z

Description

This PR moves the benchmarking code outside of the poetry-managed nemoguardrails directory. It creates a separate virtual environment for Mock LLM benchmarking, and moves tests under there. The README is updated to reflect the new locations of configs. cc @Pouyanpi , @cparisien

Related Issue(s)

#1501 : Comment here

Checklist

I've read the CONTRIBUTING guidelines.
I've updated the documentation if applicable.
I've added tests if applicable.
@mentions of the person or team responsible for reviewing proposed changes.

github-actions · 2025-12-31T19:45:02Z

Documentation preview

https://nvidia-nemo.github.io/Guardrails/review/pr-1559

greptile-apps · 2025-12-31T19:49:45Z

Greptile Summary

This PR moves the benchmarking code from nemoguardrails/benchmark/ to a top-level benchmark/ directory, creating a separate virtual environment for Mock LLM benchmarking outside the poetry-managed package structure
Updates all import paths and configuration file references throughout the codebase to reflect the new directory structure and module hierarchy
Adds new benchmark/requirements.txt for standalone dependency management and updates pytest.ini to include the new test path for benchmark tests

Important Files Changed

Filename	Overview
`benchmark/README.md`	Documentation updated for new setup process but contains outdated path reference on line 69
`benchmark/requirements.txt`	New standalone dependency file with minimum version constraints that could cause compatibility issues
`pytest.ini`	Updated test discovery paths to include relocated benchmark tests
`benchmark/mock_llm_server/response_data.py`	Import path updated and bug fix added to return scalar value instead of numpy array

Confidence score: 4/5

This PR is safe to merge with minimal risk as it's primarily a code reorganization effort with clear separation of concerns
Score reflects comprehensive restructuring with mostly mechanical changes, but deducted one point due to a documentation error and potential version compatibility concerns with flexible dependency constraints
Pay close attention to benchmark/README.md line 69 which contains an outdated directory path that needs correction

Sequence Diagram

sequenceDiagram
    participant User
    participant CliApp as "CLI Application"
    participant AIPerfRunner
    participant ServiceChecker as "Service Checker"
    participant MockLLMServer as "Mock LLM Server"
    participant GuardrailsServer as "Guardrails Server"
    participant AIPerfTool as "AIPerf Tool"

    User->>CliApp: "Run benchmark with config file"
    CliApp->>AIPerfRunner: "Initialize with config"
    AIPerfRunner->>ServiceChecker: "Check service availability"
    ServiceChecker->>MockLLMServer: "GET /health"
    MockLLMServer-->>ServiceChecker: "200 OK"
    ServiceChecker->>GuardrailsServer: "GET /v1/rails/configs"
    GuardrailsServer-->>ServiceChecker: "200 OK"
    ServiceChecker-->>AIPerfRunner: "Services healthy"
    AIPerfRunner->>AIPerfRunner: "Generate sweep combinations"
    loop For each sweep combination
        AIPerfRunner->>AIPerfRunner: "Build command with parameters"
        AIPerfRunner->>AIPerfTool: "Execute benchmark"
        AIPerfTool->>GuardrailsServer: "POST /v1/chat/completions"
        GuardrailsServer->>MockLLMServer: "Content safety check"
        MockLLMServer-->>GuardrailsServer: "Safety response"
        GuardrailsServer->>MockLLMServer: "Main LLM request"
        MockLLMServer-->>GuardrailsServer: "LLM response"
        GuardrailsServer-->>AIPerfTool: "Final response"
        AIPerfTool-->>AIPerfRunner: "Benchmark results"
        AIPerfRunner->>AIPerfRunner: "Save run metadata"
    end
    AIPerfRunner-->>CliApp: "Summary with total/completed/failed"
    CliApp-->>User: "Benchmark completion status"

greptile-apps

_{27 files reviewed, 1 comment}

_{Edit Code Review Agent Settings | Greptile}

benchmark/README.md

codecov · 2025-12-31T19:56:06Z

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

…er examples, and benchmark-specific tests

…ils itself

…ready has this in the poetry env. It's not used in the mocks

tgasser-nv requested review from Pouyanpi and cparisien December 31, 2025 19:44

tgasser-nv self-assigned this Dec 31, 2025

greptile-apps bot reviewed Dec 31, 2025

View reviewed changes

benchmark/README.md Outdated Show resolved Hide resolved

cparisien approved these changes Jan 4, 2026

View reviewed changes

tgasser-nv added 12 commits January 5, 2026 10:46

Move mock LLMs into top-level benchmark dir, local content_safety und…

ffd72f5

…er examples, and benchmark-specific tests

Initial checkin of validation script

d5502ac

Remove un-needed files under nemoguardrails/benchmark

9abe39a

Move unit-tests under benchamrk top-level dir

811ac9f

Update unit-tests with new code location

88c04d9

Add requirements to keep benchmark dependencies separate from Guardra…

0171c25

…ils itself

Update server run script and Procfile with new file locations

593e9e4

Return np.array with size (1,) from mock function calls in tests

a34f036

Remove langchain_nvidia_ai_endpoints from requirements, Guardrails al…

0142faf

…ready has this in the poetry env. It's not used in the mocks

Update README to match new file locations and include venv instructions

205db90

Cleanups to the README

53810b0

README.md cleanup

f74addb

tgasser-nv force-pushed the chore/move-benchmark-to-top branch from c0132cd to f74addb Compare January 5, 2026 16:47

tgasser-nv merged commit 109a36d into develop Jan 5, 2026
10 checks passed

tgasser-nv deleted the chore/move-benchmark-to-top branch January 5, 2026 17:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

chore: Move Benchmark code to top-level #1559

chore: Move Benchmark code to top-level #1559

Uh oh!

tgasser-nv commented Dec 31, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Dec 31, 2025

Uh oh!

greptile-apps bot commented Dec 31, 2025

Confidence score: 4/5

Sequence Diagram

Uh oh!

greptile-apps bot left a comment

Uh oh!

Uh oh!

codecov bot commented Dec 31, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

chore: Move Benchmark code to top-level #1559

chore: Move Benchmark code to top-level #1559

Uh oh!

Conversation

tgasser-nv commented Dec 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Related Issue(s)

Checklist

Uh oh!

github-actions bot commented Dec 31, 2025

Documentation preview

Uh oh!

greptile-apps bot commented Dec 31, 2025

Greptile Summary

Important Files Changed

Confidence score: 4/5

Sequence Diagram

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

codecov bot commented Dec 31, 2025

Codecov Report

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

tgasser-nv commented Dec 31, 2025 •

edited

Loading