V5 additions #245

lorenss-m · 2025-12-17T21:56:40Z

Note

Introduces remote BYOK support, tighter error propagation, stricter scenario validation, and broader HTTP client instrumentation.

Remote BYOK support: Adds --byok CLI flag, byok config, and use_byok in SingleTaskRequest; forwarded via submit_rollouts while stripping api_key/model_client from remote agent_params.
Agent error propagation: MCPAgent.run() now sets ctx.error on failures and step errors; new tests cover exception and step-level propagation.
Scenario robustness: Environment.run_scenario_setup() now raises detailed errors for missing/invalid scenarios and malformed/empty content; MCPAgent.run() gives clearer messages when ctx.prompt is missing.
Gateway auth rules: OpenAIChatAgent forbids custom api_key with HUD Gateway; CLI injects HUD API key for OpenAI-compatible when targeting gateway.
HTTP instrumentation: Extends auto-instrumentation to aiohttp alongside httpx to inject trace/auth headers.
Eval context stability: EvalContext.__aenter__ cleans up connections and contextvars on setup failure.
Minor: log auto-respond errors; lazy-import questionary/typer; display byok in eval summary.

^{Written by Cursor Bugbot for commit 4ddaa33. This will update automatically on new commits. Configure here.}

…e-all-exceptions-during-rollouts Propagate agent errors

- Add context-aware errors in MCPAgent when ctx.prompt is not set - Show available scenarios when requested scenario is not found - Handle malformed and empty scenario responses with clear messages

…m/hud-evals/hud-python into v5

…o v5

shfunc and others added 13 commits December 17, 2025 12:49

propagate agent errors to ctx.error for platform visibility, tests

2154581

Merge pull request #244 from hud-evals/ilya/hud-483-identify-and-rais…

3abea9e

…e-all-exceptions-during-rollouts Propagate agent errors

feat: add BYOK support

6ac7967

Improve scenario prompt error messages

013eeae

- Add context-aware errors in MCPAgent when ctx.prompt is not set - Show available scenarios when requested scenario is not found - Handle malformed and empty scenario responses with clear messages

fix: improve API key handling

c6963de

fix: use httpx client for gemini

2d9f1f3

fix: cleanup httpx client

2f7078d

lint: ruff

3b158bf

fix: hook into aiohttp instead

63576fe

additions tests

623dba5

Merge branch 'v5-' of https://github.com/hud-evals/hud-python into v5

b9dfa8f

Merge branch 'fix/scenario-prompt-error-message' of https://github.co…

8500d38

…m/hud-evals/hud-python into v5

Merge branch 'j/byok' of https://github.com/hud-evals/hud-python into v5

7ef6282

mintlify bot deployed to staging - docs December 28, 2025 10:25 View deployment

Merge branch 'hud-521' of https://github.com/hud-evals/hud-python int…

4ddaa33

…o v5

lorenss-m marked this pull request as ready for review December 28, 2025 17:24

lorenss-m had a problem deploying to pre-release December 28, 2025 17:24 — with GitHub Actions Failure

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

V5 additions #245

V5 additions #245

Uh oh!

lorenss-m commented Dec 17, 2025 •

edited by cursor bot

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

V5 additions #245

Are you sure you want to change the base?

V5 additions #245

Uh oh!

Conversation

lorenss-m commented Dec 17, 2025 • edited by cursor bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

lorenss-m commented Dec 17, 2025 •

edited by cursor bot

Loading