Fix Openrouter functionality by nitink23 · Pull Request #6 · ZephyrCloudIO/ze-benchmarks

nitink23 · 2025-10-27T23:08:49Z

What's the issues or discussion related to this PR ?

Prior to this PR, when users selected an OpenRouter model in interactive mode and executed batch benchmarks, the OpenRouter adapter would always fall back to the default model (minimax/minimax-m2:free) instead of using the selected model. This occurred because the batch execution logic in executeMultipleBenchmarks() was only passing model parameters to anthropic and claude-code agents, while explicitly passing undefined to all other agents including openrouter.
Users reported seeing warnings like:

What's added in this PR?

• Fixed OpenRouter Model Selection in Batch Execution: Updated executeMultipleBenchmarks() function to include openrouter in the list of agents that receive model parameters, preventing it from always falling back to the default model

• Enhanced Model Source Tracking: Added modelSource property and getModelSource() method to OpenRouterAdapter to track whether model came from parameter, environment variable, or default fallback

• Improved Cost Calculation: Implemented proper USD cost calculation based on actual OpenRouter API pricing data instead of hardcoded zero values, including pricing cache and intelligent fallback pricing for unknown models

• Silent Execution Mode: Removed verbose console.log statements from OpenRouterAdapter to match AnthropicAdapter's clean execution behavior, showing only essential error logging and final CLI summary

• Web Dashboard Model Name Prioritization: Updated all dashboard pages (Agents, Overview, Runs, Batch Details) to display specific model names (e.g., mistralai/devstral-small-2505:free) as primary identifiers instead of generic agent types like "openrouter"

• Enhanced User Feedback: Added comprehensive model source detection with clear visual indicators and actionable guidance when default models are used

What are the steps to test this PR?

Here are the instructions for environment variables setup:

Create a .env file in the root directory of the project (*\ze-benchmarks\.env):

# OpenRouter API Key
OPENROUTER_API_KEY=your_openrouter_api_key_here

# Anthropic API Key  
ANTHROPIC_API_KEY=your_anthropic_api_key_here

# Optional: Set default OpenRouter model (overrides default)
OPENROUTER_MODEL=openai/gpt-4o-mini

pnpm bench (select the openrouter models)

Documentation update for this PR (if applicable)?

not needed

(Optional) What's left to be done for this PR?

(Optional) What's the potential risk and how to mitigate it?

Who do you wish to review this PR other than required reviewers?

@zackarychapple @Nsttt

Examples:

Does this PR introduce breaking changes?
Does this include backend implementation that affects plugins, side panels and frontend?
Is this a UI changes that should be added to documentation?
Is this a UI changes on components that will make application look/behave inconsistent?

(Required) Pre-PR/Merge checklist

I have added/updated our documentation to cover this new behavior
I have added an explanation of my changes
I have written new tests (if applicable)
I have tested this locally (standing from a first time user point of view, never touch this app before)
I have mentioned the related person or team responsible for reviewing proposed changes
I have/will run tests, or ask for help to add test

Tool calls and oracle

Nsttt

LGTM for now

RussellCanfield and others added 30 commits October 2, 2025 16:31

Add benchmarks

17cb812

feat:Anthropic Agent adapter

29e0f8d

fix:AnthropicAdapter

6321d36

feat: Oracle tools

6c19161

Chore:cleanup runtime tools

e4efe02

chore: clean up agentadpater

749e893

chore:clean TS files

35a5908

chore:cleaned up imports

67f5dfd

fix:tsc-->tsx

0c30fe8

fix:cli typescript fix

26e00a2

chore: add tsx dev dependency

3daf20b

fix:CLI parser

7cdcfcf

Merge pull request #3 from ZephyrCloudIO/feature/tool-calls-and-oracle

01f64c9

Tool calls and oracle

Feat:database

9bbfffc

feat: SQLite setup

e9d9585

fix: dependencies

4c6ee88

feat: UI changes

dc8cd5d

feat: multi-select enabled

fe5c318

feat: ze-bench ui fixes

9be71b3

adding rsbuild project

8a2858b

reports pulling from db file

f544223

reports working

7acc7dc

initial reports done

4dc2161

feat: stable version

9a93ef9

fix: dev-server start issue solved

932b1fc

fix: database persistence

0aec9d1

fix: database persistence

da63c99

fix: added and changed docs

060b8d0

fix: openrouter-working

23e8312

feat:openrouter-anymodel

cf8cf0d

fix/report website update

aa8ee30

nitink23 marked this pull request as ready for review October 27, 2025 23:20

Merge branch 'main' into fix/openrouter

599ae15

zackarychapple approved these changes Oct 28, 2025

View reviewed changes

Nsttt approved these changes Oct 28, 2025

View reviewed changes

zackarychapple approved these changes Oct 28, 2025

View reviewed changes

zackarychapple merged commit 5f29d91 into main Oct 28, 2025
3 of 8 checks passed

zackarychapple deleted the fix/openrouter branch October 28, 2025 13:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Openrouter functionality#6

Fix Openrouter functionality#6
zackarychapple merged 32 commits intomainfrom
fix/openrouter

nitink23 commented Oct 27, 2025 •

edited

Loading

Uh oh!

Nsttt left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

nitink23 commented Oct 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What's the issues or discussion related to this PR ?

What's added in this PR?

What are the steps to test this PR?

Documentation update for this PR (if applicable)?

(Optional) What's left to be done for this PR?

(Optional) What's the potential risk and how to mitigate it?

Who do you wish to review this PR other than required reviewers?

(Required) Pre-PR/Merge checklist

Uh oh!

Nsttt left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

nitink23 commented Oct 27, 2025 •

edited

Loading