Show cache read and write prices for OpenRouter inference providers #7176

Open · wants to merge 1 commit into main

Conversation


@chrarnoldus chrarnoldus commented Aug 18, 2025

Related GitHub Issue

Upstreaming some changes from Kilo Code: Kilo-Org/kilocode#1893, Kilo-Org/kilocode#1940

Description

Fixes OpenRouter models that support prompt caching but don't list a cache write price (e.g. GPT-5) being shown as "Does not support prompt caching".

Fixes all models being shown as "Supports prompt caching" when a specific inference provider is selected, even when that provider doesn't support prompt caching.

For specific inference providers, show the cache read and write prices from the OpenRouter metadata.

Use the unique tag for inference providers, rather than the non-unique provider_name.
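
Taken together, the core change can be sketched as follows. This is a minimal, self-contained TypeScript sketch based on the review diffs below; the `Pricing` and `Endpoint` types, the `input_cache_write` field, and the `parseApiPrice` stand-in are simplified assumptions, not the verbatim code in openrouter.ts:

```typescript
// Sketch only: simplified stand-ins for the real types and helpers in openrouter.ts.
type Pricing = { input_cache_read?: string; input_cache_write?: string }
type Endpoint = { provider_name: string; tag?: string; pricing?: Pricing }

// Stand-in for the real parseApiPrice helper (assumed conversion).
const parseApiPrice = (price: string): number => parseFloat(price)

function parseCacheSupport(pricing?: Pricing) {
	const cacheReadsPrice = pricing?.input_cache_read ? parseApiPrice(pricing.input_cache_read) : undefined
	const cacheWritesPrice = pricing?.input_cache_write ? parseApiPrice(pricing.input_cache_write) : undefined
	// A cache read price alone now implies caching support: some models
	// (e.g. GPT-5) support caching but don't charge for cache writes.
	const supportsPromptCache = typeof cacheReadsPrice !== "undefined"
	return { cacheReadsPrice, cacheWritesPrice, supportsPromptCache }
}

// Endpoints are keyed by the unique `tag`, falling back to the
// non-unique `provider_name` when no tag is present.
function keyFor(endpoint: Endpoint): string {
	return endpoint.tag ?? endpoint.provider_name
}
```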

Test Procedure

Go into the API Providers settings, select OpenRouter, and verify all displayed info is correct (a sketch of an equivalent automated check follows this list):

  • "Supports prompt caching" label
  • Cache read/write prices
  • Names of inference providers are unique
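
An automated version of this check might look like the following. This is a sketch assuming a Vitest test alongside openrouter.spec.ts; the import path and the `openai/gpt-5` model ID are illustrative:

```typescript
import { describe, it, expect } from "vitest"
// Illustrative import path; the real helper lives next to openrouter.spec.ts.
import { getOpenRouterModels } from "../openrouter"

describe("OpenRouter cache metadata", () => {
	it("treats a cache read price alone as prompt caching support", async () => {
		const models = await getOpenRouterModels()
		// GPT-5 lists a cache read price but no cache write price, yet
		// should still be reported as supporting prompt caching.
		const gpt5 = models["openai/gpt-5"]
		expect(gpt5?.supportsPromptCache).toBe(true)
	})
})
```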

Pre-Submission Checklist

  • Issue Linked: This PR is linked to an approved GitHub Issue (see "Related GitHub Issue" above).
  • Scope: My changes are focused on the linked issue (one major feature/fix per PR).
  • Self-Review: I have performed a thorough self-review of my code.
  • Testing: New and/or updated tests have been added to cover my changes (if applicable).
  • Documentation Impact: I have considered if my changes require documentation updates (see "Documentation Updates" section below).
  • Contribution Guidelines: I have read and agree to the Contributor Guidelines.

Screenshots / Videos

Before

GPT-5 shown as not supporting prompt caching even though it does:

[screenshot]

Qwen3 Coder with DeepInfra inference provider showing it does support prompt caching even though it doesn't:

[screenshot]

Inference provider list for Sonnet 4 being incomplete, because the names are not unique:

[screenshot]

After

GPT-5 shown as supporting prompt caching:

[screenshot]

Qwen3 Coder with DeepInfra inference provider shown as not supporting prompt caching:

[screenshot]

GLM 4.5 metadata without specific inference provider selected:

[screenshot]

GLM 4.5 metadata with Z.AI selected as inference provider, which supports prompt caching:

[screenshot]

Entire list of Sonnet 4 providers shown with unique tags:

[screenshot]

Additional Notes

Some of the fields in the OpenRouter metadata are not documented. I have pinged our OpenRouter reps to update the docs.

Get in Touch

Christiaan in shared Slack


Important

Fixes cache support and pricing display for OpenRouter models, using unique tags for provider identification.

  • Behavior:
    • Fixes incorrect cache support display for models without cache write price in openrouter.ts.
    • Corrects cache support display for specific inference providers in useOpenRouterModelProviders.ts.
    • Displays cache read/write prices from OpenRouter metadata in useOpenRouterModelProviders.ts.
  • Schema and Parsing:
    • Adds tag field to openRouterModelEndpointSchema in openrouter.ts and useOpenRouterModelProviders.ts (sketched after this list).
    • Updates parseOpenRouterModel to use tag for provider identification in openrouter.ts.
  • Tests:
    • Updates tests in openrouter.spec.ts to reflect changes in cache support and provider identification.
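
The schema change described above might be sketched like this, assuming a Zod-based schema (as the openRouterModelEndpointSchema name suggests); all fields other than `tag` are illustrative, not the full real schema:

```typescript
import { z } from "zod"

// Illustrative shape; only the optional `tag` field is the addition
// described in this PR.
const openRouterModelEndpointSchema = z.object({
	name: z.string(),
	provider_name: z.string(),
	tag: z.string().optional(), // unique identifier, preferred over the non-unique provider_name
	pricing: z
		.object({
			input_cache_read: z.string().optional(),
			input_cache_write: z.string().optional(),
		})
		.optional(),
})

type OpenRouterModelEndpoint = z.infer<typeof openRouterModelEndpointSchema>
```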

This description was created by Ellipsis for 4733f15.

@chrarnoldus chrarnoldus requested review from mrubens, cte and jr as code owners August 18, 2025 09:12
@dosubot dosubot bot added the size:M (This PR changes 30-99 lines, ignoring generated files.) and bug (Something isn't working) labels Aug 18, 2025

@roomote roomote bot left a comment


Thank you for your contribution! I've reviewed the changes and they look good overall. The implementation correctly addresses the cache support detection issues and improves provider identification. I have a few minor suggestions for improvement.

@@ -188,7 +189,7 @@ export const parseOpenRouterModel = ({

 const cacheReadsPrice = model.pricing?.input_cache_read ? parseApiPrice(model.pricing?.input_cache_read) : undefined

-const supportsPromptCache = typeof cacheWritesPrice !== "undefined" && typeof cacheReadsPrice !== "undefined"
+const supportsPromptCache = typeof cacheReadsPrice !== "undefined" // some models support caching but don't charge a cacheWritesPrice, e.g. GPT-5

Could we make this comment more descriptive? Consider:

Suggested change:
-const supportsPromptCache = typeof cacheReadsPrice !== "undefined" // some models support caching but don't charge a cacheWritesPrice, e.g. GPT-5
+const supportsPromptCache = typeof cacheReadsPrice !== "undefined" // OpenRouter reports cache support based on read price only, as some models support caching without charging for cache writes (e.g. GPT-5)

@@ -24,6 +24,7 @@ describe("OpenRouter API", () => {
 const models = await getOpenRouterModels()

 const openRouterSupportedCaching = Object.entries(models)
+	.filter(([id, _]) => id.startsWith("anthropic/claude") || id.startsWith("google/gemini")) // only these support cache_control breakpoints (https://openrouter.ai/docs/features/prompt-caching)

Would it be helpful to expand this comment to explain the cache_control limitation more clearly? Something like:

Suggested change:
-.filter(([id, _]) => id.startsWith("anthropic/claude") || id.startsWith("google/gemini")) // only these support cache_control breakpoints (https://openrouter.ai/docs/features/prompt-caching)
+.filter(([id, _]) => id.startsWith("anthropic/claude") || id.startsWith("google/gemini")) // Only Anthropic Claude and Google Gemini models support cache_control breakpoints for explicit cache management (https://openrouter.ai/docs/features/prompt-caching)


 for (const endpoint of endpoints) {
-	const providerName = endpoint.name.split("|")[0].trim()
+	const providerName = endpoint.tag ?? endpoint.name

For consistency with the backend implementation, consider adding a comment explaining the fallback logic:

Suggested change:
-const providerName = endpoint.tag ?? endpoint.name
+const providerName = endpoint.tag ?? endpoint.name // Use unique tag when available, fallback to name for backward compatibility

@@ -149,7 +150,7 @@ export async function getOpenRouterModelEndpoints(
 const { id, architecture, endpoints } = data

 for (const endpoint of endpoints) {
-	models[endpoint.provider_name] = parseOpenRouterModel({
+	models[endpoint.tag ?? endpoint.provider_name] = parseOpenRouterModel({

Is this fallback to endpoint.provider_name intentional for backward compatibility? If the OpenRouter API doesn't always provide tags, this is a good defensive approach.

@hannesrudolph hannesrudolph added the Issue/PR - Triage (New issue. Needs quick review to confirm validity and assign labels.) label Aug 18, 2025
@daniel-lxs daniel-lxs moved this from Triage to PR [Needs Prelim Review] in Roo Code Roadmap Aug 19, 2025
@hannesrudolph hannesrudolph added the PR - Needs Preliminary Review label and removed the Issue/PR - Triage (New issue. Needs quick review to confirm validity and assign labels.) label Aug 19, 2025
Labels
bug (Something isn't working) · PR - Needs Preliminary Review · size:M (This PR changes 30-99 lines, ignoring generated files.)
Projects
Status: PR [Needs Prelim Review]