feat(usage): integrate LiteLLM pricing catalog and tiered token pricing for proxy usage by allenxu09 · Pull Request #4470 · farion1231/cc-switch

allenxu09 · 2026-06-21T07:47:47Z

Summary

Proxy usage pricing now falls back to LiteLLM's model pricing catalog when no custom pricing is configured for a provider. The catalog is embedded at build time and refreshed at runtime, so unknown models no longer result in missing cost data. Additionally, cost calculation now supports above-200k token tiered pricing, aligning with upstream API pricing models.

Changes

New: src-tauri/src/proxy/usage/pricing.rs — LiteLLM-backed pricing catalog with embedded JSON (litellm-pricing.json) and runtime refresh from GitHub
New: src-tauri/src/proxy/usage/litellm-pricing.json — Compressed LiteLLM model pricing embedded at build time
Modified: src-tauri/src/proxy/usage/calculator.rs — Added tiered pricing (above 200k tokens) for input, output, cache read, and cache creation costs
Modified: src-tauri/src/proxy/usage/logger.rs — Falls back to LiteLLM pricing lookup when no custom pricing is found per provider
Modified: src-tauri/src/proxy/usage/mod.rs — Registered pricing module
Modified: src-tauri/src/proxy/providers/transform_codex_chat.rs — Rust 1.95 formatting alignment
Modified: src/components/usage/PricingConfigPanel.tsx — Updated pricing display to reflect tiered cost structure
Modified: src/i18n/locales/*.json (en, zh, zh-TW, ja) — Updated pricing-related UI strings

How It Works

Embedded fallback: A compressed LiteLLM pricing catalog is embedded in the binary via include_str!
Runtime refresh: On first pricing lookup, a background task fetches the latest catalog from GitHub (at most once per process lifetime)
Model matching: Normalizes model names (strips /prefix, :suffix, replaces ./@/_ → -, lowercases) and performs prefix-based fuzzy matching
Tiered pricing: Costs above 200k tokens use *_above_200k_tokens rates when available

allenxu09 · 2026-06-21T07:48:44Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 7b09adb695

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-06-21T07:54:41Z

+        if let Some(above) = above_200k.filter(|_| tokens > THRESHOLD) {
+            Decimal::from(THRESHOLD) * base + Decimal::from(tokens - THRESHOLD) * above


Apply long-context rates to the whole request

For models such as Gemini 2.5 Pro, the LiteLLM *_above_200k_tokens fields represent the rate to use when the prompt/context exceeds 200k tokens, not a progressive tier within each token bucket. With a 250k-token Gemini prompt and a small output, this charges only the last 50k input tokens at the high input rate and leaves output at the base rate because output_tokens <= 200k, so large-context requests are materially underreported; the calculation needs to select the above-200k input/cache/output rates based on the request context size instead of tiering each component independently.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-06-21T07:54:41Z

+    let model = model.rsplit('/').next().unwrap_or(model);
+    let model = model.split(':').next().unwrap_or(model);


Preserve provider and model qualifiers in pricing keys

These truncations collapse distinct LiteLLM catalog entries before both insertion and lookup. For example, the embedded catalog contains anthropic.claude-haiku-4-5-20251001-v1:0 and bedrock/us-gov-west-1/anthropic.claude-haiku-4-5-20251001-v1:0 at different prices, but both normalize to anthropic-claude-haiku-4-5-20251001-v1, so one overwrites the other and standard traffic can be priced with gov-region rates; OpenAI fine-tune IDs like ft:gpt-4o-mini-... also normalize to just ft. Keep enough of the provider/suffix structure to avoid cross-provider and fine-tune collisions.

Useful? React with 👍 / 👎.

allenxu09 added 3 commits June 20, 2026 23:20

feat: use LiteLLM pricing for proxy usage

97e8e7f

chore: match Rust 1.95 formatting

17885a6

fix: align usage pricing with token types

7b09adb

chatgpt-codex-connector Bot reviewed Jun 21, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(usage): integrate LiteLLM pricing catalog and tiered token pricing for proxy usage#4470

feat(usage): integrate LiteLLM pricing catalog and tiered token pricing for proxy usage#4470
allenxu09 wants to merge 3 commits into
farion1231:mainfrom
allenxu09:main

allenxu09 commented Jun 21, 2026

Uh oh!

allenxu09 commented Jun 21, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot Jun 21, 2026

Uh oh!

chatgpt-codex-connector Bot Jun 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

		if let Some(above) = above_200k.filter(\|_\| tokens > THRESHOLD) {
		Decimal::from(THRESHOLD) * base + Decimal::from(tokens - THRESHOLD) * above

		let model = model.rsplit('/').next().unwrap_or(model);
		let model = model.split(':').next().unwrap_or(model);

Uh oh!

Conversation

allenxu09 commented Jun 21, 2026

Summary

Changes

How It Works

Uh oh!

allenxu09 commented Jun 21, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot Jun 21, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot Jun 21, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant