-DEEPSEEK_PROVIDER=deepseek # Adds prefix to model names (deepseek/deepseek-chat) for LiteLLM
+DEEPSEEK_PROVIDER=deepseek # Adds prefix to model names (deepseek/deepseek-v4-flash) for LiteLLM
 ```

 #### Supported Models

-PentAGI supports 2 DeepSeek-V3.2 models with tool calling, streaming, thinking modes, and context caching. Both models are used in default configuration.
+PentAGI supports 2 DeepSeek V4 models with tool calling, streaming, thinking modes, and context caching. Models marked with `*` are used in default configuration.

-| Model ID | Thinking | Context | Max Output | Price (Input/Output/Cache) | Use Case |
-**Prices**: Per 1M tokens. Cache pricing is for prompt caching (10% of input cost). Models with thinking support include reinforcement learning chain-of-thought reasoning.
+**Prices**: Per 1M tokens. Cache pricing applies to prompt tokens served from cache and is heavily discounted versus input price. Models with thinking support include reinforcement learning chain-of-thought reasoning.
+
+> The legacy model names `deepseek-chat` and `deepseek-reasoner` are scheduled
+> for deprecation by DeepSeek on 2026-07-24. Existing user configurations
+> referencing the legacy names continue to work until then; the defaults above
+> use the current V4 names.

 **Key Features**:
-- **Automatic Prompt Caching**: 40-60% cost reduction on repeated context (10% of input price)
backend/docs/config.md: 1 addition & 1 deletion
@@ -610,7 +610,7 @@ These settings control the integration with various Large Language Model (LLM) p
 | DeepSeekServerURL | `DEEPSEEK_SERVER_URL` | `https://api.deepseek.com` | DeepSeek API endpoint URL |
 | DeepSeekProvider | `DEEPSEEK_PROVIDER` | *(none)* | Provider name prefix for LiteLLM integration (optional) |

-**LiteLLM Integration**: Set `DEEPSEEK_PROVIDER=deepseek` to enable model prefixing (e.g., `deepseek/deepseek-chat`) when using LiteLLM proxy with default PentAGI configs.
+**LiteLLM Integration**: Set `DEEPSEEK_PROVIDER=deepseek` to enable model prefixing (e.g., `deepseek/deepseek-v4-flash`) when using LiteLLM proxy with default PentAGI configs.
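
For reviewers unfamiliar with the prefixing behaviour described in the config.md change above, here is a minimal sketch of what "model prefixing" amounts to. It is not PentAGI's implementation: the function name `litellm_model_name` is hypothetical, and the choice to leave an already-prefixed name untouched is an assumption made for illustration.

```python
import os
from typing import Optional


def litellm_model_name(model: str, provider: Optional[str]) -> str:
    """Return `model` with a `provider/` prefix when a provider is set.

    Hypothetical helper: illustrates the documented effect of
    DEEPSEEK_PROVIDER, not the actual PentAGI code path.
    """
    if not provider:
        # DEEPSEEK_PROVIDER unset or empty: model names are sent unchanged.
        return model
    prefix = provider.strip("/") + "/"
    if model.startswith(prefix):
        # Assumption: avoid double-prefixing an already-qualified name.
        return model
    return prefix + model


# Read the variable the same way a deployment would set it.
provider = os.environ.get("DEEPSEEK_PROVIDER", "")
print(litellm_model_name("deepseek-v4-flash", provider))
```

With `DEEPSEEK_PROVIDER=deepseek` this yields the `deepseek/deepseek-v4-flash` form quoted in the updated documentation; with the variable unset, the bare model name passes through.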
-  description: DeepSeek-V3.2 (Non-thinking Mode) - Suitable for general dialogue, code generation, and tool calling tasks. Supports JSON Output, Tool Calls, Chat Prefix Completion, and FIM Completion. 128K context, max output 8K
+- name: deepseek-v4-flash
+  description: DeepSeek V4 Flash - Cost-efficient general-purpose model suitable for dialogue, code generation, and tool calling. Supports JSON output and tool calls. 1M context, up to 384K output tokens.
   thinking: false
   price:
-    input: 0.28
-    output: 0.42
-    cache_read: 0.028
+    input: 0.14
+    output: 0.28
+    cache_read: 0.0028

-- name: deepseek-reasoner
-  description: DeepSeek-V3.2 (Thinking Mode) - Advanced reasoning model with reinforcement learning chain-of-thought capabilities, suitable for complex logic, mathematical reasoning, and security analysis tasks. 128K context, max output 64K
+- name: deepseek-v4-pro
+  description: DeepSeek V4 Pro - Higher-tier reasoning model suitable for complex logic, mathematical reasoning, and security analysis. 1M context, up to 384K output tokens.
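
To make the per-1M-token pricing in this hunk concrete, the following sketch estimates the cost of a single request from the new `deepseek-v4-flash` values (`input: 0.14`, `output: 0.28`, `cache_read: 0.0028`). The `Price` class and `estimate_cost` function are hypothetical names, and the billing model (cached prompt tokens charged at `cache_read` instead of `input`) is an assumption based on the README's description of cache pricing, not a statement of how DeepSeek or PentAGI computes charges.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Price:
    """Prices per 1M tokens, mirroring the YAML `price` keys."""
    input: float
    output: float
    cache_read: float


# Values copied from the added lines for deepseek-v4-flash in this diff.
V4_FLASH = Price(input=0.14, output=0.28, cache_read=0.0028)


def estimate_cost(price: Price, prompt_tokens: int,
                  cached_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in the price list's currency unit.

    Assumption: cached prompt tokens are billed at `cache_read`,
    the remaining prompt tokens at `input`.
    """
    if not 0 <= cached_tokens <= prompt_tokens:
        raise ValueError("cached_tokens must be between 0 and prompt_tokens")
    uncached = prompt_tokens - cached_tokens
    total = (uncached * price.input
             + cached_tokens * price.cache_read
             + output_tokens * price.output)
    return total / 1_000_000


# Example: a 200K-token prompt, three quarters of it cached, 4K output tokens.
cost = estimate_cost(V4_FLASH, prompt_tokens=200_000,
                     cached_tokens=150_000, output_tokens=4_000)
print(f"{cost:.6f}")
```

With these values the cache-read price is 2% of the input price (0.0028 / 0.14), which is consistent with the README wording changing from "10% of input cost" to "heavily discounted versus input price".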