Estimate DeepSeek API costs before you build. This calculator helps you forecast per-request, daily, monthly, and yearly spend for both deepseek-chat and deepseek-reasoner using current DeepSeek-V3.2 API pricing, including cache-hit discounts.
Both API models currently use the same published token rates, but your final bill still depends on workload shape—especially input size, output length, and how much repeated context is reused from cache. Use it to compare chat-style requests with reasoning-heavy workflows before you ship.
DeepSeek API Cost Calculator
DeepSeek V3.2 Pricing
All prices per 1 million tokens (USD). Last verified: April 26, 2026 — Source: DeepSeek Models & Pricing | DeepSeek Platform
| Model | Input (Cache Hit)(2) | Input (Cache Miss) | Output | Context | Max Output |
|---|---|---|---|---|---|
|
DeepSeek-V4-Flash(1) Fast tier — supports both non-thinking and thinking modes |
$0.0028 | $0.140 | $0.28 | 1M | 384K |
|
DeepSeek-V4-Pro Advanced tier — 75% off until 2026-05-05; regular: $1.74 / $3.48 |
$0.003625(3) | $0.435(3) | $0.87(3) | 1M | 384K |
- 1 The model names
deepseek-chatanddeepseek-reasonerwill be deprecated in the future. For compatibility, they correspond to the non-thinking mode and thinking mode ofdeepseek-v4-flash, respectively. - 2 For all models, the input cache hit price has been reduced to 1/10 of the launch price.
- 3 The
deepseek-v4-promodel is currently offered at a limited-time 75% discount, valid until 05/05/2026 15:59 UTC.
DeepSeek-V4-Flash
- JSON Output
- Tool Calls
- Chat Prefix (Beta)
- FIM Completion (Beta)
- Thinking Mode
DeepSeek-V4-Pro
- JSON Output
- Tool Calls
- Chat Prefix (Beta)
- FIM Completion (Beta)
- Thinking Mode
Count Your Tokens
Need setup help? Read our DeepSeek API guide. Need the bigger picture? Start with our main DeepSeek AI guide.
