Estimate DeepSeek API costs before you build. This calculator helps you forecast per-request, daily, monthly, and yearly spend for both deepseek-v4-flash and deepseek-v4-pro using current DeepSeek V4 API pricing, including cache-hit discounts. The legacy deepseek-chat and deepseek-reasoner aliases now map to V4-Flash non-thinking and thinking modes.
V4-Flash and V4-Pro share a 1M context window and 384K max output, but pricing differs sharply: V4-Flash is the low-cost workhorse, while V4-Pro targets the hardest reasoning, math, and code workloads (currently 75% off through 2026/05/31 15:59 UTC). Use the calculator to compare workloads before you ship.
DeepSeek API Cost Calculator
DeepSeek V4 Pricing
All prices per 1 million tokens (USD). Last verified: May 1, 2026 — Source: DeepSeek Models & Pricing | DeepSeek Platform
| Model | Input (Cache Hit)(2) | Input (Cache Miss) | Output | Context | Max Output |
|---|---|---|---|---|---|
|
DeepSeek-V4-Flash(1) Fast tier — supports both non-thinking and thinking modes |
$0.0028 | $0.140 | $0.28 | 1M | 384K |
|
DeepSeek-V4-Pro Advanced tier — 75% off until 2026/05/31 15:59 UTC; regular: $1.74 / $3.48 |
$0.003625(3) | $0.435(3) | $0.87(3) | 1M | 384K |
- 1 The model names
deepseek-chatanddeepseek-reasonerwill be deprecated in the future. For compatibility, they correspond to the non-thinking mode and thinking mode ofdeepseek-v4-flash, respectively. - 2 For all models, the input cache hit price has been reduced to 1/10 of the launch price.
- 3 The
deepseek-v4-promodel is currently offered at a limited-time 75% discount, valid until 2026/05/31 15:59 UTC.
DeepSeek-V4-Flash
- JSON Output
- Tool Calls
- Chat Prefix (Beta)
- FIM Completion (Beta)
- Thinking Mode
DeepSeek-V4-Pro
- JSON Output
- Tool Calls
- Chat Prefix (Beta)
- FIM Completion (Beta)
- Thinking Mode
Count Your Tokens
Need setup help? Read our DeepSeek API guide. Need the bigger picture? Start with our main DeepSeek AI guide.
