Back to Models
DeepSeek V4 Flash
deepseekdeepseek/deepseek-v4-flash
DeepSeek V4 Flash β fast, cost-efficient model with 1M context window. Supports reasoning, tool calling, and structured output.
Context Length
1M
Max Output
384K
Input Priceper 1M tokens
$0.240/ 1M tokens
Output Priceper 1M tokens
$0.480/ 1M tokens
Modalities
textβtext
Pricing Breakdown
| Type | Rate |
|---|---|
| Input | $0.240 / 1M tokens |
| Output | $0.480 / 1M tokens |
Supported Parameters
temperaturetop_pfrequency_penaltypresence_penaltymax_tokensmax_completion_tokensstoptoolstool_choiceresponse_formatstreamreasoning
API Usage Examples
Use the global api.therouter.ai endpoint shown below for new integrations; the legacy China accelerated endpoint is retired.
cURL
curl https://api.therouter.ai/v1/chat/completions -H "Content-Type: application/json" -H "Authorization: Bearer $THE_ROUTER_API_KEY" -d '{
"model": "deepseek/deepseek-v4-flash",
"messages": [
{"role": "user", "content": "Summarize the key points from this input."}
]
}'