Llama 4 Scout
meta/llama-4-scout
Llama 4 Scout is a general-purpose model with 17B active parameters, 16 experts, and 109B total parameters. It features an industry-leading 10M-token context length, which enables multi-document summarization, parsing of extensive user activity, and reasoning over very large codebases.
Context Length
10M tokens
Max Output
16K tokens
Input Price (per 1M tokens)
$0.264
Output Price (per 1M tokens)
$1.02
Modalities
text, image → text
Pricing Breakdown
| Type | Rate |
|---|---|
| Input | $0.264 / 1M tokens |
| Output | $1.02 / 1M tokens |
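
To make the rates in the table concrete, here is a minimal sketch of the per-request cost arithmetic. The helper name `estimate_cost_usd` and the example token counts are illustrative assumptions; only the two rates come from this page, and actual billing may meter or round tokens differently.

```python
from decimal import Decimal

# Rates from the pricing table, in USD per 1M tokens.
INPUT_RATE_PER_M = Decimal("0.264")
OUTPUT_RATE_PER_M = Decimal("1.02")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request (hypothetical helper, not part of any SDK)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) * INPUT_RATE_PER_M / TOKENS_PER_M
    output_cost = Decimal(output_tokens) * OUTPUT_RATE_PER_M / TOKENS_PER_M
    return input_cost + output_cost


if __name__ == "__main__":
    # Example: a 200,000-token prompt with a 2,000-token reply.
    print(estimate_cost_usd(200_000, 2_000))
```

Under these rates, a 200,000-token prompt with a 2,000-token reply comes to roughly $0.055, almost all of it input cost. `Decimal` is used instead of floats so that small per-token amounts add up exactly.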
Supported Parameters
temperature, max_tokens, top_p, tools, tool_choice, response_format, stop
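
One way to use the parameter list above is to check request options against it before sending. The sketch below is a hypothetical client-side helper (`build_payload` is not part of any documented SDK). It assumes the parameters are passed as top-level fields of the request body, as is common for `/v1/chat/completions`-style APIs; this page does not state their value types or ranges.

```python
# Parameters listed as supported on this page.
SUPPORTED_PARAMETERS = frozenset({
    "temperature", "max_tokens", "top_p", "tools",
    "tool_choice", "response_format", "stop",
})


def build_payload(messages, **options):
    """Build a request body, rejecting options this page does not list."""
    unsupported = sorted(set(options) - SUPPORTED_PARAMETERS)
    if unsupported:
        raise ValueError(f"unsupported parameters: {', '.join(unsupported)}")
    # Assumption: options are sent as top-level keys next to "model" and "messages".
    return {"model": "meta/llama-4-scout", "messages": messages, **options}


if __name__ == "__main__":
    payload = build_payload(
        [{"role": "user", "content": "Summarize the key points from this input."}],
        temperature=0.2,
        max_tokens=512,
    )
    print(sorted(payload))
```

Failing early on an unlisted option is a design choice for this sketch; whether the service itself rejects or ignores unknown fields is not documented here.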
API Usage Examples
Use the global api.therouter.ai endpoint shown below for new integrations; the legacy China-accelerated endpoint has been retired.
cURL
curl https://api.therouter.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $THE_ROUTER_API_KEY" \
  -d '{
    "model": "meta/llama-4-scout",
    "messages": [
      {"role": "user", "content": "Summarize the key points from this input."}
    ]
  }'
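
The same request can be sketched in Python using only the standard library. The URL, headers, model ID, and `THE_ROUTER_API_KEY` variable mirror the cURL call; the function names `build_request` and `send` are hypothetical, and the response shape read at the end (`choices[0].message.content`) is an assumption based on the common chat-completions convention, since this page does not document the response format.

```python
import json
import os
import urllib.request

API_URL = "https://api.therouter.ai/v1/chat/completions"


def build_request(prompt: str, api_key: str) -> urllib.request.Request:
    """Mirror the cURL call: same URL, headers, and JSON body."""
    body = {
        "model": "meta/llama-4-scout",
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def send(request: urllib.request.Request, timeout: float = 60.0) -> dict:
    """Send the request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    api_key = os.environ.get("THE_ROUTER_API_KEY")
    if api_key:
        reply = send(build_request("Summarize the key points from this input.", api_key))
        # Assumption: OpenAI-style response shape; not confirmed by this page.
        print(reply["choices"][0]["message"]["content"])
    else:
        print("Set THE_ROUTER_API_KEY to send the request.")
```

Building the request separately from sending it keeps the payload easy to inspect or log before any network call is made.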