Back to Models
Llama 3.1 8B
metameta/llama-3.1-8b
Llama 3.1 8B with 128K context length, multilinguality, and improved reasoning. Optimized for multilingual dialogue, efficient inference on consumer hardware.
Context Length
128K
Max Output
8K
Input Priceper 1M tokens
$0.336/ 1M tokens
Output Priceper 1M tokens
$0.336/ 1M tokens
Modalities
textβtext
Pricing Breakdown
| Type | Rate |
|---|---|
| Input | $0.336 / 1M tokens |
| Output | $0.336 / 1M tokens |
Supported Parameters
temperaturemax_tokenstop_ptoolstool_choicestop
API Usage Examples
Use the global api.therouter.ai endpoint shown below for new integrations; the legacy China accelerated endpoint is retired.
cURL
curl https://api.therouter.ai/v1/chat/completions -H "Content-Type: application/json" -H "Authorization: Bearer $THE_ROUTER_API_KEY" -d '{
"model": "meta/llama-3.1-8b",
"messages": [
{"role": "user", "content": "Summarize the key points from this input."}
]
}'