Back to Models
Llama 3.2 3B
metameta/llama-3.2-3b
Llama 3.2 3B lightweight model. Delivers highly accurate results with capabilities including text generation, summarization, sentiment analysis, and contextual understanding. Ideal for edge devices and mobile AI.
Context Length
128K
Max Output
8K
Input Priceper 1M tokens
$0.240/ 1M tokens
Output Priceper 1M tokens
$0.240/ 1M tokens
Modalities
textβtext
Pricing Breakdown
| Type | Rate |
|---|---|
| Input | $0.240 / 1M tokens |
| Output | $0.240 / 1M tokens |
Supported Parameters
temperaturemax_tokenstop_pstop
API Usage Examples
Use the global api.therouter.ai endpoint shown below for new integrations; the legacy China accelerated endpoint is retired.
cURL
curl https://api.therouter.ai/v1/chat/completions -H "Content-Type: application/json" -H "Authorization: Bearer $THE_ROUTER_API_KEY" -d '{
"model": "meta/llama-3.2-3b",
"messages": [
{"role": "user", "content": "Summarize the key points from this input."}
]
}'