Back to Models

Llama 3.2 3B

metameta/llama-3.2-3b

Llama 3.2 3B lightweight model. Delivers highly accurate results with capabilities including text generation, summarization, sentiment analysis, and contextual understanding. Ideal for edge devices and mobile AI.

Context Length
128K
Max Output
8K
Input Priceper 1M tokens
$0.240/ 1M tokens
Output Priceper 1M tokens
$0.240/ 1M tokens

Modalities

text→text

Pricing Breakdown

TypeRate
Input$0.240 / 1M tokens
Output$0.240 / 1M tokens

Supported Parameters

temperaturemax_tokenstop_pstop

API Usage Examples

Use the global api.therouter.ai endpoint shown below for new integrations; the legacy China accelerated endpoint is retired.

cURL
curl https://api.therouter.ai/v1/chat/completions   -H "Content-Type: application/json"   -H "Authorization: Bearer $THE_ROUTER_API_KEY"   -d '{
    "model": "meta/llama-3.2-3b",
    "messages": [
      {"role": "user", "content": "Summarize the key points from this input."}
    ]
  }'
Customer Support