Back to Models

Llama 3.2 1B

metameta/llama-3.2-1b

Llama 3.2 1B lightweight model with on-device processing for improved security and privacy. Ideal for multilingual dialogue, personal information management, knowledge retrieval, and rewriting tasks on edge devices.

Context Length
128K
Max Output
8K
Input Priceper 1M tokens
$0.156/ 1M tokens
Output Priceper 1M tokens
$0.156/ 1M tokens

Modalities

text→text

Pricing Breakdown

TypeRate
Input$0.156 / 1M tokens
Output$0.156 / 1M tokens

Supported Parameters

temperaturemax_tokenstop_pstop

API Usage Examples

Use the global api.therouter.ai endpoint shown below for new integrations; the legacy China accelerated endpoint is retired.

cURL
curl https://api.therouter.ai/v1/chat/completions   -H "Content-Type: application/json"   -H "Authorization: Bearer $THE_ROUTER_API_KEY"   -d '{
    "model": "meta/llama-3.2-1b",
    "messages": [
      {"role": "user", "content": "Summarize the key points from this input."}
    ]
  }'
Customer Support