Llama 3.1 70B

meta/llama-3.1-70b

Llama 3.1 70B offers an expanded 128K context window, multilingual support, and improved reasoning. It is optimized for multilingual dialogue and assistant-like chat.

Context Length: 128K
Max Output: 8K
Input Price: $1.08 / 1M tokens
Output Price: $1.08 / 1M tokens

Modalities

text→text

Pricing Breakdown

Type     Rate
Input    $1.08 / 1M tokens
Output   $1.08 / 1M tokens
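
As a rough illustration of how the rates above translate into a per-request cost, here is a minimal sketch. The function name `estimate_cost` and the token counts are hypothetical; only the two $1.08 per 1M token rates come from this page, and actual billing may differ (for example in how tokens are counted or rounded).

```python
# Rates taken from the pricing table, in USD per 1M tokens.
INPUT_RATE_PER_M = 1.08
OUTPUT_RATE_PER_M = 1.08


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return an estimated USD cost for one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_RATE_PER_M + output_tokens * OUTPUT_RATE_PER_M
    return total / 1_000_000


# Illustrative request: 12,000 prompt tokens and 800 completion tokens.
print(f"${estimate_cost(12_000, 800):.6f}")  # prints $0.013824
```

Because input and output are priced identically for this model, the estimate depends only on the total token count.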

Supported Parameters

temperature, max_tokens, top_p, tools, tool_choice, stop
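
A small sketch of how these parameters might be attached to a request body. The helper `build_payload` is hypothetical, the parameter values are illustrative only, and the assumption that each parameter sits at the top level of the JSON body (alongside `model` and `messages`) is not confirmed by this page.

```python
import json

# Parameter names listed under "Supported Parameters" on this page.
SUPPORTED_PARAMETERS = {
    "temperature", "max_tokens", "top_p", "tools", "tool_choice", "stop",
}


def build_payload(prompt: str, **params) -> dict:
    """Build a chat request body, rejecting parameters this page does not list."""
    unsupported = set(params) - SUPPORTED_PARAMETERS
    if unsupported:
        raise ValueError(f"unsupported parameters: {sorted(unsupported)}")
    payload = {
        "model": "meta/llama-3.1-70b",
        "messages": [{"role": "user", "content": prompt}],
    }
    payload.update(params)  # assumed to be top-level fields of the body
    return payload


# Illustrative values; tune them for your own workload.
payload = build_payload(
    "Summarize the key points from this input.",
    temperature=0.2,
    top_p=0.9,
    max_tokens=512,
    stop=["\n\n"],
)
print(json.dumps(payload, indent=2))
```

Rejecting unknown names locally is a design choice for catching typos early; the service itself may ignore or reject them differently.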

API Usage Examples

Use the global api.therouter.ai endpoint shown below for new integrations; the legacy China accelerated endpoint is retired.

cURL
curl https://api.therouter.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $THE_ROUTER_API_KEY" \
  -d '{
    "model": "meta/llama-3.1-70b",
    "messages": [
      {"role": "user", "content": "Summarize the key points from this input."}
    ]
  }'
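
The same request can be sketched in Python using only the standard library. This mirrors the cURL call above (URL, headers, model ID, and the `THE_ROUTER_API_KEY` environment variable); the helper name `build_request` is hypothetical, and the response format is not documented on this page, so the sending step is left commented out and simply prints whatever JSON comes back.

```python
import json
import os
import urllib.request

API_URL = "https://api.therouter.ai/v1/chat/completions"


def build_request(prompt: str, api_key: str) -> urllib.request.Request:
    """Construct (but do not send) the POST request shown in the cURL example."""
    body = json.dumps({
        "model": "meta/llama-3.1-70b",
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


request = build_request(
    "Summarize the key points from this input.",
    os.environ.get("THE_ROUTER_API_KEY", ""),
)

# Sending is commented out so the sketch runs without a key or network access:
# with urllib.request.urlopen(request, timeout=30) as response:
#     print(json.load(response))
```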