Qwen3 Next 80B

qwen/qwen3-next-80b

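Qwen3 Next is a next-generation hybrid Transformer-Mamba model: 80B total parameters with 3B active per token, using a mixture-of-experts (MoE) design with 512 experts. It is reported to deliver 10x the inference throughput of Qwen3-32B on long contexts.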

Context Length

262K

Max Output

16K

Input Price (per 1M tokens)

$0.240 / 1M tokens

Output Price (per 1M tokens)

$0.960 / 1M tokens

Modalities

text→text

Pricing Breakdown

Type	Rate
Input	$0.240 / 1M tokens
Output	$0.960 / 1M tokens
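
As a rough illustration of how the rates above translate into a per-request cost, the sketch below multiplies token counts by the listed prices. The function name `estimate_cost_usd` and the example token counts are hypothetical, and it assumes simple linear billing with no caching discounts or minimum charges, which this page does not confirm.

```python
# Rates copied from the pricing table above (USD per 1M tokens).
INPUT_USD_PER_MTOK = 0.240
OUTPUT_USD_PER_MTOK = 0.960


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request, assuming simple linear billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / 1_000_000 * INPUT_USD_PER_MTOK
    output_cost = output_tokens / 1_000_000 * OUTPUT_USD_PER_MTOK
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical request: 200K tokens of input, 4K tokens of output.
    print(f"${estimate_cost_usd(200_000, 4_000):.4f}")
```

Because output tokens cost four times as much as input tokens at these rates, capping `max_tokens` is likely the more effective lever for controlling spend on generation-heavy workloads.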

Supported Parameters

temperature, max_tokens, top_p, tools, tool_choice, response_format, reasoning, stop
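
To show how these parameters might be attached to a request, the sketch below builds a request body and rejects any parameter name not listed above. The helper `build_payload` and the example values are hypothetical. It assumes the parameters sit at the top level of the JSON body, as in OpenAI-style chat-completions APIs; this page does not document the accepted value shapes for `tools`, `tool_choice`, `response_format`, or `reasoning`, so the example leaves them out.

```python
import json

# Parameter names listed on this page as supported for the model.
SUPPORTED_PARAMETERS = {
    "temperature", "max_tokens", "top_p", "tools",
    "tool_choice", "response_format", "reasoning", "stop",
}


def build_payload(prompt: str, **params) -> dict:
    """Build a chat-completions request body, rejecting unlisted parameters."""
    unknown = set(params) - SUPPORTED_PARAMETERS
    if unknown:
        raise ValueError(f"unsupported parameters: {sorted(unknown)}")
    payload = {
        "model": "qwen/qwen3-next-80b",
        "messages": [{"role": "user", "content": prompt}],
    }
    # Assumption: parameters are top-level keys next to "model" and "messages".
    payload.update(params)
    return payload


if __name__ == "__main__":
    body = build_payload(
        "Summarize the key points from this input.",
        temperature=0.2,  # illustrative value
        max_tokens=512,   # illustrative value, well below the 16K output cap
        stop=["\n\n"],    # illustrative stop sequence
    )
    print(json.dumps(body, indent=2))
```

Checking names on the client side is optional, but it surfaces typos such as `max_token` before a request is sent.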

API Usage Examples

Use the global api.therouter.ai endpoint shown below for new integrations; the legacy China accelerated endpoint is retired.

cURL

curl https://api.therouter.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $THE_ROUTER_API_KEY" \
  -d '{
    "model": "qwen/qwen3-next-80b",
    "messages": [
      {"role": "user", "content": "Summarize the key points from this input."}
    ]
  }'
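
Python

For integrations that do not shell out to cURL, the same request can be sketched with Python's standard library alone. The helper names `build_request` and `send_request` and the 60-second timeout are choices made for this illustration. The response is returned as parsed JSON without further unpacking, because this page does not document the response schema.

```python
import json
import urllib.request

API_URL = "https://api.therouter.ai/v1/chat/completions"


def build_request(prompt: str, api_key: str) -> urllib.request.Request:
    """Build the same POST request that the cURL example sends."""
    body = {
        "model": "qwen/qwen3-next-80b",
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def send_request(request: urllib.request.Request, timeout: float = 60.0) -> dict:
    """Send the request and return the parsed JSON response body."""
    # urlopen raises urllib.error.HTTPError on 4xx/5xx status codes.
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

A typical call would read the key from the environment, for example `send_request(build_request("Summarize the key points from this input.", os.environ["THE_ROUTER_API_KEY"]))` after `import os`, which keeps the key out of source control in the same way the cURL example does.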