Back to Models

Gemini 2.5 Flash Lite

googlegoogle/gemini-2.5-flash-lite

Most balanced Gemini model for low latency use cases. Optimized for high-volume, cost-sensitive workloads with strong quality at minimal cost.

Context Length
1.0M
Max Output
66K
Input Priceper 1M tokens
$0.120/ 1M tokens
Output Priceper 1M tokens
$0.480/ 1M tokens

Modalities

textimagepdf→text

Pricing Breakdown

TypeRate
Input$0.120 / 1M tokens
Output$0.480 / 1M tokens

Supported Parameters

temperaturemax_tokenstop_ptoolstool_choiceresponse_formatstop

API Usage Examples

Use the global api.therouter.ai endpoint shown below for new integrations; the legacy China accelerated endpoint is retired.

cURL
curl https://api.therouter.ai/v1/chat/completions   -H "Content-Type: application/json"   -H "Authorization: Bearer $THE_ROUTER_API_KEY"   -d '{
    "model": "google/gemini-2.5-flash-lite",
    "messages": [
      {"role": "user", "content": "Summarize the key points from this input."}
    ]
  }'
Customer Support