Bring Your Own Key • GA

Your Keys. Our Infrastructure.

Use your own provider accounts. Get health-aware routing, automatic failover, and full usage visibility without giving up control.

Keep your existing provider relationships and pricing. TheRouter adds reliability and observability on top — your keys, your costs, our network.

OpenAI
Anthropic
Google
Your Keys
75+ Models
4 Providers
2M+ Max Context
<200ms Routing Latency
3 lines To Integrate

Powering AI across leading model brands

Anthropic
OpenAI
Google
xAI

Why TheRouter

Not just an API proxy — a control plane that makes your AI stack more reliable, transparent, and flexible.

1 line
to switch providers

Zero Lock-in

Keep your provider accounts and pricing. Switch models or providers by changing one string — no migration, no rewrite.

  • OpenAI SDK compatible — works with Cursor, Claude Code, and any OpenAI-compatible client
  • Bring your own API keys, keep your existing contracts
  • Add or drop a provider in seconds, not sprints
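The one-string switch can be sketched like this: the request payload stays identical across providers, and only the model identifier changes (the model IDs below are examples):

```typescript
// The request shape never changes across providers;
// switching is literally a one-string edit.
type ChatParams = {
  model: string;
  messages: { role: "user"; content: string }[];
};

function chatParams(model: string): ChatParams {
  return {
    model,
    messages: [{ role: "user", content: "Hello!" }],
  };
}

// Same workload, different provider: swap one string.
const onAnthropic = chatParams("anthropic/claude-sonnet-4.6");
const onOpenAI = chatParams("openai/gpt-4o");
```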
3 controls
fallback, objective, basket

Smart Routing

Automatic failover, provider health tracking, and cost-aware routing across your approved models keep requests moving, with every routing decision recorded so you can see exactly what the router did.

  • Automatic failover when a provider goes down
  • Approved model-basket optimization (Beta)
  • Measured routing evidence in request logs and analytics
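The failover behavior can be pictured as an ordered walk over approved routes, skipping providers currently marked unhealthy. This is an illustration of the observable behavior, not TheRouter's actual implementation:

```typescript
// Illustrative sketch: try approved routes in priority order,
// skipping providers whose health checks are failing.
type Route = { provider: string; healthy: boolean };

function selectRoute(routes: Route[]): Route {
  for (const route of routes) {
    if (route.healthy) return route;
  }
  throw new Error("all approved routes are down");
}

const selected = selectRoute([
  { provider: "anthropic", healthy: false }, // primary is down
  { provider: "openai", healthy: true },     // failover target
  { provider: "google", healthy: true },
]);
// selected.provider is "openai": the first healthy route in priority order
```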
100%
spend transparency

Full Visibility

See exactly where every token goes. Track usage by team, key, or model — and set limits before you get a surprise bill.

  • Real-time token & cost tracking per API key
  • Spending limits and budget alerts
  • Full audit trail for every request
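The per-key budget idea reduces to simple bookkeeping: tally cost per API key and flag keys over their limit. A minimal sketch with hypothetical field names:

```typescript
// Hypothetical usage-event shape: cost attributed to an API key.
type UsageEvent = { apiKey: string; costUsd: number };

// Return the keys whose accumulated spend exceeds their budget.
function overBudget(
  events: UsageEvent[],
  limits: Record<string, number>
): string[] {
  const spend: Record<string, number> = {};
  for (const e of events) spend[e.apiKey] = (spend[e.apiKey] ?? 0) + e.costUsd;
  return Object.keys(spend).filter((k) => spend[k] > (limits[k] ?? Infinity));
}

const flagged = overBudget(
  [
    { apiKey: "team-a", costUsd: 40 },
    { apiKey: "team-a", costUsd: 70 },
    { apiKey: "team-b", costUsd: 5 },
  ],
  { "team-a": 100, "team-b": 100 }
);
// flagged contains only "team-a" (110 > 100)
```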

Cut AI Spend With Guardrails

TheRouter lets you compare baseline cost against the selected route, shadow-test cheaper approved models, and verify the savings in logs and analytics.

Beta

Approved Model Basket

Pick a baseline model, then approve cheaper alternatives for the same workload. Start in shadow mode, then opt into live cost routing only for that basket.

Baseline: Claude Sonnet 4.6 → Approved alt: GPT-4.1 mini
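The shadow-then-live progression can be sketched as a routing decision: in shadow mode the baseline always serves traffic while the approved alternative is only evaluated; going live lets the alternative serve eligible requests. Model names and field names here are illustrative:

```typescript
// Illustrative basket decision: which model serves, and which
// (if any) is evaluated in shadow mode alongside it.
type BasketDecision = { serve: string; shadow?: string };

function routeBasket(opts: {
  baseline: string;
  approved: string[];
  live: boolean;
}): BasketDecision {
  if (!opts.live) {
    // Shadow mode: baseline serves; the candidate is only scored.
    return { serve: opts.baseline, shadow: opts.approved[0] };
  }
  // Live cost routing, opted into for this basket only.
  return { serve: opts.approved[0] };
}

const shadowPhase = routeBasket({
  baseline: "anthropic/claude-sonnet-4.6",
  approved: ["openai/gpt-4.1-mini"],
  live: false,
});
// shadowPhase.serve is still the baseline; the alternative runs in shadow
```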
GA

Prompt Caching

Long repeated instructions can bill at cached rates when the upstream route supports it. This is especially useful for agents and workflows with heavy system prompts.

Cached prompt input can be ~10x cheaper than uncached input
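The ~10x figure compounds quickly for agents that resend a heavy system prompt on every call. A back-of-envelope sketch (the rates and token counts below are illustrative assumptions, not TheRouter pricing):

```typescript
// Illustrative rates: $3.00 per 1M uncached input tokens,
// cached input assumed ~10x cheaper.
const uncachedPerToken = 3.0 / 1_000_000;
const cachedPerToken = uncachedPerToken / 10;

// Agent workload: a 50k-token system prompt resent on 1,000 calls.
const systemTokens = 50_000;
const calls = 1_000;

const withoutCache = systemTokens * calls * uncachedPerToken; // $150
const withCache =
  systemTokens * uncachedPerToken +             // first call populates the cache
  systemTokens * (calls - 1) * cachedPerToken;  // later calls hit cached rates
// withCache ≈ $15.14, roughly 10x less than the $150 uncached cost
```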
GA

Measured Savings Evidence

Request logs and Activity show baseline charge, selected route, realized savings, and shadow-mode recommendations so teams can verify what changed.

Logs: baseline vs selected vs saved, per request
Illustrative example: support triage team
Before: every request stays on one premium baseline model
After: baseline stays protected, a cheaper approved model runs in shadow first, then goes live for eligible flows
Illustrative: 20-35% lower model spend

Illustrative example only. Actual savings depend on your approved model basket, traffic mix, and prompt shape.
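The baseline-vs-selected bookkeeping shown in the logs is simple arithmetic. A sketch of how a per-request savings line could be derived (the field names are hypothetical, not TheRouter's log schema):

```typescript
// Hypothetical log fields: what the baseline model would have charged
// versus what the selected route actually charged.
type RequestLog = { baselineUsd: number; selectedUsd: number };

function savings(log: RequestLog): { savedUsd: number; savedPct: number } {
  const savedUsd = log.baselineUsd - log.selectedUsd;
  return { savedUsd, savedPct: (savedUsd / log.baselineUsd) * 100 };
}

// A request rerouted from a $0.05 baseline to a $0.035 route
// saves $0.015, i.e. 30% — inside the 20-35% illustration above.
const row = savings({ baselineUsd: 0.05, selectedUsd: 0.035 });
```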

Up and Running in Minutes

Change one line. Get access to every major AI model with built-in reliability.

Point Your SDK

Swap your base URL to TheRouter. Works with any OpenAI-compatible client — Cursor, Claude Code, LangChain, your own app.

We Apply Your Policy

TheRouter applies provider health, fallback rules, and your routing objective. If the first route fails, it moves to the next approved path automatically.

You Get Results

Same response format you already use. Plus usage tracking, cost visibility, and team controls — with zero extra code.

Diagram: Your App → TheRouter.ai (Policy + Failover) → Anthropic / OpenAI / Google

Featured Models

Access top-tier models from leading providers through a single unified API.

View all models
Claude Sonnet 4.6 (Anthropic, 1,000,000 ctx)
Input: $3.60 / 1M tokens
Output: $18.00 / 1M tokens
Capabilities: text, image, pdf

GPT-4o (OpenAI, 128,000 ctx)
Input: $3.00 / 1M tokens
Output: $12.00 / 1M tokens
Capabilities: text, image

Gemini 2.5 Pro (Google, 1,048,576 ctx)
Input: $1.50 / 1M tokens
Output: $12.00 / 1M tokens
Capabilities: text, image, pdf
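The listed rates make per-request cost easy to estimate. A small sketch using the prices above (USD per 1M tokens; model IDs follow the provider/model convention used elsewhere on this page):

```typescript
// Prices from the model cards above, in USD per 1M tokens.
const pricing: Record<string, { input: number; output: number }> = {
  "anthropic/claude-sonnet-4.6": { input: 3.6, output: 18.0 },
  "openai/gpt-4o": { input: 3.0, output: 12.0 },
  "google/gemini-2.5-pro": { input: 1.5, output: 12.0 },
};

function costUsd(
  model: string,
  inputTokens: number,
  outputTokens: number
): number {
  const p = pricing[model];
  return (inputTokens * p.input + outputTokens * p.output) / 1_000_000;
}

// The same 2,000-in / 500-out request on two models:
const sonnet = costUsd("anthropic/claude-sonnet-4.6", 2_000, 500); // $0.0162
const gemini = costUsd("google/gemini-2.5-pro", 2_000, 500);       // $0.0090
```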

Start building in 3 lines of code

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.therouter.ai/v1",
  apiKey: "sk-your-key",
});

const response = await client.chat.completions.create({
  model: "anthropic/claude-sonnet-4.6",
  messages: [{ role: "user", content: "Hello!" }],
});