Provider profile: SiliconFlow

SiliconFlow

Global · 21 models

200+ models including 3 completely free — DeepSeek, Qwen & more

SiliconFlow is a high-performance AI inference platform with 200+ optimized models, OpenAI-compatible APIs, and a privacy-first approach. Through our partnership, TheRouter offers 3 models completely free — including Qwen3-8B, DeepSeek-R1-Distill-Qwen-7B, and DeepSeek-OCR — plus access to DeepSeek, Qwen, GLM, Kimi, and more.

  • 3 models completely free — Qwen3-8B, DeepSeek-R1-Distill-Qwen-7B & DeepSeek-OCR at $0
  • 200+ optimized models including DeepSeek, Qwen, GLM, Kimi & more
  • OpenAI-compatible API — drop in your existing code
  • Privacy-first: no data stored, ever
Free models · Reasoning · Coding · Vision · 200+ models

Quickstart

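from openai import OpenAI

# Point the standard OpenAI client at TheRouter's OpenAI-compatible endpoint.
client = OpenAI(
    base_url="https://api.therouter.ai/v1",
    api_key="YOUR_THEROUTER_KEY",
)

# Send a single-turn chat request to DeepSeek R1.
response = client.chat.completions.create(
    model="deepseek/deepseek-r1",
    messages=[{"role": "user", "content": "Explain quantum entanglement"}],
    max_tokens=512,
)
# Print the text of the first (and only) completion choice.
print(response.choices[0].message.content)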

Use the global api.therouter.ai endpoint for new integrations; the legacy China accelerated endpoint is retired.

Models

DeepSeek R1: DeepSeek's flagship reasoning model with extended chain-of-thought
DeepSeek V3.2: DeepSeek's latest chat model balancing speed and intelligence
DeepSeek V3.1: Cost-effective chat model for high-throughput workloads
Qwen3 235B: Alibaba's flagship MoE model
Qwen3 32B: Efficient mid-size Qwen3 model
Qwen Coder 480B: Massive coding-focused model
Qwen Coder 30B: Lightweight code generation specialist
Qwen3 8B: Fast and efficient small model

Frequently Asked Questions

Why use TheRouter for SiliconFlow instead of calling SiliconFlow directly?

TheRouter adds automatic failover (SiliconFlow fails → Bedrock takes over), a single API key for all providers, usage analytics, spend controls, and team governance — all without changing your code.

Which SiliconFlow models are free on TheRouter?

Three models are completely free: Qwen3-8B (general chat), DeepSeek-R1-Distill-Qwen-7B (reasoning), and DeepSeek-OCR (document extraction). No credit card required, no usage limits.

Does SiliconFlow support streaming and function calling?

Yes. SiliconFlow's API is OpenAI-compatible and supports streaming, function calling (tools), and reasoning content passthrough. The reasoning_content field from DeepSeek R1 is preserved end-to-end.
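
As a rough illustration of the streaming and reasoning passthrough described above, the sketch below separates reasoning text from answer text in a streamed response. It reuses the endpoint and model ID from the Quickstart. The helper names split_stream and stream_r1 are hypothetical, and it assumes reasoning_content arrives as an extra field on each streamed delta, which the OpenAI Python SDK does not declare in its typed schema. That is why the field is read defensively with getattr.

```python
from types import SimpleNamespace


def split_stream(chunks):
    """Collect reasoning text and answer text from streamed chat chunks.

    Hypothetical helper: assumes each delta may carry an extra
    `reasoning_content` field alongside the standard `content` field.
    """
    reasoning_parts, answer_parts = [], []
    for chunk in chunks:
        if not chunk.choices:  # skip chunks that carry no choices
            continue
        delta = chunk.choices[0].delta
        # Not part of the typed OpenAI schema, so read it defensively.
        reasoning = getattr(delta, "reasoning_content", None)
        if reasoning:
            reasoning_parts.append(reasoning)
        content = getattr(delta, "content", None)
        if content:
            answer_parts.append(content)
    return "".join(reasoning_parts), "".join(answer_parts)


def stream_r1(prompt, api_key="YOUR_THEROUTER_KEY"):
    """Stream a DeepSeek R1 reply via TheRouter (requires network and a key)."""
    from openai import OpenAI  # imported lazily so split_stream works offline

    client = OpenAI(base_url="https://api.therouter.ai/v1", api_key=api_key)
    stream = client.chat.completions.create(
        model="deepseek/deepseek-r1",
        messages=[{"role": "user", "content": prompt}],
        max_tokens=512,
        stream=True,  # yields incremental chunks instead of one response
    )
    return split_stream(stream)


if __name__ == "__main__":
    # Offline demo with stand-in chunks shaped like the SDK's stream objects.
    demo = [
        SimpleNamespace(choices=[SimpleNamespace(
            delta=SimpleNamespace(reasoning_content="Consider the pair. ", content=None))]),
        SimpleNamespace(choices=[SimpleNamespace(
            delta=SimpleNamespace(content="Entangled particles share a state."))]),
    ]
    reasoning, answer = split_stream(demo)
    print("reasoning:", reasoning)
    print("answer:", answer)
```

Keeping the accumulation logic in its own function means it can be exercised without a network call; stream_r1 simply feeds it the live stream.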

What is SiliconFlow?

SiliconFlow is a high-performance AI inference platform with 200+ optimized models, OpenAI-compatible APIs, and a privacy-first approach — no data stored, ever. TheRouter integrates with their international platform at siliconflow.com.
