DeepSeek-R1:
Everything you need to know about the model

DeepSeek-R1 is a model available through Merge Gateway via Amazon Bedrock. Use it with Gateway routing policies, spend controls, request logs, and a 163,840 token context window. It supports streaming through at least one Gateway vendor route.

DeepSeek-R1 pricing

| Vendor | Input / 1M tokens | Output / 1M tokens | Zero data retention | | --- | ---: | ---: | --- | | Amazon Bedrock | $1.35 | $5.40 | Yes | | Together AI | $3.00 | $7.00 | No |

Test DeepSeek-R1 with Merge Gateway’s Simulator

DeepSeek-R1
Synced
Synced
Run simulation to see response

Ready to try it out?

Start routing requests to hundreds of large language models in your product within minutes.

Route requests to DeepSeek-R1 with Merge Gateway

Merge Gateway is a unified LLM API that lets your product route requests to DeepSeek-R1 and every other major model through a single endpoint. You get built-in fallback routing, per-request cost tracking, zero data retention support, and observability without changing your application architecture.
To get started in seconds, add our Gateway Implementation skill to your project, or pick your preferred SDK below. Check out our other quick start skills here.
Install the Merge Gateway SDK
Python
1{
2  "mcpServers": {
3    "agent-handler": {
4      "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5      "headers": {
6        "Authorization": "Bearer yMt*****"
7      }
8    }
9  }
10}
11
Make your first API call
Python
1{
2  "mcpServers": {
3    "agent-handler": {
4      "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5      "headers": {
6        "Authorization": "Bearer yMt*****"
7      }
8    }
9  }
10}
11
Try a diffrent model
Python
1{
2  "mcpServers": {
3    "agent-handler": {
4      "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5      "headers": {
6        "Authorization": "Bearer yMt*****"
7      }
8    }
9  }
10}
11
Install the Merge Gateway SDK
Python
1{
2  "mcpServers": {
3    "agent-handler": {
4      "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5      "headers": {
6        "Authorization": "Bearer yMt*****"
7      }
8    }
9  }
10}
11
Make your first API call
Python
1{
2  "mcpServers": {
3    "agent-handler": {
4      "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5      "headers": {
6        "Authorization": "Bearer yMt*****"
7      }
8    }
9  }
10}
11
Try a diffrent model
Python
1{
2  "mcpServers": {
3    "agent-handler": {
4      "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5      "headers": {
6        "Authorization": "Bearer yMt*****"
7      }
8    }
9  }
10}
11
Install the Merge Gateway SDK
Python
1{
2  "mcpServers": {
3    "agent-handler": {
4      "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5      "headers": {
6        "Authorization": "Bearer yMt*****"
7      }
8    }
9  }
10}
11
Make your first API call
Python
1{
2  "mcpServers": {
3    "agent-handler": {
4      "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5      "headers": {
6        "Authorization": "Bearer yMt*****"
7      }
8    }
9  }
10}
11
Try a diffrent model
Python
1{
2  "mcpServers": {
3    "agent-handler": {
4      "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5      "headers": {
6        "Authorization": "Bearer yMt*****"
7      }
8    }
9  }
10}
11
Install the Merge Gateway SDK
Python
1{
2  "mcpServers": {
3    "agent-handler": {
4      "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5      "headers": {
6        "Authorization": "Bearer yMt*****"
7      }
8    }
9  }
10}
11
Make your first API call
Python
1{
2  "mcpServers": {
3    "agent-handler": {
4      "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5      "headers": {
6        "Authorization": "Bearer yMt*****"
7      }
8    }
9  }
10}
11
Try a diffrent model
Python
1{
2  "mcpServers": {
3    "agent-handler": {
4      "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5      "headers": {
6        "Authorization": "Bearer yMt*****"
7      }
8    }
9  }
10}
11

Explore other models available in Merge Gateway

model logo
Devstral 2512
model logo
Devstral Latest
model logo
Devstral Medium Latest
model logo
Dola Seed 2.0 Code (preview)
model logo
Dola Seed 2.0 Lite
model logo
Dola Seed 2.0 Mini
model logo
Dola Seed 2.0 Pro
model logo
Gemini 2.5 Flash
model logo
Gemini 2.5 Flash Lite
model logo
Gemini 2.5 Pro
model logo
Gemini 3.1 Flash Lite
model logo
Gemini 3.1 Flash Lite Preview
model logo
Gemini 3.1 Pro Preview
model logo
Gemini 3.1 Pro Preview Customtools
model logo
Gemini 3.5 Flash
model logo
Gemini 3 Flash Preview
model logo
Gemini 3 Pro Preview
model logo
Gemini Flash Latest
model logo
Gemini Flash Lite Latest
model logo
Gemini Pro Latest
model logo
Gemma 3 12B
model logo
Gemma 3 27B
model logo
Gemma 3 4B
model logo
GLM-4-32B-0414-128K

DeepSeek-R1 FAQ

In case you have any other questions on DeepSeek-R1, we've addressed several more below. It's also worth noting that the information below was written in June, 2026, and is subject to change.

Heading

How does DeepSeek-R1 differ from DeepSeek's other models?

DeepSeek-R1 is DeepSeek's primary reasoning model, applying extended chain-of-thought generation to improve accuracy on math, logic, and multi-step tasks at the cost of higher latency and price.

  • Pricing: At $1.35 per million input tokens and $4.20 per million output tokens (as of 06/01/2026), R1 is more than three times the input cost of DeepSeek V3 ($0.40) and nearly five times the output cost ($0.89), reflecting the additional computation from chain-of-thought generation
  • Architecture: R1 uses a 685B-parameter Mixture of Experts model with 37B active parameters per call, similar in scale to V3, but applies reasoning-focused training that produces longer, more deliberate outputs
  • Verbosity: R1 generates approximately 49 million output tokens per evaluation run (as of 06/01/2026), above the 43 million median, which increases cost per query compared to more concise non-reasoning models
  • Intelligence Index: R1 scores 27 on the Artificial Analysis Intelligence Index (as of 06/01/2026), placing it above V3 (16) and well above Coder V2 (11), reflecting its stronger performance on complex tasks
  • Context window: R1 supports 128k tokens, the same as V3 and Coder V2, making it appropriate for long-document reasoning tasks

DeepSeek-R1 is the right pick when accuracy on structured reasoning tasks matters more than cost or response speed, such as financial analysis, scientific problem solving, or multi-step planning.

What models should I consider using alongside DeepSeek-R1?

No single model is optimal for every task. Here are models worth pairing with DeepSeek R1 depending on what your product needs:

  • DeepSeek V3: For the majority of non-reasoning requests in your pipeline, routing to DeepSeek V3 instead of R1 reduces cost by approximately 70% per output token while maintaining strong general-purpose quality
  • o3 (OpenAI): For the most demanding reasoning tasks where accuracy is critical and cost is secondary, o3 provides a well-supported alternative reasoning model from a provider with broad enterprise infrastructure and SLA guarantees
  • Claude Sonnet 4 (Anthropic): When tasks require precise instruction following combined with moderate reasoning depth, Claude Sonnet 4 provides a balanced option that avoids R1's verbose output overhead
  • Gemini 2.5 Pro (Google): For long-context reasoning tasks that exceed R1's 128k context window, Gemini 2.5 Pro's extended context support provides a path for document-heavy workflows
  • Llama 3.3 70B (Meta): For cost-sensitive high-volume inference where full reasoning is not needed, Llama 3.3 70B offers open-weight portability and self-hosting options at a fraction of R1's cost

What are the challenges of using DeepSeek-R1 in my product?

Like any production LLM, DeepSeek-R1 comes with tradeoffs worth planning for:

  • Cost at scale: At $4.20 per million output tokens (as of 06/01/2026) and with verbose output averaging above the median, R1's costs compound quickly at high request volumes without prompt optimization or selective routing to cheaper models
  • Provider dependency: DeepSeek's infrastructure is independent of major cloud providers. A DeepSeek-side outage means no automatic failover unless you have routing logic that switches to an alternative reasoning model
  • Latency from chain-of-thought: R1's extended reasoning generation increases time-to-final-response compared to non-reasoning models. Latency-sensitive user-facing applications may experience noticeable delays on complex queries
  • Text-only modality: R1 does not support image input. Multimodal use cases require routing to a separate model, adding integration complexity when the same pipeline handles both text and image tasks
  • Verbosity overhead: R1's above-average output token count means you pay for reasoning tokens even when the final answer is short, making it inefficient for simple tasks that do not benefit from the extended thinking process

Why should I use Merge Gateway to route LLM requests with DeepSeek-R1 and every other model?

Using DeepSeek-R1 through Merge Gateway gives you access to the model itself and the infrastructure layer around it:

  • One API, every provider: Access DeepSeek R1 alongside every other major LLM through a single endpoint and API key. Change providers by swapping the model string, with no application code changes required
  • Intelligent routing and automatic failover: Merge routes around DeepSeek outages automatically. Routing policies based on cost, latency, or quality can reduce spend by 40–60% without touching your application code
  • Cost governance: Set hard or soft project budgets so DeepSeek R1 spend stays within plan. Every request is attributed to a model, project, and tag in a unified billing dashboard across all providers
  • Build Your Own Router: Define what "best" means for your traffic by selecting from curated ML benchmarks or adding your own eval scores. The router scores each available model against your weights and picks the winner per request, with a plain-language explanation of every decision
  • Security and compliance controls: Apply DLP rules and prompt injection protection before every request reaches DeepSeek. Enforce per-project model and region policies without adding that logic to your application

How can I start using Merge Gateway to route requests with DeepSeek-R1?

Getting DeepSeek-R1 running through Merge Gateway takes a few minutes:1

1. Create an account and get your API key from the dashboard.

2. Install the Merge Gateway SDK: run pip install merge-gateway-sdk (Python) or npm install merge-gateway-sdk (Node). Alternatively, if you're already using the OpenAI SDK, set base_url = "https://api-gateway.merge.dev/v1/openai" and your existing code works as-is.

3. Make your first request using the provider/model format. For DeepSeek-R1, the model string is deepseek-r1. Swap the model string to route to any other provider without changing anything else.

4. Configure a routing policy in the dashboard to set failover behavior, cost limits, and optimization strategy. Your first policy can be as simple as naming DeepSeel-R1 as primary for reasoning requests with a non-reasoning model as fallback for simpler tasks.

Full setup instructions and SDK references are in the Merge Gateway docs.

Try DeepSeek-R1 through Merge Gateway

Route, observe, and control AI requests across providers from one API.