DeepSeek-R1 is a model available through Merge Gateway via Amazon Bedrock. Use it with Gateway routing policies, spend controls, request logs, and a 163,840 token context window. It supports streaming through at least one Gateway vendor route.

DeepSeek-R1 pricing
Test DeepSeek-R1 with Merge Gateway’s Simulator

Ready to try it out?
Start routing requests to hundreds of large language models in your product within minutes.

Route requests to DeepSeek-R1 with Merge Gateway
1{
2 "mcpServers": {
3 "agent-handler": {
4 "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5 "headers": {
6 "Authorization": "Bearer yMt*****"
7 }
8 }
9 }
10}
111{
2 "mcpServers": {
3 "agent-handler": {
4 "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5 "headers": {
6 "Authorization": "Bearer yMt*****"
7 }
8 }
9 }
10}
111{
2 "mcpServers": {
3 "agent-handler": {
4 "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5 "headers": {
6 "Authorization": "Bearer yMt*****"
7 }
8 }
9 }
10}
111{
2 "mcpServers": {
3 "agent-handler": {
4 "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5 "headers": {
6 "Authorization": "Bearer yMt*****"
7 }
8 }
9 }
10}
111{
2 "mcpServers": {
3 "agent-handler": {
4 "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5 "headers": {
6 "Authorization": "Bearer yMt*****"
7 }
8 }
9 }
10}
111{
2 "mcpServers": {
3 "agent-handler": {
4 "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5 "headers": {
6 "Authorization": "Bearer yMt*****"
7 }
8 }
9 }
10}
111{
2 "mcpServers": {
3 "agent-handler": {
4 "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5 "headers": {
6 "Authorization": "Bearer yMt*****"
7 }
8 }
9 }
10}
111{
2 "mcpServers": {
3 "agent-handler": {
4 "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5 "headers": {
6 "Authorization": "Bearer yMt*****"
7 }
8 }
9 }
10}
111{
2 "mcpServers": {
3 "agent-handler": {
4 "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5 "headers": {
6 "Authorization": "Bearer yMt*****"
7 }
8 }
9 }
10}
111{
2 "mcpServers": {
3 "agent-handler": {
4 "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5 "headers": {
6 "Authorization": "Bearer yMt*****"
7 }
8 }
9 }
10}
111{
2 "mcpServers": {
3 "agent-handler": {
4 "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5 "headers": {
6 "Authorization": "Bearer yMt*****"
7 }
8 }
9 }
10}
111{
2 "mcpServers": {
3 "agent-handler": {
4 "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5 "headers": {
6 "Authorization": "Bearer yMt*****"
7 }
8 }
9 }
10}
11Explore other models available in Merge Gateway
DeepSeek-R1 FAQ
Heading
How does DeepSeek-R1 differ from DeepSeek's other models?
DeepSeek-R1 is DeepSeek's primary reasoning model, applying extended chain-of-thought generation to improve accuracy on math, logic, and multi-step tasks at the cost of higher latency and price.
- Pricing: At $1.35 per million input tokens and $4.20 per million output tokens (as of 06/01/2026), R1 is more than three times the input cost of DeepSeek V3 ($0.40) and nearly five times the output cost ($0.89), reflecting the additional computation from chain-of-thought generation
- Architecture: R1 uses a 685B-parameter Mixture of Experts model with 37B active parameters per call, similar in scale to V3, but applies reasoning-focused training that produces longer, more deliberate outputs
- Verbosity: R1 generates approximately 49 million output tokens per evaluation run (as of 06/01/2026), above the 43 million median, which increases cost per query compared to more concise non-reasoning models
- Intelligence Index: R1 scores 27 on the Artificial Analysis Intelligence Index (as of 06/01/2026), placing it above V3 (16) and well above Coder V2 (11), reflecting its stronger performance on complex tasks
- Context window: R1 supports 128k tokens, the same as V3 and Coder V2, making it appropriate for long-document reasoning tasks
DeepSeek-R1 is the right pick when accuracy on structured reasoning tasks matters more than cost or response speed, such as financial analysis, scientific problem solving, or multi-step planning.
What models should I consider using alongside DeepSeek-R1?
No single model is optimal for every task. Here are models worth pairing with DeepSeek R1 depending on what your product needs:
- DeepSeek V3: For the majority of non-reasoning requests in your pipeline, routing to DeepSeek V3 instead of R1 reduces cost by approximately 70% per output token while maintaining strong general-purpose quality
- o3 (OpenAI): For the most demanding reasoning tasks where accuracy is critical and cost is secondary, o3 provides a well-supported alternative reasoning model from a provider with broad enterprise infrastructure and SLA guarantees
- Claude Sonnet 4 (Anthropic): When tasks require precise instruction following combined with moderate reasoning depth, Claude Sonnet 4 provides a balanced option that avoids R1's verbose output overhead
- Gemini 2.5 Pro (Google): For long-context reasoning tasks that exceed R1's 128k context window, Gemini 2.5 Pro's extended context support provides a path for document-heavy workflows
- Llama 3.3 70B (Meta): For cost-sensitive high-volume inference where full reasoning is not needed, Llama 3.3 70B offers open-weight portability and self-hosting options at a fraction of R1's cost
What are the challenges of using DeepSeek-R1 in my product?
Like any production LLM, DeepSeek-R1 comes with tradeoffs worth planning for:
- Cost at scale: At $4.20 per million output tokens (as of 06/01/2026) and with verbose output averaging above the median, R1's costs compound quickly at high request volumes without prompt optimization or selective routing to cheaper models
- Provider dependency: DeepSeek's infrastructure is independent of major cloud providers. A DeepSeek-side outage means no automatic failover unless you have routing logic that switches to an alternative reasoning model
- Latency from chain-of-thought: R1's extended reasoning generation increases time-to-final-response compared to non-reasoning models. Latency-sensitive user-facing applications may experience noticeable delays on complex queries
- Text-only modality: R1 does not support image input. Multimodal use cases require routing to a separate model, adding integration complexity when the same pipeline handles both text and image tasks
- Verbosity overhead: R1's above-average output token count means you pay for reasoning tokens even when the final answer is short, making it inefficient for simple tasks that do not benefit from the extended thinking process
Why should I use Merge Gateway to route LLM requests with DeepSeek-R1 and every other model?
Using DeepSeek-R1 through Merge Gateway gives you access to the model itself and the infrastructure layer around it:
- One API, every provider: Access DeepSeek R1 alongside every other major LLM through a single endpoint and API key. Change providers by swapping the model string, with no application code changes required
- Intelligent routing and automatic failover: Merge routes around DeepSeek outages automatically. Routing policies based on cost, latency, or quality can reduce spend by 40–60% without touching your application code
- Cost governance: Set hard or soft project budgets so DeepSeek R1 spend stays within plan. Every request is attributed to a model, project, and tag in a unified billing dashboard across all providers
- Build Your Own Router: Define what "best" means for your traffic by selecting from curated ML benchmarks or adding your own eval scores. The router scores each available model against your weights and picks the winner per request, with a plain-language explanation of every decision
- Security and compliance controls: Apply DLP rules and prompt injection protection before every request reaches DeepSeek. Enforce per-project model and region policies without adding that logic to your application
How can I start using Merge Gateway to route requests with DeepSeek-R1?
Getting DeepSeek-R1 running through Merge Gateway takes a few minutes:1
1. Create an account and get your API key from the dashboard.
2. Install the Merge Gateway SDK: run pip install merge-gateway-sdk (Python) or npm install merge-gateway-sdk (Node). Alternatively, if you're already using the OpenAI SDK, set base_url = "https://api-gateway.merge.dev/v1/openai" and your existing code works as-is.
3. Make your first request using the provider/model format. For DeepSeek-R1, the model string is deepseek-r1. Swap the model string to route to any other provider without changing anything else.
4. Configure a routing policy in the dashboard to set failover behavior, cost limits, and optimization strategy. Your first policy can be as simple as naming DeepSeel-R1 as primary for reasoning requests with a non-reasoning model as fallback for simpler tasks.
Full setup instructions and SDK references are in the Merge Gateway docs.
Try DeepSeek-R1 through Merge Gateway
Route, observe, and control AI requests across providers from one API.





