GLM 4.7 Flash:
pricing, performance, and how to route requests
GLM 4.7 Flash is accessible via Merge Gateway. With Gateway, you can apply routing policies and spend controls, and access per-request logs. Context window and streaming support depend on the provider route you select.

GLM 4.7 Flash pricing

| Vendor | Input / 1M tokens | Output / 1M tokens | Zero data retention | | --- | ---: | ---: | --- | | Amazon Bedrock | $0.0700 | $0.4000 | Yes |

Test GLM 4.7 Flash with Merge Gateway’s Simulator

GLM 4.7 Flash
Synced
Synced
Run simulation to see response

Route requests to GLM 4.7 Flash with Merge Gateway

Merge Gateway is a unified LLM API that lets your product route requests to GLM 4.7 Flash and every other major model through a single endpoint. You get built-in fallback routing, per-request cost tracking, data loss prevention (DLP), prompt injection protection, and observability without changing your application architecture.
To get started in seconds, add our Gateway Implementation skill to your project, or pick your preferred SDK below. Check out our other quick start skills here.
Install the Merge Gateway SDK
Python
Copied!
1$ pip install merge-gateway-sdk
Send a request
Python
Copied!
1from merge_gateway import MergeGateway
2
3client = MergeGateway(api_key="YOUR_API_KEY")
4
5response = client.responses.create(
6    model="openai/gpt-5.2",
7    input=[
8        {"type": "message", "role": "system", "content": "You are a helpful programming tutor. Explain the concepts clearly with practical examples."},
9        {"type": "message", "role": "user", "content": "Explain the concept of recursion in programming with a simple set of examples."},
10    ],
11)
12
13print(response.output[0].content[0].text)
Try a diffrent model
Swap the model string to route to a different provider. No other code changes needed.
Anthropic
Copied!
1response = client.responses.create(
2    model="anthropic/claude-sonnet-4-20250514",
3    input=[
4        {"type": "message", "role": "system", "content": "You are a helpful programming tutor. Explain the concepts clearly with practical examples."},
5        {"type": "message", "role": "user", "content": "Explain the concept of recursion in programming with a simple set of examples."},
6    ],
7)
Point to Gateway
Python
Copied!
1from openai import OpenAI
2
3client = OpenAI(
4    api_key="YOUR_API_KEY",
5    base_url="https://api-gateway.merge.dev/v1/openai",
6)
Send a request
Use the standard chat.completions.create method. No provider prefix needed on the model name.
Python
Copied!
1response = client.chat.completions.create(
2    model="gpt-5.2",
3    messages=[
4        {"role": "system", "content": "You are a helpful programming tutor. Explain the concepts clearly with practical examples."},
5        {"role": "user", "content": "Explain the concept of recursion in programming with a simple set of examples."},
6    ],
7)
8
9print(response.choices[0].message.content)
Install packages
Copied!
1npm install merge-gateway-ai-sdk-provider ai
Create the provider
TypeScript
Copied!
1import { createMergeGateway } from "merge-gateway-ai-sdk-provider";
2
3const gateway = createMergeGateway({
4  apiKey: "YOUR_API_KEY",
5});
Send a request
Use generateText to send a request. Model names use the provider/model format.
TypeScript
Copied!
1import { generateText } from "ai";
2
3const { text } = await generateText({
4  model: gateway("openai/gpt-4o"),
5  prompt: "Explain the concept of recursion in programming with a simple set of examples.",
6});
7
8console.log(text);
If you already have @ai-sdk/openai installed, point it at Gateway with a base URL change:
TypeScript
Copied!
1import { createOpenAI } from "@ai-sdk/openai";
2
3const gateway = createOpenAI({
4  apiKey: "YOUR_API_KEY",
5  baseURL: "https://api-gateway.merge.dev/v1/ai-sdk",
6});
7
8// All generateText/streamText calls work unchanged
Install the Merge Gateway SDK
Anthropic SDK
Copied!
1from anthropic import Anthropic
2
3client = Anthropic(
4    api_key="YOUR_API_KEY",
5    base_url="https://api-gateway.merge.dev/v1/anthropic",
6)
7
8message = client.messages.create(
9    model="claude-sonnet-4-20250514",
10    max_tokens=1024,
11    messages=[
12        {"role": "user", "content": "Explain the concept of recursion in programming with a simple set of examples."},
13    ],
14)
15
16print(message.content[0].text)

Explore other models available in Merge Gateway

model logo
Amazon Nova 2 Lite
model logo
Amazon Nova 2 Sonic
model logo
Amazon Nova Premier
model logo
Amazon Nova Pro
model logo
Claude Opus 4.6
model logo
Claude Opus 4.7
model logo
Claude Opus 4.8
model logo
Claude Sonnet 4.5
model logo
Claude Sonnet 4.6
model logo
Codestral
model logo
Codestral 25.08
model logo
DeepSeek V3
model logo
DeepSeek V3.2
model logo
DeepSeek V4 Flash
model logo
DeepSeek V4 Pro
model logo
Devstral 2512
model logo
Dola Seed 2.0 Code (preview)
model logo
Dola Seed 2.0 Lite
model logo
Dola Seed 2.0 Mini
model logo
Dola Seed 2.0 Pro
model logo
Gemini 2.5 Flash
model logo
Gemini 2.5 Flash Lite
model logo
Gemini 2.5 Pro
model logo
Gemini 3.1 Flash Lite

GLM 4.7 Flash FAQ

Heading

What provider owns GLM 4.7 Flash?

GLM 4.7 Flash is a Amazon Bedrock model.

Which vendors can run GLM 4.7 Flash?

Amazon Bedrock is the default listed vendor, and other active vendors may also be available.

What context window does GLM 4.7 Flash support?

GLM 4.7 Flash supports 200,000 tokens on the primary listed vendor route.

What capabilities does GLM 4.7 Flash support?

Gateway currently lists streaming support for GLM 4.7 Flash across its available vendor routes.

Try GLM 4.7 Flash through Merge Gateway

Route, observe, and control AI requests across providers from one API.