Llama 33 70B Fp8:
Everything you need to know about the model

Llama 33 70B Fp8 is a Meta model available through Merge Gateway via Parasail. Use it with Gateway routing policies, spend controls, request logs, and a 131,072 token context window. It supports streaming through at least one Gateway vendor route.

Llama 33 70B Fp8 pricing

| Vendor | Input / 1M tokens | Output / 1M tokens | Zero data retention | | --- | ---: | ---: | --- | | Parasail | $0.2200 | $0.5000 | Yes |

Test Llama 33 70B Fp8 with Merge Gateway’s Simulator

Llama 33 70B Fp8
Synced
Synced
Run simulation to see response

Ready to try it out?

Start routing requests to hundreds of large language models in your product within minutes.

Route requests to Llama 33 70B Fp8 with Merge Gateway

Merge Gateway is a unified LLM API that lets your product route requests to Llama 33 70B Fp8 and every other major model through a single endpoint. You get built-in fallback routing, per-request cost tracking, zero data retention support, and observability without changing your application architecture.
To get started in seconds, add our Gateway Implementation skill to your project, or pick your preferred SDK below. Check out our other quick start skills here.
Install the Merge Gateway SDK
Python
1{
2  "mcpServers": {
3    "agent-handler": {
4      "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5      "headers": {
6        "Authorization": "Bearer yMt*****"
7      }
8    }
9  }
10}
11
Make your first API call
Python
1{
2  "mcpServers": {
3    "agent-handler": {
4      "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5      "headers": {
6        "Authorization": "Bearer yMt*****"
7      }
8    }
9  }
10}
11
Try a diffrent model
Python
1{
2  "mcpServers": {
3    "agent-handler": {
4      "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5      "headers": {
6        "Authorization": "Bearer yMt*****"
7      }
8    }
9  }
10}
11
Install the Merge Gateway SDK
Python
1{
2  "mcpServers": {
3    "agent-handler": {
4      "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5      "headers": {
6        "Authorization": "Bearer yMt*****"
7      }
8    }
9  }
10}
11
Make your first API call
Python
1{
2  "mcpServers": {
3    "agent-handler": {
4      "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5      "headers": {
6        "Authorization": "Bearer yMt*****"
7      }
8    }
9  }
10}
11
Try a diffrent model
Python
1{
2  "mcpServers": {
3    "agent-handler": {
4      "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5      "headers": {
6        "Authorization": "Bearer yMt*****"
7      }
8    }
9  }
10}
11
Install the Merge Gateway SDK
Python
1{
2  "mcpServers": {
3    "agent-handler": {
4      "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5      "headers": {
6        "Authorization": "Bearer yMt*****"
7      }
8    }
9  }
10}
11
Make your first API call
Python
1{
2  "mcpServers": {
3    "agent-handler": {
4      "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5      "headers": {
6        "Authorization": "Bearer yMt*****"
7      }
8    }
9  }
10}
11
Try a diffrent model
Python
1{
2  "mcpServers": {
3    "agent-handler": {
4      "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5      "headers": {
6        "Authorization": "Bearer yMt*****"
7      }
8    }
9  }
10}
11
Install the Merge Gateway SDK
Python
1{
2  "mcpServers": {
3    "agent-handler": {
4      "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5      "headers": {
6        "Authorization": "Bearer yMt*****"
7      }
8    }
9  }
10}
11
Make your first API call
Python
1{
2  "mcpServers": {
3    "agent-handler": {
4      "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5      "headers": {
6        "Authorization": "Bearer yMt*****"
7      }
8    }
9  }
10}
11
Try a diffrent model
Python
1{
2  "mcpServers": {
3    "agent-handler": {
4      "url": "https://ah-api-develop.merge.dev/api/v1/tool-packs/{TOOL_PACK_ID}/registered-users/{REGISTERED_USER_ID}/mcp",
5      "headers": {
6        "Authorization": "Bearer yMt*****"
7      }
8    }
9  }
10}
11

Explore other models available in Merge Gateway

model logo
Gemini 3.1 Pro Preview
model logo
Gemini 3.1 Pro Preview Customtools
model logo
Gemini 3.5 Flash
model logo
Gemini 3 Flash Preview
model logo
Gemini 3 Pro Preview
model logo
Gemini Flash Latest
model logo
Gemini Flash Lite Latest
model logo
Gemini Pro Latest
model logo
Gemma 3 12B
model logo
Gemma 3 27B
model logo
Gemma 3 4B
model logo
GLM-4-32B-0414-128K
model logo
GLM-4.5
model logo
GLM-4.5-Air
model logo
GLM-4.5-AirX
model logo
GLM-4.5-X
model logo
GLM-4.6
model logo
Glm47
model logo
GLM 4.7
model logo
GLM 4.7 Flash
model logo
GLM-4.7-FlashX
model logo
Glm 5
model logo
GLM-5
model logo
GLM-5-Turbo

Llama 33 70B Fp8 FAQ

In case you have any more questions on using Llama 33 70B Fp8, we’ve addressed several below.

Heading

What provider owns Llama 33 70B Fp8?

Llama 33 70B Fp8 is a Meta model.

Which vendors can run Llama 33 70B Fp8?

Parasail is the default listed vendor, and other active vendors may also be available.

What context window does Llama 33 70B Fp8 support?

Llama 33 70B Fp8 supports 131,072 tokens on the primary listed vendor route.

What capabilities does Llama 33 70B Fp8 support?

Gateway currently lists streaming support for Llama 33 70B Fp8 across its available vendor routes.

Try Llama 33 70B Fp8 through Merge Gateway

Route, observe, and control AI requests across providers from one API.