CreateOS AI Gateway Pricing

One API for every model, routed to the cheapest available source, plus a flat 2.5% platform fee. Compare live prices across nine providers and model your spend.

Book a demo View the Gateway

Anthropic

OpenAI

Google

Mistral

DeepSeek

Alibaba (Qwen)

Zhipu AI

Moonshot AI

Minimax

Cost calculator

Model

Input tokens

1.00M tokens

Output tokens

0.20M tokens

Input cost

$5.00

Output cost

$5.00

Base total

$10.00

Before platform fee

Total with 2.5% fee

$10.25

Final billed amount

Prices are the base upstream cost per 1M tokens. A flat 2.5% platform fee applies. The Gateway routes each request to the cheapest available source.

Provider

Tier

Model	Provider	Context	Input $/1M	Output $/1M	Blended $/1M
Claude Opus 4.7	Anthropic	1M	$5.00	$25.00	$10.00
Claude Opus 4.6	Anthropic	200K (1M β)	$5.00	$25.00	$10.00
Claude Sonnet 4.6	Anthropic	200K (1M β)	$3.00	$15.00	$6.00
Claude Opus 4.5	Anthropic	200K	$5.00	$25.00	$10.00
Claude Sonnet 4.5	Anthropic	200K	$3.00	$15.00	$6.00
Claude Haiku 4.5	Anthropic	200K	$1.00	$5.00	$2.00
GPT-5.4	OpenAI	1.05M	$2.50	$5.00	$3.13
GPT-5.4 Pro	OpenAI	1.05M	$30.00	$180.00	$67.50
GPT-5.4 Mini	OpenAI	400K	$0.75	$4.50	$1.69
GPT-5.4 Nano	OpenAI	400K	$0.20	$1.25	$0.46
GPT-5-mini	OpenAI	400K	$0.25	$2.00	$0.69
o4-mini	OpenAI	200K	$1.10	$4.40	$1.93
GPT-4.1	OpenAI	1M	$2.00	$8.00	$3.50
GLM-5	Zhipu AI	200K	$0.30	$2.43	$0.83
GLM-5.1	Zhipu AI	203K	$0.95	$3.15	$1.50
GLM-5 Turbo	Zhipu AI	200K	$0.96	$3.20	$1.52
GLM-4.7	Zhipu AI	200K	$0.40	$1.75	$0.74
GLM-4.7-Flash	Zhipu AI	200K	FREE	FREE	FREE
Kimi K2.5	Moonshot AI	256K	$0.45	$2.32	$0.92
Kimi K2	Moonshot AI	131K	$0.55	$2.20	$0.96
Kimi K2 Turbo	Moonshot AI	131K	$0.50	$2.11	$0.90
Gemini 3.1 Pro Preview	Google	1M	$2.00	$12.00	$4.50
Gemini 2.5 Pro	Google	1M	$1.25	$10.00	$3.44
Gemini 3 Flash Preview	Google	1M	$0.50	$3.00	$1.13
Gemini 2.5 Flash	Google	1M	$0.30	$0.30	$0.30
Gemini 2.5 Flash-Lite	Google	1M	$0.10	$0.40	$0.18
Gemini 3.1 Flash Lite	Google	1M	$0.25	$1.50	$0.56
Gemini 3 Pro	Google	1M	$2.00	$12.00	$4.50
MiniMax M2.5 Standard	Minimax	1M	$0.28	$1.00	$0.46
MiniMax M2.7	Minimax	205K	$0.30	$1.20	$0.52
MiniMax-01	Minimax	4M	$0.20	$1.10	$0.43
MiniMax M2.1	Minimax	196K	$0.27	$0.95	$0.44
Qwen3.5 Plus	Alibaba	1M	$0.26	$1.56	$0.58
Qwen 3.6 Plus	Alibaba	1M	$0.33	$1.95	$0.73
Qwen3.5 Flash	Alibaba	1M	$0.10	$0.40	$0.18
Qwen3 Max	Alibaba	262K	$1.20	$6.00	$2.40
Qwen3 Coder	Alibaba	262K	$0.22	$0.90	$0.39
Mistral Small 4	Mistral	262K	$0.10	$0.30	$0.15
DeepSeek (Latest)	DeepSeek	128K	$0.28	$0.42	$0.32

Prices are the base upstream cost per 1M tokens. A flat 2.5% platform fee applies. The Gateway routes each request to the cheapest available source.

Zero Data Retention Available

Zero Data Retention (ZDR) is available for supported models: the Gateway does not store your data and does not train on it. Ask for a ZDR-compliant model when you request access.

Common Questions

What is the CreateOS AI Gateway?

The CreateOS AI Gateway is a unified API gateway that routes every model request to the cheapest available upstream source. You reach dozens of models from nine providers through one endpoint, with one API key and one bill.

How does pricing work?

The prices shown are the base upstream cost per 1 million tokens. The Gateway adds a flat 2.5% platform fee and routes each request to the cheapest available source, so you always pay the lowest rate. No hidden fees, no minimums, no monthly commitment.

What is blended pricing?

Blended pricing is a weighted average that reflects typical usage: input cost times three, plus output cost, divided by four. Most calls send more input than they receive as output, so this gives a realistic per-million-token estimate.

Can I switch models without changing code?

Yes. Every model is available through the same endpoint, so switching is as simple as changing the model name in your request. No new SDK and no code changes.

What happens if a provider goes down?

The Gateway monitors upstream health in real time and reroutes to the next cheapest available source on an outage, so you get built-in redundancy at no extra cost.

Are there minimums or hidden fees?

No. You pay the upstream token cost plus the flat 2.5% platform fee. No minimums, no monthly charges, and no volume requirements.

One API for every model, at the cheapest price.

Book a demo, or start building on the Gateway.

Book a demo Start building