Z.ai models & pricing

Z.ai hosts 11 models (10 with public pricing) covering 1 modalities. GLM family of frontier Chinese-leading reasoning and chat models. Cheapest input starts at $0.1000/M tokens; the most premium goes up to $2.20/M. Use Future AGI's Agent Command Center to route any Z.ai model with cost-optimized fallback and unified observability.

Homepage ↗ Docs ↗

chat 11

Model Input / 1M Output / 1M Context Caps
Glm 4.32B 0414.128k $0.1000/M $0.1000/M 128,000 tools
Glm 4.5 Air $0.200/M $1.10/M 128,000 tools
Glm 4.5 $0.600/M $2.20/M 128,000 tools
Glm 4.5v $0.600/M $1.80/M 128,000 tools · vision
Glm 4.6 $0.600/M $2.20/M 200,000 tools · reasoning · cache
Glm 4.7 $0.600/M $2.20/M 200,000 tools · reasoning · cache
Glm 5 $1.00/M $3.20/M 200,000 tools · reasoning · cache
Glm 4.5 Airx $1.10/M $4.50/M 128,000 tools
Glm 5 Code $1.20/M $5.00/M 200,000 tools · reasoning · cache
Glm 4.5 X $2.20/M $8.90/M 128,000 tools
Glm 4.5 Flash 128,000 tools

FAQ

How many Z.ai models are there?

11 Z.ai models are listed across 1 modality on this page. 10 have public per-token pricing.

How is Z.ai pricing verified?

Pricing is aggregated from BerriAI/litellm, models.dev, and OpenRouter and refreshed weekly. Each row shows a per-model "verified" date. If a price is wrong, click the row to open the model page and use the inline "suggest edit" link — submissions go into a public review queue.

Which Z.ai model is cheapest?

Input pricing on Z.ai starts at $0.1000 per 1M tokens. Sort the table by price (or use the in-page filter at the top) to find the cheapest model that matches your capability requirements.

Can I route to Z.ai via an OpenAI-compatible API?

Yes — point your OpenAI client at Future AGI's Agent Command Center, configure a Z.ai target, and call Z.ai models with the standard /v1/chat/completions surface. The same gateway can route to other providers as fallback. Free for the first 100K requests/month.

Route any Z.ai model via Agent Command Center →
OpenAI-compatible endpoint. Caching, fallback, guardrails, observability. Free for 100K requests/month.