Z.ai models & pricing
Z.ai hosts 11 models (10 with public pricing) covering 1 modalities. GLM family of frontier Chinese-leading reasoning and chat models. Cheapest input starts at $0.1000/M tokens; the most premium goes up to $2.20/M. Use Future AGI's Agent Command Center to route any Z.ai model with cost-optimized fallback and unified observability.
chat 11
| Model ↕ | Input / 1M ↕ | Output / 1M ↕ | Context ↕ | Caps |
|---|---|---|---|---|
| Glm 4.32B 0414.128k | $0.1000/M | $0.1000/M | 128,000 | tools |
| Glm 4.5 Air | $0.200/M | $1.10/M | 128,000 | tools |
| Glm 4.5 | $0.600/M | $2.20/M | 128,000 | tools |
| Glm 4.5v | $0.600/M | $1.80/M | 128,000 | tools · vision |
| Glm 4.6 | $0.600/M | $2.20/M | 200,000 | tools · reasoning · cache |
| Glm 4.7 | $0.600/M | $2.20/M | 200,000 | tools · reasoning · cache |
| Glm 5 | $1.00/M | $3.20/M | 200,000 | tools · reasoning · cache |
| Glm 4.5 Airx | $1.10/M | $4.50/M | 128,000 | tools |
| Glm 5 Code | $1.20/M | $5.00/M | 200,000 | tools · reasoning · cache |
| Glm 4.5 X | $2.20/M | $8.90/M | 128,000 | tools |
| Glm 4.5 Flash | — | — | 128,000 | tools |
FAQ
How many Z.ai models are there?
11 Z.ai models are listed across 1 modality on this page. 10 have public per-token pricing.
How is Z.ai pricing verified?
Pricing is aggregated from BerriAI/litellm, models.dev, and OpenRouter and refreshed weekly. Each row shows a per-model "verified" date. If a price is wrong, click the row to open the model page and use the inline "suggest edit" link — submissions go into a public review queue.
Which Z.ai model is cheapest?
Input pricing on Z.ai starts at $0.1000 per 1M tokens. Sort the table by price (or use the in-page filter at the top) to find the cheapest model that matches your capability requirements.
Can I route to Z.ai via an OpenAI-compatible API?
Yes — point your OpenAI client at Future AGI's Agent Command Center, configure a Z.ai target, and call Z.ai models with the standard /v1/chat/completions surface. The same gateway can route to other providers as fallback. Free for the first 100K requests/month.