Z.ai: GLM 4.6
Z-AI Developer Architecture Profile
Intelligence (ELO)1120Chatbot Arena Verified
Max Context204,800Tokens
API Cost / 1M$2.29Blended Prompt + Completion
Model Capabilities
Compared with GLM-4.5, this generation brings several key improvements:
Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex agentic tasks.
Superior coding performance: The model achieves higher scores on code benchmarks and demonstrates better real-world performance in applications such as Claude Code、Cline、Roo Code and Kilo Code, including improvements in generating visually polished front-end pages.
Advanced reasoning: GLM-4.6 shows a clear improvement in reasoning performance and supports tool use during inference, leading to stronger overall capability.
More capable agents: GLM-4.6 exhibits stronger performance in tool using and search-based agents, and integrates more effectively within agent frameworks.
Refined writing: Better aligns with human preferences in style and readability, and performs more naturally in role-playing scenarios.
Granular Pricing Matrix
Input Tokens (Prompt)$0.39 / 1M
Output Tokens (Completion)$1.90 / 1M
Pricing data via OpenRouter. Sync: 3/16/2026
Evaluate Competitors
VS Engine MatchupZ.ai: GLM 4.6 vs Z.ai: GLM 5 TurboVS Engine MatchupZ.ai: GLM 4.6 vs Inception: Mercury 2VS Engine MatchupZ.ai: GLM 4.6 vs Qwen: Qwen3.5-27BVS Engine MatchupZ.ai: GLM 4.6 vs Qwen: Qwen3.5-122B-A10BVS Engine MatchupZ.ai: GLM 4.6 vs AionLabs: Aion-2.0VS Engine MatchupZ.ai: GLM 4.6 vs Qwen: Qwen3.5 Plus 2026-02-15