Z.ai

Z.ai GLM 4.6

Z.aiBalanced
ThinkingTool UseStructured Output

About this model

Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex agentic tasks. Superior coding performance: The model achieves higher scores on code benchmarks and demonstrates better real-world performance in applications such as Claude Code、Cline、Roo Code and Kilo Code, including improvements in generating visually polished front-end pages. Advanced reasoning: GLM-4.6 shows a clear improvement in reasoning performance and supports tool use during inference, leading to stronger overall capability. More capable agents: GLM-4.6 exhibits stronger performance in tool using and search-based agents, and integrates more effectively within agent frameworks. Refined writing: Better aligns with human preferences in style and readability, and performs more naturally in role-playing scenarios.

Performance Tier

Balanced

Z.ai GLM 4.6 is a balanced model from Z.ai : strong performance at a reasonable price.

Strong cost-performance ratio. Reliable for most professional use cases without premium pricing.

Pricing

This model is included in Elosia plans
Typeper 1M tokens
Input (prompt)$0.390
Output (completion)$1.90

Capabilities

Context Length205K
Max Output Tokens205K
TokenizerOther
Inputtext
Outputtext
Release DateSeptember 30, 2025

Benchmarks

General Intelligence
MMLU
80%
Mathematics
MATH-500
72%
Programming
HumanEval
76.5%

Recommended Use Cases

General ChatCodingTranslation

Strengths

  • Solid general-purpose model with good Chinese/English support
  • Good value for bilingual applications
  • Reliable for everyday reasoning and conversation

Limitations

  • Superseded by GLM 4.7 on most benchmarks
  • Limited multilingual support beyond Chinese and English

Resources

This model may use your data for training

Similar Models