Z.ai

Z.ai GLM 5.1

Z.aiBalanced
ThinkingTool UseStructured Output

About this model

GLM-5.1 delivers a major leap in coding capability, with particularly significant gains in handling long-horizon tasks. Unlike previous models built around minute-level interactions, GLM-5.1 can work independently and continuously on...

Performance Tier

Balanced

Z.ai GLM 5.1 is a balanced model from Z.ai : strong performance at a reasonable price.

Strong cost-performance ratio. Reliable for most professional use cases without premium pricing.

Pricing

This model is included in Elosia plans
Moderate

Moderate cost. A balanced choice for regular use without constant cap watching.

Typeper 1M tokens
Input (prompt)$0.980
Output (completion)$3.08
Cache read$0.490

Capabilities

Context Length203K
Max Output Tokens66K
TokenizerOther
Inputtext
Outputtext
Release DateApril 7, 2026

Benchmarks

General Intelligence
MMLU
Not reported
GPQA Diamond
86.2%
Mathematics
MATH-500
Not reported
Programming
HumanEval
Not reported
SWE-bench Verified
77.8%
LiveCodeBench
52%
Reasoning
Humanity's Last Exam
52.3%
Agentic
SWE-bench Pro
58.4%
Terminal-Bench 2.0
69%

Recommended Use Cases

CodingAnalysisResearchGeneral Chat

Strengths

  • 754B MoE model (40B active), strong agentic coder on SWE-Bench Pro (58.4) — surpassed by GLM 5.2 for frontier use cases
  • Long-horizon agentic coding, sustains autonomous execution for up to 8 hours on a single task
  • 200K context window with up to 131K output tokens
  • Open-weight MIT license, self-hostable, trained entirely on Huawei chips
  • Competitive pricing ($0.95/M input, $3.15/M output) vs Western frontier models

Limitations

  • Text-only : no multimodal (image/audio) capabilities
  • Smaller ecosystem of integrations and tooling outside Chinese market
  • Self-reported benchmark scoring, independent verification still limited

Resources

This model may use your data for training

Similar Models