Z.ai GLM 5 Turbo

Z.ai · Balanced
Thinking · Tool Use

About this model

GLM-5 Turbo is a new model from Z.ai designed for fast inference and strong performance in agent-driven environments such as OpenClaw scenarios. It is deeply optimized for real-world agent workflows involving long execution chains, with improved complex instruction decomposition, tool use, scheduled and persistent execution, and overall stability across extended tasks.

Performance Tier

Balanced

Z.ai GLM 5 Turbo is a balanced model from Z.ai: strong performance at a reasonable price.

Strong cost-performance ratio. Reliable for most professional use cases without premium pricing.

Pricing

This model is included in Elosia plans
Type                 Per 1M tokens
Input (prompt)       $1.20
Output (completion)  $4.00
Cache read           $0.240
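
To make the per-token rates concrete, here is a minimal cost-estimate sketch in Python using the three prices listed above. The function name `estimate_cost` is hypothetical, and the billing rule it applies (cached prompt tokens are charged at the cache-read rate instead of the input rate) is an assumption, not something this page states.

```python
from decimal import Decimal

# Listed prices in USD per 1M tokens (taken from the pricing table above).
PRICE_PER_MILLION = {
    "input": Decimal("1.20"),
    "output": Decimal("4.00"),
    "cache_read": Decimal("0.240"),
}


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> Decimal:
    """Estimate the USD cost of one request.

    Assumption: `cached_tokens` is the part of the prompt served from cache
    and is billed at the cache-read rate instead of the input rate.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")

    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * PRICE_PER_MILLION["input"]
        + cached_tokens * PRICE_PER_MILLION["cache_read"]
        + output_tokens * PRICE_PER_MILLION["output"]
    )
    return total / 1_000_000


if __name__ == "__main__":
    # A 50,000-token prompt with 40,000 tokens cached, plus a 2,000-token reply.
    print(estimate_cost(50_000, 2_000, cached_tokens=40_000))
```

Under these assumptions, the example request costs 10,000 × $1.20 + 40,000 × $0.24 + 2,000 × $4.00 per million tokens, or about $0.0296. `Decimal` is used so that small per-request amounts do not pick up floating-point rounding noise.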

Capabilities

Context Length: 203K
Max Output Tokens: 131K
Tokenizer: Other
Input: text
Output: text
Release Date: March 15, 2026
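
The context and output limits interact when sizing a request. The sketch below, in Python, computes how many completion tokens can still be requested for a given prompt size. It rests on two assumptions that this page does not confirm: the rounded figures 203K and 131K are treated as exactly 203,000 and 131,000 tokens, and prompt and completion tokens are assumed to share one context window. The function name `max_completion_budget` is hypothetical.

```python
# Rounded limits from the capabilities list above (exact token counts are an assumption).
CONTEXT_LENGTH = 203_000
MAX_OUTPUT_TOKENS = 131_000


def max_completion_budget(prompt_tokens: int) -> int:
    """Return the largest completion size that fits alongside the prompt.

    Assumption: prompt and completion tokens both count against the
    same context window, and the output cap applies on top of that.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_LENGTH - prompt_tokens
    return max(0, min(MAX_OUTPUT_TOKENS, remaining))


if __name__ == "__main__":
    for prompt in (10_000, 150_000, 250_000):
        print(prompt, max_completion_budget(prompt))
```

With these numbers, a short prompt is limited by the output cap, a long prompt by the remaining context, and a prompt longer than the window leaves no room at all.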

Benchmarks

General Intelligence
  MMLU: 85%
  GPQA Diamond: 86%
Mathematics
  MATH-500: Not reported
Programming
  HumanEval: 90%
  SWE-bench Verified: 77.8%
Reasoning
  IFEval: 88%

Recommended Use Cases

Coding, Analysis, General Chat, Data Extraction

Strengths

  • Faster, cheaper variant of the GLM 5 flagship, optimized for agentic workflows
  • Extremely low tool-calling error rate (0.67%) for reliable agent orchestration
  • 200K context window with up to 128K output tokens
  • Competitive pricing ($0.96/M input) compared to Western frontier models

Limitations

  • Closed-source, unlike the open-weight GLM 5 base model
  • Limited community feedback outside the Chinese market

Resources

This model may use your data for training