GLM-5 Turbo is a new model from Z.ai designed for fast inference and strong performance in agent-driven environments such as OpenClaw scenarios. It is deeply optimized for real-world agent workflows involving long execution chains, with improved complex instruction decomposition, tool use, scheduled and persistent execution, and overall stability across extended tasks.
Performance Tier
Balanced
Z.ai GLM 5 Turbo is a balanced model from Z.ai : strong performance at a reasonable price.
Strong cost-performance ratio. Reliable for most professional use cases without premium pricing.
Pricing
This model is included in Elosia plans
Type
per 1M tokens
Input (prompt)
$1.20
Output (completion)
$4.00
Cache read
$0.240
Capabilities
Context Length203K
Max Output Tokens131K
TokenizerOther
Inputtext
Outputtext
Release DateMarch 15, 2026
Benchmarks
General Intelligence
MMLU
85%
GPQA Diamond
86%
Mathematics
MATH-500
Not reported
Programming
HumanEval
90%
SWE-bench Verified
77.8%
Reasoning
IFEval
88%
Recommended Use Cases
CodingAnalysisGeneral ChatData Extraction
Strengths
Faster, cheaper variant of GLM 5 flagship optimized for agentic workflows
Extremely low tool-calling error rate (0.67%) for reliable agent orchestration
200K context window with up to 128K output tokens
Competitive pricing ($0.96/M input) compared to Western frontier models
Limitations
Closed-source unlike the open-weight GLM 5 base model