Z.ai GLM 5.1

Z.aiBalanced

ThinkingTool UseStructured Output

About this model

GLM-5.1 delivers a major leap in coding capability, with particularly significant gains in handling long-horizon tasks. Unlike previous models built around minute-level interactions, GLM-5.1 can work independently and continuously on...

Performance Tier

Balanced

Z.ai GLM 5.1 is a balanced model from Z.ai : strong performance at a reasonable price.

Strong cost-performance ratio. Reliable for most professional use cases without premium pricing.

Pricing

This model is included in Elosia plans

Moderate

Moderate cost. A balanced choice for regular use without constant cap watching.

Type	per 1M tokens
Input (prompt)	$0.980
Output (completion)	$3.08
Cache read	$0.490

Capabilities

Context Length203K

Max Output Tokens66K

TokenizerOther

Inputtext

Outputtext

Release DateApril 7, 2026

Benchmarks

General Intelligence

MMLU

Not reported

GPQA Diamond

86.2%

Mathematics

MATH-500

Not reported

Programming

HumanEval

Not reported

SWE-bench Verified

77.8%

LiveCodeBench

52%

Reasoning

Humanity's Last Exam

52.3%

Agentic

SWE-bench Pro

58.4%

Terminal-Bench 2.0

69%

Recommended Use Cases

CodingAnalysisResearchGeneral Chat

Strengths

754B MoE model (40B active), strong agentic coder on SWE-Bench Pro (58.4) — surpassed by GLM 5.2 for frontier use cases
Long-horizon agentic coding, sustains autonomous execution for up to 8 hours on a single task
200K context window with up to 131K output tokens
Open-weight MIT license, self-hostable, trained entirely on Huawei chips
Competitive pricing ($0.95/M input, $3.15/M output) vs Western frontier models

Limitations

Text-only : no multimodal (image/audio) capabilities
Smaller ecosystem of integrations and tooling outside Chinese market
Self-reported benchmark scoring, independent verification still limited

Resources

Official Documentation huggingface

This model may use your data for training

Similar Models

Z.ai

Z.ai

Z.ai

Z.ai