Grok 4 is xAI's latest reasoning model with a 256k context window. It supports parallel tool calling, structured outputs, and both image and text inputs. Note that reasoning is not exposed, reasoning cannot be disabled, and the reasoning effort cannot be specified. Pricing increases once the total tokens in a given request is greater than 128k tokens. See more details on the [xAI docs](https://docs.x.ai/docs/models/grok-4-0709)
Performance Tier
Balanced
Grok 4 is a balanced model from Grok : strong performance at a reasonable price.
Strong cost-performance ratio. Reliable for most professional use cases without premium pricing.
Pricing
This model is included in Elosia plans
Type
per 1M tokens
Input (prompt)
$3.00
Output (completion)
$15.00
Cache read
$0.750
Capabilities
Context Length256K
Max Output Tokens—
TokenizerGrok
Inputimage, text, file
Outputtext
Release DateJuly 9, 2025
Benchmarks
General Intelligence
MMLU
89.2%
GPQA Diamond
82%
Mathematics
MATH-500
88%
AIME 2025
91.7%
Programming
HumanEval
92.5%
SWE-bench Verified
68.5%
Reasoning
IFEval
88%
ARC-AGI-2
15.9%
Humanity's Last Exam
40%
Recommended Use Cases
CodingAnalysisResearchGeneral ChatMathematics
Strengths
xAI's most capable model with strong all-around performance
Excellent coding abilities (HumanEval 92.5%)
Strong math and science reasoning (GPQA 82%, MATH-500 88%)