Grok 4.1 Fast is xAI's best agentic tool calling model that shines in real-world use cases like customer support and deep research. 2M context window.
Reasoning can be enabled/disabled using the `reasoning` `enabled` parameter in the API. [Learn more in our docs](https://openrouter.ai/docs/use-cases/reasoning-tokens#controlling-reasoning-tokens)
Performance Tier
Balanced
Grok 4.1 Fast is a balanced model from Grok : strong performance at a reasonable price.
Strong cost-performance ratio. Reliable for most professional use cases without premium pricing.
Pricing
This model is included in Elosia plans
Type
per 1M tokens
Input (prompt)
$0.200
Output (completion)
$0.500
Cache read
$0.050
Capabilities
Context Length2.0M
Max Output Tokens30K
TokenizerGrok
Inputtext, image, file
Outputtext
Release DateNovember 19, 2025
Benchmarks
General Intelligence
MMLU
88.5%
Mathematics
MATH-500
86%
Programming
HumanEval
91%
Recommended Use Cases
General ChatCodingCreative Writing
Strengths
Optimized for low-latency responses with Grok 4.1 capabilities
Strong coding performance (HumanEval 91%)
Good general knowledge and conversational abilities