Gemini

Gemini 3.1 Pro

GeminiFlagship
ThinkingTool UseVisionStructured Output

About this model

Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows. Building on the multimodal foundation of the Gemini 3 series, it combines high-precision reasoning across text, image, video, audio, and code with a 1M-token context window. Reasoning Details must be preserved when using multi-turn tool calling, see our docs here: https://openrouter.ai/docs/use-cases/reasoning-tokens#preserving-reasoning. The 3.1 update introduces measurable gains in SWE benchmarks and real-world coding environments, along with stronger autonomous task execution in structured domains such as finance and spreadsheet-based workflows. Designed for advanced development and agentic systems, Gemini 3.1 Pro Preview improves long-horizon stability and tool orchestration while increasing token efficiency. It introduces a new medium thinking level to better balance cost, speed, and performance. The model excels in agentic coding, structured planning, multimodal analysis, and workflow automation, making it well-suited for autonomous agents, financial modeling, spreadsheet automation, and high-context enterprise tasks.

Performance Tier

Flagship

Gemini 3.1 Pro is a flagship model from Gemini : the most capable in their lineup.

Best-in-class model from this provider. Highest performance across benchmarks, ideal for demanding tasks.

Pricing

This model is included in Elosia plans
Typeper 1M tokens
Input (prompt)$2.00
Output (completion)$12.00
Image$2.00
Internal reasoning$12.00
Cache read$0.200
Cache write$0.375

Capabilities

Context Length1.0M
Max Output Tokens66K
TokenizerGemini
Inputaudio, file, image, text, video
Outputtext
Release DateFebruary 19, 2026

Benchmarks

General Intelligence
MMLU
Not reported
GPQA Diamond
94.3%
Mathematics
MATH-500
Not reported
AIME 2025
100%
Programming
HumanEval
Not reported
SWE-bench Verified
80.6%
Reasoning
IFEval
Not reported
ARC-AGI-2
77.1%
Humanity's Last Exam
51.4%
Multimodal
MMMU-Pro
80.5%

Recommended Use Cases

ResearchAnalysisCodingMathematicsSummarizationData Extraction

Strengths

  • ARC-AGI-2 77.1% — more than double Gemini 3 Pro reasoning score
  • Top-tier software engineering (SWE-bench 80.6%)
  • Exceptional science reasoning (GPQA Diamond 94.3%)
  • Enhanced agentic reliability for complex autonomous workflows

Limitations

  • Preview model — may have stability changes
  • Premium pricing ($2/$12 per M tokens)

Resources

This model may use your data for training

Similar Models