DeepSeek-V3.2 is a large language model designed to harmonize high computational efficiency with strong reasoning and agentic tool-use performance. It introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism that reduces training and inference cost while preserving quality in long-context scenarios. A scalable reinforcement learning post-training framework further improves reasoning, with reported performance in the GPT-5 class, and the model has demonstrated gold-medal results on the 2025 IMO and IOI. V3.2 also uses a large-scale agentic task synthesis pipeline to better integrate reasoning into tool-use settings, boosting compliance and generalization in interactive environments. Users can control the reasoning behaviour with the `reasoning` `enabled` boolean. [Learn more in our docs](https://openrouter.ai/docs/use-cases/reasoning-tokens#enable-reasoning-with-default-config)
DeepSeek v3.2 is a flagship model from DeepSeek : the most capable in their lineup.
Best-in-class model from this provider. Highest performance across benchmarks, ideal for demanding tasks.
| Type | per 1M tokens |
|---|---|
| Input (prompt) | $0.260 |
| Output (completion) | $0.380 |
| Cache read | $0.130 |
This model may use your data for training