GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads. It supports text and image inputs with strong performance across reasoning, coding, and tool use, while reducing latency and cost for large-scale deployments. The model is designed for production environments that require a balance of capability and efficiency, making it well suited for chat applications, coding assistants, and agent workflows that operate at scale. GPT-5.4 mini delivers reliable instruction following, solid multi-step reasoning, and consistent performance across diverse tasks with improved cost efficiency.
GPT-5.4 Mini is a balanced model from GPT : strong performance at a reasonable price.
Strong cost-performance ratio. Reliable for most professional use cases without premium pricing.
| Type | per 1M tokens |
|---|---|
| Input (prompt) | $0.750 |
| Output (completion) | $4.50 |
| Cache read | $0.075 |
This model may use your data for training