GPT-5.4 nano is the most lightweight and cost-efficient variant of the GPT-5.4 family, optimized for speed-critical and high-volume tasks. It accepts text, image, and file inputs and is designed for low-latency use cases such as classification, data extraction, ranking, and sub-agent execution.
The model prioritizes responsiveness and efficiency over deep reasoning, making it ideal for pipelines that require fast, reliable outputs at scale. GPT-5.4 nano is well suited for background tasks, real-time systems, and distributed agent architectures where minimizing cost and latency is essential.
Performance Tier
Compact
GPT-5.4 nano is a compact model in the GPT family: small, fast, and affordable. It is optimized for speed and low cost, which makes it a good fit for high-volume or simple tasks.
Pricing
This model is included in Elosia plans
Prices are per 1M tokens.
Input (prompt): $0.20
Output (completion): $1.25
Cache read: $0.02
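To make the per-token rates concrete, here is a minimal cost-estimation sketch. The rates are copied from the list above; the function name `estimate_cost` and the billing assumption (cached tokens are charged at the cache-read rate and are not also counted as regular input) are illustrative and not confirmed by this page.

```python
from decimal import Decimal

# Rates in dollars per 1M tokens, taken from the pricing list above.
RATES_PER_MILLION = {
    "input": Decimal("0.20"),
    "output": Decimal("1.25"),
    "cache_read": Decimal("0.02"),
}


def estimate_cost(input_tokens, output_tokens, cached_tokens=0):
    """Estimate the dollar cost of a request or batch.

    Assumption (hypothetical billing model): `input_tokens` excludes
    `cached_tokens`, which are billed only at the cache-read rate.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * RATES_PER_MILLION["input"]
        + Decimal(output_tokens) * RATES_PER_MILLION["output"]
        + Decimal(cached_tokens) * RATES_PER_MILLION["cache_read"]
    )
    return total / Decimal(1_000_000)


if __name__ == "__main__":
    # Example: classify 10,000 documents at 1,500 prompt tokens and
    # 100 completion tokens each.
    cost = estimate_cost(10_000 * 1_500, 10_000 * 100)
    print(f"Estimated batch cost: ${cost:.2f}")  # prints "Estimated batch cost: $4.25"
```

Decimal arithmetic is used here so that small per-request costs do not pick up floating-point rounding noise when summed over large batches.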
Capabilities
Context Length: 400K
Max Output Tokens: 128K
Tokenizer: GPT
Input: file, image, text
Output: text
Release Date: March 17, 2026
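The context and output limits interact when sizing a request. The sketch below shows one way to compute how many completion tokens can still be requested for a given prompt size. It rests on two assumptions not stated on this page: that "400K" and "128K" mean 400,000 and 128,000 tokens, and that prompt and completion share the same context window. The helper name `max_completion_budget` is hypothetical.

```python
# Assumed exact values for the "400K" and "128K" figures listed above.
CONTEXT_WINDOW = 400_000
MAX_OUTPUT_TOKENS = 128_000


def max_completion_budget(prompt_tokens):
    """Return the largest completion size that fits for this prompt.

    Assumes prompt and completion tokens count against one shared
    context window; verify this against the provider's documentation.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    # Cap by the output limit, and never return a negative budget.
    return max(0, min(MAX_OUTPUT_TOKENS, remaining))
```

Under these assumptions, a 100,000-token prompt leaves the full 128,000-token output limit available, while a 350,000-token prompt leaves only 50,000 tokens for the completion.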
Benchmarks
General Intelligence
MMLU: Not reported
Mathematics
MATH-500: Not reported
Programming
HumanEval: Not reported
Reasoning
Humanity's Last Exam: 24.3%
Agentic
SWE-bench Pro: 52.4%
Terminal-Bench 2.0: 46.3%
Recommended Use Cases
Data Extraction
Summarization
Customer Support
General Chat
Strengths
Extremely affordable at $0.20 per 1M input tokens, which suits high-volume batch processing
Ultra-fast response times for latency-sensitive applications
Significant upgrade over GPT-5 nano across all capabilities
Strong enough for classification, extraction, ranking, and simple coding subagents
Limitations
Limited reasoning depth; not suited to complex analysis or research