OpenAI: GPT-5.4 Mini
openai/gpt-5.4-miniAbout
GPT-5.4 Mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads, running more than 2x faster than its predecessor while lowering latency and cost. It accepts text, image, and file inputs with performance across reasoning, coding, and tool use, making it suited to chat applications, coding assistants, and agent workflows that operate at scale. The model provides a 400K-token context window with up to 128K tokens of output and supports reasoning. GPT-5.4 Mini was released in March 2026.
Capabilities
- Context Length
- 400K
- Max Output
- 128K
- Reasoning
- Yes
- In
- file, image, text
- Out
- text
Benchmarks
View leaderboardReasoning & Knowledge
Coding & Agentic
Source: Artificial Analysis
Pricing
Full pricing| Type | Price / 1M tokens |
|---|---|
| Input | $0.75 |
| Output | $4.50 |
| Cache Read | $0.075 |
| Web Search | $0.01 / call |
OpenAI-compatible · Model ID openai/gpt-5.4-mini
curl https://api.elliotgate.com/v1/chat/completions \
-H "Authorization: Bearer sk-omg-your-api-key" \
-H "Content-Type: application/json" \
-d '{
"model": "openai/gpt-5.4-mini",
"messages": [{"role": "user", "content": "Hello!"}]
}'OFTEN COMPARED
GPT-5.4 Mini comparisons
Decide which model wins on the dimensions that matter for your workload — context, benchmarks, pricing, or serving latency.
GPT-5.4 Mini vs MiMo-V2-Pro
GPT-5.
See full comparison →GPT-5.4 Mini vs Grok 4.20
GPT-5.
See full comparison →GPT-5.4 Mini vs MiniMax M2.7
GPT-5.
See full comparison →GPT-5.4 Mini vs GLM 5
GLM 5 and GPT-5.
See full comparison →GPT-5.4 Mini vs Qwen3.6 Plus
GPT-5.
See full comparison →GPT-5.4 Mini vs GLM 5 Turbo
GLM 5 Turbo and GPT-5.
See full comparison →GPT-5.4 Mini vs Kimi K2.5
GPT-5.
See full comparison →GPT-5.4 Mini vs Claude Opus 4.6 (Fast)
Claude Opus 4.
See full comparison →