GPT-5.4 mini

Name: GPT-5.4 mini
Brand: OpenAI
Price: 0.75 USD per 1M input tokens

OpenAI · Active · Updated May 10, 2026

OpenAI's most cost-efficient small model, redesigned for high-volume production with improved reasoning and speed.

Pricing

Input Price	$0.75 per million tokens
Output Price	$4.50 per million tokens
Context Window	262,144 tokens
Max Output	16,384 tokens

Technical Specifications

Provider	OpenAI
Release Date	March 15, 2026
Pricing Type	per token
Input Price	$0.75 / 1M tokens
Output Price	$4.50 / 1M tokens
Cached Input	$0.075 / 1M tokens
Context Window	262,144 tokens
Max Output	16,384 tokens
Input Modalities	text, image
Output Modalities	text
Status	active
Availability	api
Latency	very fast
Rate Limit	30,000 RPM
Docs URL	—
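
The 30,000 RPM rate limit in the table above can be turned into planning numbers with simple arithmetic. The sketch below is illustrative only: the helper names are hypothetical, and it assumes the limit is applied as a flat requests-per-minute ceiling with no separate token-based limit.

```python
import math

# Rate limit taken from the specification table above.
RATE_LIMIT_RPM = 30_000


def min_interval_seconds(rpm: int = RATE_LIMIT_RPM) -> float:
    """Smallest average gap between requests that stays under the limit."""
    if rpm <= 0:
        raise ValueError("rpm must be positive")
    return 60.0 / rpm


def minutes_to_process(num_requests: int, rpm: int = RATE_LIMIT_RPM) -> int:
    """Whole minutes needed to send num_requests at the full rate (assumed flat limit)."""
    if num_requests < 0:
        raise ValueError("num_requests must be non-negative")
    return math.ceil(num_requests / rpm)


if __name__ == "__main__":
    print(f"Minimum spacing: {min_interval_seconds() * 1000:.1f} ms per request")
    print(f"1,000,000 requests: about {minutes_to_process(1_000_000)} minutes")
```

At the listed limit this works out to roughly 500 requests per second, so a batch of one million requests would take a little over half an hour if the limit were the only bottleneck.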

Capability Scores

Coding	76/100
Reasoning	—
Math	—
Image	—
Speed	94/100

Overview

GPT-5.4 mini is OpenAI's cost-optimized small model, designed for high-volume, low-latency applications. It inherits the 256K (262,144-token) context window from its larger sibling while delivering substantially better reasoning and coding performance than its predecessor, GPT-4o mini. At $0.75 per million input tokens, it is positioned as the best price-to-performance option in OpenAI's lineup.

Pros

+Extremely affordable — $0.75/M input tokens
+Very fast inference — highest speed score (94/100)
+256K context window — double the previous generation

Cons

−Lower accuracy than larger models on complex reasoning and coding tasks
−Struggles with nuanced instruction following
−Text-only output, no audio or image generation

Compare with Alternatives

vs Gemini 2.5 Flash

GPT-5.4 mini wins on coding and reasoning; Gemini 2.5 Flash wins on speed, pricing, and context window.

Use Cases

High-volume text classification and routing

Chatbots and customer service at scale

Simple content generation with long-context support

Frequently Asked Questions about GPT-5.4 mini

How much does GPT-5.4 mini cost?

GPT-5.4 mini costs $0.75 per million input tokens and $4.50 per million output tokens. Cached input is $0.075 per million tokens.
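
To see what these rates mean for a single request, the arithmetic can be sketched as below. This is an illustrative estimate, not an official billing formula: the function name is hypothetical, the prices are the ones quoted in this answer, and it assumes cached tokens are a subset of the input tokens that is billed at the cached rate instead of the standard input rate.

```python
from decimal import Decimal

# USD per 1M tokens, as quoted in the FAQ answer above.
INPUT_PER_M = Decimal("0.75")
OUTPUT_PER_M = Decimal("4.50")
CACHED_INPUT_PER_M = Decimal("0.075")
MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int,
                      cached_input_tokens: int = 0) -> Decimal:
    """Estimate the cost of one request in USD.

    Assumption: cached_input_tokens is the portion of input_tokens
    charged at the cached rate; the rest is charged at the input rate.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    if not 0 <= cached_input_tokens <= input_tokens:
        raise ValueError("cached_input_tokens must be between 0 and input_tokens")
    uncached = input_tokens - cached_input_tokens
    total = (uncached * INPUT_PER_M
             + cached_input_tokens * CACHED_INPUT_PER_M
             + output_tokens * OUTPUT_PER_M)
    return total / MILLION


if __name__ == "__main__":
    # A 10,000-token prompt with a 500-token reply.
    print(estimate_cost_usd(10_000, 500))
```

Under these assumptions, a 10,000-token prompt with a 500-token reply costs just under one cent ($0.00975), and output tokens account for about a quarter of that despite being only 5% of the total tokens.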

What is the context window of GPT-5.4 mini?

GPT-5.4 mini has a 262,144 token context window, with a maximum output of 16,384 tokens.
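
A quick way to apply these two limits is to check how much output a given prompt leaves room for. The sketch below is a rough budgeting aid with a hypothetical helper name, and it assumes input and output tokens share the same 262,144-token window, which this page does not state explicitly.

```python
# Limits quoted in the FAQ answer above.
CONTEXT_WINDOW = 262_144  # equals 256 * 1024, i.e. "256K"
MAX_OUTPUT = 16_384


def max_output_budget(input_tokens: int,
                      context_window: int = CONTEXT_WINDOW,
                      max_output: int = MAX_OUTPUT) -> int:
    """Largest output length (in tokens) that fits alongside the prompt.

    Assumption: prompt and completion tokens count against one shared window.
    Returns 0 when the prompt alone fills or exceeds the window.
    """
    if input_tokens < 0:
        raise ValueError("input_tokens must be non-negative")
    remaining = context_window - input_tokens
    return max(0, min(max_output, remaining))


if __name__ == "__main__":
    for prompt_tokens in (1_000, 250_000, 262_100, 300_000):
        print(prompt_tokens, "->", max_output_budget(prompt_tokens))
```

With these numbers, the output cap of 16,384 tokens is the binding limit until the prompt exceeds 245,760 tokens; beyond that, the remaining window shrinks the available output.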

Is GPT-5.4 mini good for coding?

GPT-5.4 mini scores 76/100 on coding benchmarks.

What modalities does GPT-5.4 mini support?

GPT-5.4 mini accepts text and image input and produces text output.
