Skip to content

GPT-5.4 mini

OpenAI · GPT · measured on live video

Overshoot API id gpt-5.4-mini

Compare this model

ClosedReasoningImage + Video

Intelligence
76
Speed
245 tok/s
TTFT
60 ms
E2E
3.0 s
Blended $/1M
$4.78
Params
Context
256k
License
Proprietary
Released
Mar 2026

Speed, TTFT, E2E, and price at OpenAI API, US East (Virginia) reference region. Rank is position on the VLM Intelligence Index.

Capability breakdown

OCR & Text85%Document & Chart78%Scene & Spatial76%Video QA70%Grounding63%Struct. Extraction77%

Intelligence over time

647280Mar 2026Apr 2026Jun 2026

Provider & region options

Fastest loop: Overshoot · US East (Virginia) · 2.6 s end to end.

OvershootUS East (Virginia)248 tok/s46 ms2.6 s$5.00
OvershootUS West (Oregon)248 tok/s54 ms2.6 s$5.00
OvershootEU West (Ireland)248 tok/s55 ms2.6 s$5.00
OvershootEU Central (Frankfurt)248 tok/s64 ms2.7 s$5.00
OvershootAsia Pacific (Tokyo)248 tok/s81 ms2.7 s$5.00
OvershootAsia Pacific (Mumbai)248 tok/s97 ms2.7 s$5.00
OvershootMiddle East (Dubai)248 tok/s98 ms2.7 s$5.00
OvershootSouth America (São Paulo)248 tok/s107 ms2.7 s$5.00
OpenAI APIUS East (Virginia)245 tok/s60 ms3.0 s$4.78
OpenAI APIUS West (Oregon)245 tok/s72 ms3.0 s$4.78
OpenAI APIEU West (Ireland)245 tok/s72 ms3.0 s$4.78
OpenAI APIAsia Pacific (Tokyo)245 tok/s95 ms3.0 s$4.78

Sample outputs

Illustrative samples shared across models, not verbatim output from GPT-5.4 mini.

Scene description

Describe what's happening in this camera frame.

A forklift is reversing toward loading bay 3 while a worker in a hi-vis vest signals from the left. Two pallets remain unwrapped near the dock edge.

OCR + extraction

Read the shipping label and return structured JSON.

{ "tracking": "1Z A24 9W7 03 8421 990 4", "carrier": "UPS", "weight_kg": 12.4, "dest": "Portland, OR 97204" }

Grounding

Point to every person not wearing a hard hat.

2 detections, [x:0.41, y:0.62, w:0.08, h:0.19] conf 0.94 · [x:0.77, y:0.55, w:0.07, h:0.21] conf 0.88.

Related models

GPT-5.4vs
Index
85
Best speed
96 tok/s
Blended $/1M
$6.68
GPT-5.4 nanovs
Index
64
Best speed
354 tok/s
Blended $/1M
$3.20

Where it ranks