Skip to content

GPT-5.4

OpenAI · GPT · measured on live video

Overshoot API id gpt-5.4

Compare this model

ClosedReasoningImage + Video

Intelligence
85
Speed
91 tok/s
TTFT
138 ms
E2E
9.3 s
Blended $/1M
$6.68
Params
Context
400k
License
Proprietary
Released
Mar 2026

Speed, TTFT, E2E, and price at OpenAI API, US East (Virginia) reference region. Rank is position on the VLM Intelligence Index.

Capability breakdown

OCR & Text95%Document & Chart85%Scene & Spatial86%Video QA83%Grounding76%Struct. Extraction85%

Intelligence over time

728190Mar 2026Apr 2026Jun 2026

Provider & region options

Fastest loop: Overshoot · US East (Virginia) · 8.9 s end to end.

OvershootUS East (Virginia)96 tok/s87 ms8.9 s$6.63
OvershootUS West (Oregon)96 tok/s101 ms8.9 s$6.63
OvershootEU West (Ireland)96 tok/s103 ms8.9 s$6.63
OvershootEU Central (Frankfurt)96 tok/s117 ms9.0 s$6.63
OvershootAsia Pacific (Tokyo)96 tok/s129 ms9.0 s$6.63
OvershootAsia Pacific (Mumbai)96 tok/s137 ms9.0 s$6.63
OvershootMiddle East (Dubai)96 tok/s141 ms9.0 s$6.63
OvershootSouth America (São Paulo)96 tok/s157 ms9.0 s$6.63
OpenAI APIUS West (Oregon)91 tok/s137 ms9.3 s$6.68
OpenAI APIUS East (Virginia)91 tok/s138 ms9.3 s$6.68
OpenAI APIEU West (Ireland)91 tok/s154 ms9.3 s$6.68
OpenAI APIAsia Pacific (Tokyo)91 tok/s172 ms9.3 s$6.68

Sample outputs

Illustrative samples shared across models, not verbatim output from GPT-5.4.

Scene description

Describe what's happening in this camera frame.

A forklift is reversing toward loading bay 3 while a worker in a hi-vis vest signals from the left. Two pallets remain unwrapped near the dock edge.

OCR + extraction

Read the shipping label and return structured JSON.

{ "tracking": "1Z A24 9W7 03 8421 990 4", "carrier": "UPS", "weight_kg": 12.4, "dest": "Portland, OR 97204" }

Grounding

Point to every person not wearing a hard hat.

2 detections, [x:0.41, y:0.62, w:0.08, h:0.19] conf 0.94 · [x:0.77, y:0.55, w:0.07, h:0.21] conf 0.88.

Related models

GPT-5.4 minivs
Index
76
Best speed
248 tok/s
Blended $/1M
$4.78
GPT-5.4 nanovs
Index
64
Best speed
354 tok/s
Blended $/1M
$3.20

Where it ranks