OpenAI API

Hosted API

US East (Virginia)3.0 s, view the regional leaderboard
US West (Oregon)3.0 s, view the regional leaderboard
EU West (Ireland)3.0 s, view the regional leaderboard
Asia Pacific (Tokyo)3.0 s, view the regional leaderboard

Models on live video: 3
Regions measured: 4
Real-time models: 0 of 3
Median output speed: 245 tok/s
Median E2E · US East: 3.0 s
Median blended $/1M: $4.78

Models on OpenAI API

#
1	View gpt-5-4-nanoGPT-5.4 nanoReasoning	OpenAI	354 tok/s	48 ms	1.7 s	$3.20	–
2	View gpt-5-4-miniGPT-5.4 miniReasoning	OpenAI	245 tok/s	60 ms	3.0 s	$4.78	–
3	View gpt-5-4GPT-5.4Reasoning	OpenAI	91 tok/s	138 ms	9.3 s	$6.68	–

Output speed by model

Reasoning models are indicated by a lightbulb icon

Real-time coverage

No model clears the 200 ms bar on OpenAI API yet, in any region we measure.

Latency columns use the US East (Virginia) reference region. Real-time marks a model that closes the loop under 200 ms end to end on OpenAI API from at least one region.