OpenAI API
Hosted API- US East (Virginia)3.0 s, view the regional leaderboard
- US West (Oregon)3.0 s, view the regional leaderboard
- EU West (Ireland)3.0 s, view the regional leaderboard
- Asia Pacific (Tokyo)3.0 s, view the regional leaderboard
- Models on live video
- 3
- Regions measured
- 4
- Real-time models
- 0 of 3
- Median output speed
- 245 tok/s
- Median E2E · US East
- 3.0 s
- Median blended $/1M
- $4.78
Models on OpenAI API
| # | Details | |||||||
|---|---|---|---|---|---|---|---|---|
| 1 | View gpt-5-4-nanoGPT-5.4 nanoReasoning | OpenAI | 354 tok/s | 48 ms | 1.7 s | $3.20 | – | |
| 2 | View gpt-5-4-miniGPT-5.4 miniReasoning | OpenAI | 245 tok/s | 60 ms | 3.0 s | $4.78 | – | |
| 3 | View gpt-5-4GPT-5.4Reasoning | OpenAI | 91 tok/s | 138 ms | 9.3 s | $6.68 | – |
Reasoning models are indicated by a lightbulb icon
Real-time coverage
No model clears the 200 ms bar on OpenAI API yet, in any region we measure.
Latency columns use the US East (Virginia) reference region. Real-time marks a model that closes the loop under 200 ms end to end on OpenAI API from at least one region.