Every vision model, measured for real time
Overshoot runs every Vision Language Model on live video and benchmarks what actually ships: capability, speed, latency under the 200 ms bar, and cost, across every provider and region.
- models measured
- 13models measured
- providers
- 11providers
- regions
- 8regions
- Real-time bar
- 200 msreal-time bar
Latest
All updatesWho leads right now
Top models
Full leaderboard| # | Model | Developer | Index | Speed | $ / 1M | Readiness |
|---|---|---|---|---|---|---|
| 1 | Anthropic | 86 | 55 tok/s | $6.93 | Batch | |
| 2 | OpenAI | 85 | 91 tok/s | $6.68 | Batch | |
| 3 | Anthropic | 83 | 95 tok/s | $6.32 | Batch | |
| 4 | 83 | 106 tok/s | $5.83 | Batch | ||
| 5 | Alibaba (Qwen) | 81 | 406 tok/s | $0.12 | Batch |
Intelligence isn't free
Cost per task is the weighted average USD to run one Intelligence Index task at each model's reference provider. Reasoning models are marked with a bulb.