Skip to content

Every vision model, measured for real time

Overshoot runs every Vision Language Model on live video and benchmarks what actually ships: capability, speed, latency under the 200 ms bar, and cost, across every provider and region.

models measured
13models measured
providers
11providers
regions
8regions
Real-time bar
200 msreal-time bar

Who leads right now

#ModelDeveloperIndexSpeed$ / 1MReadiness
1Claude Opus 4.6Anthropic8655 tok/s$6.93Batch
2GPT-5.4OpenAI8591 tok/s$6.68Batch
3Claude Sonnet 4.6Anthropic8395 tok/s$6.32Batch
4Gemini 3.1 ProGoogle83106 tok/s$5.83Batch
5Qwen3.6 35B A3BAlibaba (Qwen)81406 tok/s$0.12Batch

Intelligence isn't free

Cost per task is the weighted average USD to run one Intelligence Index task at each model's reference provider. Reasoning models are marked with a bulb.

What we measure

Pick the model that sees and acts in real time