Skip to content

Novita

Hosted API
Models on live video
2
Regions measured
2
Real-time models
1 of 2
Median output speed
195 tok/s
Median E2E · US East
213 ms
Median blended $/1M
$0.15

Models on Novita

#Details
1View gemma-4-26b-a4bGemma 4 26B A4BGoogle
241 tok/s
69 ms168 ms$0.09<200 ms
2View gemma-4-31bGemma 4 31BGoogle
150 tok/s
98 ms259 ms$0.21
Output speed by model
Output speed by model. 2 bars. Highest: Gemma 4 26B A4B at 241 tok/s. Toggle the data table for exact values.Gemma 4 26B A4BGemma 4 26B A4B241 tok/sGemma 4 31BGemma 4 31B150 tok/s

Real-time coverage

US East (Virginia)1 of 2 under 200 ms
Asia Pacific (Tokyo)1 of 2 under 200 ms

Latency columns use the US East (Virginia) reference region. Real-time marks a model that closes the loop under 200 ms end to end on Novita from at least one region.