Qwen3.6 35B A3B

Alibaba (Qwen) · Qwen3.6 · measured on live video

Overshoot API id: Qwen/Qwen3.6-35B-A3B-FP8

Open weights · Reasoning · Image + Video

Intelligence: 81
Speed: 406 tok/s
TTFT: 35 ms
E2E: 2.9 s
Blended $/1M: $0.12
Params: 35B
Context: 256k
License: Apache-2.0
Released: May 2026

Speed, TTFT, E2E, and price at Overshoot, US East (Virginia) reference region. Rank is position on the VLM Intelligence Index.

Capability breakdown

OCR & Text 93% · Document & Chart 84% · Scene & Spatial 79% · Video QA 78% · Grounding 69% · Struct. Extraction 80%

Intelligence over time

[Chart: intelligence score, May 2026 to Jun 2026; vertical axis marked at 70, 77, 84]

Provider & region options

Fastest loop: DeepInfra · US East (Virginia) · 2.4 s end to end.

Provider · Region · Speed · TTFT · E2E · Blended $/1M
DeepInfra · US East (Virginia) · 246 tok/s · 67 ms · 2.4 s · $0.09
DeepInfra · EU Central (Frankfurt) · 246 tok/s · 89 ms · 2.5 s · $0.09
vLLM (self-host) · US East (Virginia) · 219 tok/s · 66 ms · 2.7 s · $0.08
vLLM (self-host) · EU West (Ireland) · 219 tok/s · 90 ms · 2.7 s · $0.08
Fireworks · US East (Virginia) · 310 tok/s · 59 ms · 2.7 s · $0.12
Fireworks · US West (Oregon) · 310 tok/s · 59 ms · 2.7 s · $0.12
Fireworks · EU West (Ireland) · 310 tok/s · 75 ms · 2.8 s · $0.12
Overshoot · US East (Virginia) · 406 tok/s · 35 ms · 2.9 s · $0.12
Overshoot · US West (Oregon) · 406 tok/s · 42 ms · 2.9 s · $0.12
Overshoot · EU West (Ireland) · 406 tok/s · 51 ms · 2.9 s · $0.12
Overshoot · EU Central (Frankfurt) · 406 tok/s · 51 ms · 2.9 s · $0.12
Overshoot · Asia Pacific (Tokyo) · 406 tok/s · 68 ms · 2.9 s · $0.12
Overshoot · Middle East (Dubai) · 406 tok/s · 83 ms · 2.9 s · $0.12
Overshoot · Asia Pacific (Mumbai) · 406 tok/s · 87 ms · 2.9 s · $0.12
Overshoot · South America (São Paulo) · 406 tok/s · 91 ms · 2.9 s · $0.12
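
The figures listed above can be compared programmatically. The following is a minimal Python sketch that hard-codes the listed rows and picks an option by end-to-end time, TTFT, or price. The `Option` type and the helper names are hypothetical, introduced only for illustration, and the tie-breaking order is an assumption rather than how the page itself ranks options.

```python
from typing import NamedTuple


class Option(NamedTuple):
    provider: str
    region: str
    tok_per_s: int
    ttft_ms: int
    e2e_s: float
    usd_per_m: float  # blended price per 1M tokens


# Rows copied from the provider and region listing on this page.
OPTIONS = [
    Option("DeepInfra", "US East (Virginia)", 246, 67, 2.4, 0.09),
    Option("DeepInfra", "EU Central (Frankfurt)", 246, 89, 2.5, 0.09),
    Option("vLLM (self-host)", "US East (Virginia)", 219, 66, 2.7, 0.08),
    Option("vLLM (self-host)", "EU West (Ireland)", 219, 90, 2.7, 0.08),
    Option("Fireworks", "US East (Virginia)", 310, 59, 2.7, 0.12),
    Option("Fireworks", "US West (Oregon)", 310, 59, 2.7, 0.12),
    Option("Fireworks", "EU West (Ireland)", 310, 75, 2.8, 0.12),
    Option("Overshoot", "US East (Virginia)", 406, 35, 2.9, 0.12),
    Option("Overshoot", "US West (Oregon)", 406, 42, 2.9, 0.12),
    Option("Overshoot", "EU West (Ireland)", 406, 51, 2.9, 0.12),
    Option("Overshoot", "EU Central (Frankfurt)", 406, 51, 2.9, 0.12),
    Option("Overshoot", "Asia Pacific (Tokyo)", 406, 68, 2.9, 0.12),
    Option("Overshoot", "Middle East (Dubai)", 406, 83, 2.9, 0.12),
    Option("Overshoot", "Asia Pacific (Mumbai)", 406, 87, 2.9, 0.12),
    Option("Overshoot", "South America (São Paulo)", 406, 91, 2.9, 0.12),
]


def fastest_loop(options):
    """Lowest end-to-end time; ties broken by TTFT (assumed tie-break)."""
    return min(options, key=lambda o: (o.e2e_s, o.ttft_ms))


def lowest_ttft(options):
    """Lowest time to first token."""
    return min(options, key=lambda o: o.ttft_ms)


def cheapest(options):
    """Lowest blended price; ties broken by end-to-end time, then TTFT."""
    return min(options, key=lambda o: (o.usd_per_m, o.e2e_s, o.ttft_ms))


if __name__ == "__main__":
    for label, pick in [
        ("fastest loop", fastest_loop(OPTIONS)),
        ("lowest TTFT", lowest_ttft(OPTIONS)),
        ("cheapest", cheapest(OPTIONS)),
    ]:
        print(f"{label}: {pick.provider} · {pick.region}")
```

Note that the option with the highest token throughput is not the one with the shortest end-to-end time in this listing, so the selection criterion matters.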

Sample outputs

Illustrative samples shared across models, not verbatim output from Qwen3.6 35B A3B.

Scene description

Describe what's happening in this camera frame.

A forklift is reversing toward loading bay 3 while a worker in a hi-vis vest signals from the left. Two pallets remain unwrapped near the dock edge.

OCR + extraction

Read the shipping label and return structured JSON.

{ "tracking": "1Z A24 9W7 03 8421 990 4", "carrier": "UPS", "weight_kg": 12.4, "dest": "Portland, OR 97204" }
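
Structured output like the sample above is usually validated before use. The following is a minimal Python sketch that parses the sample JSON and checks field presence and types. The required-field schema and the `parse_label` helper are assumptions made for illustration, inferred only from the sample shown here, not a documented output contract.

```python
import json

# Assumed schema, inferred from the illustrative sample only.
REQUIRED_FIELDS = {
    "tracking": str,
    "carrier": str,
    "weight_kg": (int, float),
    "dest": str,
}


def parse_label(raw: str) -> dict:
    """Parse model output as JSON and check the assumed fields and types."""
    data = json.loads(raw)  # raises json.JSONDecodeError on malformed output
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    for key, expected in REQUIRED_FIELDS.items():
        if key not in data:
            raise ValueError(f"missing field: {key}")
        value = data[key]
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, expected):
            raise ValueError(f"unexpected type for field: {key}")
    return data


SAMPLE = (
    '{ "tracking": "1Z A24 9W7 03 8421 990 4", "carrier": "UPS", '
    '"weight_kg": 12.4, "dest": "Portland, OR 97204" }'
)

label = parse_label(SAMPLE)
# Strip the display spacing from the tracking number for downstream lookups.
compact_tracking = label["tracking"].replace(" ", "")

if __name__ == "__main__":
    print(compact_tracking, label["carrier"], label["weight_kg"])
```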

Grounding

Point to every person not wearing a hard hat.

2 detections, [x:0.41, y:0.62, w:0.08, h:0.19] conf 0.94 · [x:0.77, y:0.55, w:0.07, h:0.21] conf 0.88.
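
A detection line in the format above can be turned into pixel boxes for drawing or cropping. The following is a minimal Python sketch that does so, under two assumptions the page does not state: that the coordinates are normalized to the range 0 to 1 relative to frame width and height, and that (x, y) is the top-left corner of the box. The 1920x1080 frame size and the helper names are hypothetical.

```python
import re

NUM = r"(\d+(?:\.\d+)?)"
# Matches one box in the text format shown in the grounding sample.
BOX_RE = re.compile(
    rf"\[x:{NUM}, y:{NUM}, w:{NUM}, h:{NUM}\] conf {NUM}"
)


def parse_detections(text: str) -> list[tuple[float, float, float, float, float]]:
    """Return (x, y, w, h, conf) tuples for every box found in the text."""
    return [
        tuple(float(g) for g in match.groups())
        for match in BOX_RE.finditer(text)
    ]


def to_pixels(box, frame_w: int, frame_h: int) -> tuple[int, int, int, int]:
    """Scale a normalized box to pixels (assumes 0..1 coords, top-left origin)."""
    x, y, w, h, _conf = box
    return (
        round(x * frame_w),
        round(y * frame_h),
        round(w * frame_w),
        round(h * frame_h),
    )


SAMPLE = (
    "2 detections, [x:0.41, y:0.62, w:0.08, h:0.19] conf 0.94 · "
    "[x:0.77, y:0.55, w:0.07, h:0.21] conf 0.88."
)

detections = parse_detections(SAMPLE)
# Hypothetical 1920x1080 frame, chosen only for this example.
pixel_boxes = [to_pixels(d, 1920, 1080) for d in detections]

if __name__ == "__main__":
    for det, px in zip(detections, pixel_boxes):
        print(f"conf {det[4]:.2f} -> x={px[0]} y={px[1]} w={px[2]} h={px[3]}")
```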

Related models

Qwen3.6 27B · Real-time
Index: 77
Best speed: 253 tok/s
Blended $/1M: $0.24

Where it ranks