modelstop.top

Compare Models

Run side-by-side checks for pricing, context window, and latency.

Qwen: Qwen3 VL 32B Instruct

qwen

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...

Context window
131,072 tokens
Input cost
$0.10 / 1M
Output cost
$0.42 / 1M
Latency (p50)