Qwen: Qwen2.5 VL 32B Instruct
qwen
Qwen2.5-VL-32B is a multimodal vision-language model fine-tuned through reinforcement learning for enhanced mathematical reasoning, structured outputs, and visual problem-solving capabilities. It excels at visual analysis tasks, including object recognition, textual...
- Context window: 128,000 tokens
- Input cost: $0.20 / 1M tokens
- Output cost: $0.60 / 1M tokens
- Latency (p50): n/a
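
As a rough illustration of how the per-token prices above translate into a per-request cost, here is a minimal sketch. The function name `estimate_cost_usd` and the example token counts are hypothetical; the only values taken from the listing are the two prices. It assumes the token counts are already known (for example, reported by the API response) and that image inputs are already reflected in the input token count.

```python
# Prices copied from the listing above, in USD per 1,000,000 tokens.
INPUT_USD_PER_MILLION = 0.20
OUTPUT_USD_PER_MILLION = 0.60


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from its token counts.

    Hypothetical helper for illustration only; actual billing may
    differ (rounding, minimum charges, provider-specific rules).
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens * INPUT_USD_PER_MILLION / 1_000_000
    output_cost = output_tokens * OUTPUT_USD_PER_MILLION / 1_000_000
    return input_cost + output_cost


# Example: a 100,000-token prompt with a 2,000-token reply.
cost = estimate_cost_usd(100_000, 2_000)
print(f"${cost:.4f}")
```

Output tokens cost three times as much as input tokens at these rates, so long generations tend to dominate the bill more quickly than long prompts do.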
