Z.ai: GLM 4.5V
z-ai
GLM-4.5V is a vision-language foundation model for multimodal agent applications. Built on a Mixture-of-Experts (MoE) architecture with 106B parameters and 12B activated parameters, it achieves state-of-the-art results in video understanding,...
- Context window
- 65,536 tokens
- Input cost
- $0.60 / 1M
- Output cost
- $1.80 / 1M
- Latency (p50)
- —
