qwen3-30b-a3b-fp8
qwen
Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groundbreaking advancements in reasoning, instruction-following, agent capabilities, and multilingual support.
- Context window
- 32,768 tokens
- Input cost
- $0.05 / 1M
- Output cost
- $0.34 / 1M
- Latency (p50)
- —
