MiniMax: MiniMax M1
minimax
MiniMax-M1 is a large-scale, open-weight reasoning model designed for extended context and high-efficiency inference. It leverages a hybrid Mixture-of-Experts (MoE) architecture paired with a custom "lightning attention" mechanism, allowing it...
- Context window: 1,000,000 tokens
- Input cost: $0.40 / 1M tokens
- Output cost: $2.20 / 1M tokens
- Latency (p50): n/a
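
The listed prices can be turned into a rough per-request estimate. The following is a minimal sketch, not an official calculator: the function names `estimate_cost_usd` and `fits_context` are hypothetical, the prices are copied from the listing above and may change, and treating input and output tokens as sharing a single 1,000,000-token window is an assumption that this listing does not confirm.

```python
from decimal import Decimal

# Values copied from the listing above (USD per 1M tokens); they may change.
INPUT_USD_PER_MILLION = Decimal("0.40")
OUTPUT_USD_PER_MILLION = Decimal("2.20")
CONTEXT_WINDOW_TOKENS = 1_000_000


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request from its token counts (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    million = Decimal(1_000_000)
    input_cost = Decimal(input_tokens) * INPUT_USD_PER_MILLION / million
    output_cost = Decimal(output_tokens) * OUTPUT_USD_PER_MILLION / million
    return input_cost + output_cost


def fits_context(input_tokens: int, output_tokens: int) -> bool:
    """Check the request against the listed window.

    Assumption: input and output tokens count against the same window.
    """
    return input_tokens + output_tokens <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    cost = estimate_cost_usd(input_tokens=200_000, output_tokens=10_000)
    print(f"Estimated cost: ${cost}")
    print(f"Fits in context: {fits_context(200_000, 10_000)}")
```

With these prices, a request with 200,000 input tokens and 10,000 output tokens comes to about $0.102. `Decimal` is used instead of floats so that small per-token amounts do not pick up binary rounding noise.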
