Meituan: LongCat Flash Chat
meituan
LongCat-Flash-Chat is a large-scale Mixture-of-Experts (MoE) model with 560B total parameters, of which 18.6B–31.3B (≈27B on average) are dynamically activated per token, depending on context. It introduces a shortcut-connected MoE design to reduce...
- Context window: 131,072 tokens
- Input cost: $0.20 / 1M tokens
- Output cost: $0.80 / 1M tokens
- Latency (p50): not reported
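To make the pricing concrete, here is a minimal sketch of a per-request cost estimate using the listed rates. The helper name `estimate_cost` is hypothetical, and the check that prompt and completion tokens together must fit in the 131,072-token context window is an assumption, since the listing does not say how the window is split between input and output.

```python
# Rates and context size copied from the listing (USD per 1M tokens).
INPUT_USD_PER_M = 0.20
OUTPUT_USD_PER_M = 0.80
CONTEXT_WINDOW = 131_072  # tokens


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Assumption: prompt and completion share a single context window.
    if input_tokens + output_tokens > CONTEXT_WINDOW:
        raise ValueError("request exceeds the 131,072-token context window")
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # 100,000 prompt tokens and 2,000 completion tokens.
    print(f"${estimate_cost(100_000, 2_000):.4f}")  # prints $0.0216
```

At these rates an output token costs four times as much as an input token, so long completions dominate the bill faster than long prompts do.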
