kimi-k2-thinking
ollama
kimi-k2-thinking — available to run locally via Ollama on CPU and GPU hardware.
- Context window
- 262,144 tokens
- Input cost
- —
- Output cost
- —
- Latency (p50)
- —
Run side-by-side checks for pricing, context window, and latency.
ollama
kimi-k2-thinking — available to run locally via Ollama on CPU and GPU hardware.