NVIDIA: Nemotron Nano 12B 2 VL (free)
nvidia
NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s...
- Context window
- 128,000 tokens
- Input cost
- $0.20 / 1M
- Output cost
- $0.60 / 1M
- Latency (p50)
- —
