modelstop.top

Compare Models

Run side-by-side checks for pricing, context window, and latency.

NVIDIA: Nemotron Nano 12B 2 VL (free)

nvidia

NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s...

Context window
128,000 tokens
Input cost
$0.20 / 1M
Output cost
$0.60 / 1M
Latency (p50)