modelstop.top

Compare Models

Run side-by-side checks for pricing, context window, and latency.

NVIDIA: Nemotron Nano 9B V2 (free)

nvidia

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and...

Context window
128,000 tokens
Input cost
$0.04 / 1M tokens
Output cost
$0.16 / 1M tokens
Latency (p50)