NVIDIA: Nemotron Nano 9B V2 (free)
nvidia
NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and...
- Context window
- 128,000 tokens
- Input cost
- $0.04 / 1M
- Output cost
- $0.16 / 1M
- Latency (p50)
- —
