modelstop.top

Compare Models

Run side-by-side checks for pricing, context window, and latency.

Llama 3.1 Nemotron 70B Instruct HF

nvidia

Llama 3.1 Nemotron 70B Instruct HF — Meta's Llama open-source language model, one of the most widely deployed open models.

Context window
32,768 tokens
Input cost
$0.88 / 1M
Output cost
$0.88 / 1M
Latency (p50)