Compare Models
Run side-by-side checks for pricing, context window, and latency.
Llama 3.1 Nemotron 70B Instruct HF
nvidia
Llama 3.1 Nemotron 70B Instruct HF — Meta's Llama open-source language model, one of the most widely deployed open models.
- Context window
- 32,768 tokens
- Input cost
- $0.88 / 1M
- Output cost
- $0.88 / 1M
- Latency (p50)
- —