nvidia/Llama-3.1-Nemotron-70B-Instruct
deepinfra
NVIDIA's Llama 3.1 Nemotron 70B Instruct — fine-tuned for helpfulness and aligned with human preferences.
- Context window
- — tokens
- Input cost
- —
- Output cost
- —
- Latency (p50)
- —
Run side-by-side checks for pricing, context window, and latency.
deepinfra
NVIDIA's Llama 3.1 Nemotron 70B Instruct — fine-tuned for helpfulness and aligned with human preferences.