modelstop.top

Compare Models

Run side-by-side checks for pricing, context window, and latency.

llama-3.3-70b-versatile

groq

Meta's Llama 3.3 70B — latest iteration with improved instruction following, served on Groq LPU.

Context window
131,072 tokens
Input cost
Output cost
Latency (p50)