modelstop.top

Compare Models

Run side-by-side checks for pricing, context window, and latency.

llama-3.1-8b-instant

groq

Meta's Llama 3.1 8B, served on Groq's LPU for ultra-low latency. Ideal for fast, lightweight text tasks.

Context window: 131,072 tokens
Input cost:
Output cost:
Latency (p50):