llama-3.1-8b-instant
groq
Meta's Llama 3.1 8B, served on Groq's LPU for low-latency inference and suited to fast, lightweight text tasks.
- Context window: 131,072 tokens
- Input cost: not listed
- Output cost: not listed
- Latency (p50): not listed
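As a rough illustration of what the 131,072-token context window means in practice, the sketch below checks whether a request fits. It assumes the window is shared between the prompt and the generated output, which is typical for models of this kind but is not stated in this listing; a provider may also cap output length separately. The helper names `fits_in_context` and `remaining_tokens` are hypothetical and not part of any Groq or Meta API.

```python
# Context window taken from the listing above (131,072 = 128 * 1024 tokens).
CONTEXT_WINDOW_TOKENS = 131_072


def fits_in_context(prompt_tokens: int, max_output_tokens: int,
                    context_window: int = CONTEXT_WINDOW_TOKENS) -> bool:
    """Return True if the prompt plus the reserved output budget fits.

    Assumption: prompt and output tokens share a single window.
    """
    if prompt_tokens < 0 or max_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return prompt_tokens + max_output_tokens <= context_window


def remaining_tokens(prompt_tokens: int,
                     context_window: int = CONTEXT_WINDOW_TOKENS) -> int:
    """Return how many tokens are left for output after the prompt."""
    if prompt_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return max(context_window - prompt_tokens, 0)


if __name__ == "__main__":
    print(fits_in_context(100_000, 8_000))
    print(fits_in_context(130_000, 2_000))
    print(remaining_tokens(131_000))
```

Token counts must come from the model's own tokenizer; character or word counts are only approximations.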