modelstop.top
Back to models
groqmodel

llama-3.1-8b-instant

Meta's Llama 3.1 8B served on Groq's LPU for ultra-low latency — ideal for fast, lightweight text tasks.

Best for

Long DocumentsBook SummarisationRAG
Context Window
131K tokens ≈ 291 pages of text
Input Cost
$0.05/1M
Output Cost
$0.08/1M
Latency p50

Pricing Details

Standard Pricing
Input (per 1M tokens)
$0.05
Output (per 1M tokens)
$0.08

Hallucination Score™ (est.)

Community reliability estimate · not official

72
Generally reliable

About this score: Community-estimated based on user reports and publicly available benchmark data (e.g. TruthfulQA). This is not an official score from the model provider. Scores may be inaccurate — always verify with the official leaderboard before making production decisions.

Price History

Not enough historical data yet. Check back after the next pricing sync.

Provider

groq

Community Prompts

Proven prompts shared by the community for this model

Loading prompts…