Playground Find a Model ⚡ Pro Tools Pulse API Advertise PricingLoading...

The most comprehensive directory of AI models, providers, and agents. Updated daily.

Explore

All Models
Collections
Leaderboard
Compare
Pro Tools
Pulse Feed
API Docs

Stay Updated

Weekly digest of new models and price changes.

Business contact

Support: support@modelstop.top

Enquiries: hello@modelstop.top

Billing: billing@modelstop.top

Privacy: privacy@modelstop.top

Legal: legal@modelstop.top

Back to models

groqmodel

llama-3.1-8b-instant

Meta's Llama 3.1 8B served on Groq's LPU for ultra-low latency — ideal for fast, lightweight text tasks.

text free long-context

Best for

Long DocumentsBook SummarisationRAG

Use this model Try in Playground Compare →Alternatives →

Context Window

131K tokens ≈ 291 pages of text

Input Cost

$0.05/1M

Output Cost

$0.08/1M

Latency p50

103ms

Pricing Details

Standard Pricing

Input (per 1M tokens)

$0.05

Output (per 1M tokens)

$0.08

Hallucination Score™ (est.)

Community reliability estimate · not official

Generally reliable

About this score: Community-estimated based on user reports and publicly available benchmark data (e.g. TruthfulQA). This is not an official score from the model provider. Scores may be inaccurate — always verify with the official leaderboard before making production decisions.

Price History

Not enough historical data yet. Check back after the next pricing sync.

Provider

groq

Usage & Examples

Learn how to use llama-3.1-8b-instant

✓Complex reasoning and analysis

✓Customer support automation

✓Content generation and editing

✓Code review and debugging

✓Research summarization

💡 Tip: Start with the sample prompts above to see how llama-3.1-8b-instant works best.

Community Prompts

Proven prompts shared by the community for this model

Loading prompts…

llama-3.1-8b-instant — modelstop.top