Playground Find a Model ⚡ Pro Tools Pulse API Advertise PricingLoading...

The most comprehensive directory of AI models, providers, and agents. Updated daily.

Explore

All Models
Collections
Leaderboard
Compare
Pro Tools
Pulse Feed
API Docs

Stay Updated

Weekly digest of new models and price changes.

Business contact

Support: support@modelstop.top

Enquiries: hello@modelstop.top

Billing: billing@modelstop.top

Privacy: privacy@modelstop.top

Legal: legal@modelstop.top

llama-3.3-70b-instruct-fp8-fast — modelstop.top

Back to models

metamodel

llama-3.3-70b-instruct-fp8-fast

Llama 3.3 70B quantized to fp8 precision, optimized to be faster.

text instruct cheap

Best for

Instruction FollowingGeneral ChatBulk Data ExtractionHigh-Volume Tasks

Try in Playground Compare →Alternatives →

Context Window

24K tokens ≈ 53 pages of text

Input Cost

$0.29/1M

Output Cost

$2.25/1M

Latency p50

196ms

Pricing Details

Standard Pricing

Input (per 1M tokens)

$0.29

Output (per 1M tokens)

$2.25

Price history

4/23/2026$0.29 in / $2.25 out

Hallucination Score™ (est.)

Community reliability estimate · not official

—

Not yet rated

About this score: Community-estimated based on user reports and publicly available benchmark data (e.g. TruthfulQA). This is not an official score from the model provider. Scores may be inaccurate — always verify with the official leaderboard before making production decisions.

Price HistoryPro

Input & output cost per 1M tokens over time

Price history is a Pro feature

Track pricing trends and catch price drops early.

Upgrade to Pro — $19/mo →

Provider

Usage & Examples

Learn how to use llama-3.3-70b-instruct-fp8-fast

✓Complex reasoning and analysis

✓Customer support automation

✓Content generation and editing

✓Code review and debugging

✓Research summarization

💡 Tip: Start with the sample prompts above to see how llama-3.3-70b-instruct-fp8-fast works best.

Community Prompts

Proven prompts shared by the community for this model

Loading prompts…