Context Window
4K tokens ≈ 9 pages of text
Input Cost
$3.50/1M
Output Cost
$3.50/1M
Latency p50
—
Pricing Details
Standard Pricing
Input (per 1M tokens)
$3.50
Output (per 1M tokens)
$3.50
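As a rough illustration of how the per-1M-token rates above translate into a per-request cost, here is a minimal sketch. The function name and the example token counts are hypothetical; only the two $3.50 rates come from this page, and actual billing may differ (check the provider's pricing).

```python
# Rates copied from the pricing table above (USD per 1M tokens).
INPUT_USD_PER_MILLION = 3.50
OUTPUT_USD_PER_MILLION = 3.50


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request at the standard rates.

    Hypothetical helper for illustration; not part of any provider SDK.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 3,000 prompt tokens and 1,000 completion tokens,
    # which together fill the 4K-token context window.
    print(f"${estimate_cost_usd(3_000, 1_000):.4f}")  # prints $0.0140
```

Because the input and output rates are identical here, the cost depends only on the total token count; the two rates are kept separate so the sketch still works if they diverge.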
Run it locally
Local run command
# Refer to provider docs: https://www.together.xyz/docs
Estimated Specs:
Typical GPU
12–24 GB VRAM
System RAM
16+ GB
Disk
20+ GB free
Runtime
Docker / local runtime
Hallucination Score™ (est.)
Community reliability estimate · not official
72
Generally reliable
About this score: A community estimate based on user reports and publicly available benchmark data (e.g. TruthfulQA). It is not an official score from the model provider. Scores may be inaccurate; always verify against the official leaderboard before making production decisions.
Price History
Not enough historical data yet. Check back after the next pricing sync.
Provider
meta-llama
