Back to models
meta-llamamodel
Llama-3.1-8B-Instruct
Open-source Llama-3.1-8B-Instruct model from meta-llama — available for download and self-hosting on Hugging Face.
Best for
Instruction FollowingGeneral ChatBulk Data ExtractionHigh-Volume Tasks
Context Window
131K tokens ≈ 291 pages of text
Input Cost
Free
Output Cost
—
Latency p50
—
Pricing Details
No pricing data. Model may be free or requires direct access.
Run it locally
Local run command
pip install transformers accelerate && python -c "from transformers import AutoModelForCausalLM, AutoTokenizer; tok = AutoTokenzier.from_pretrained('meta-llama/Llama-3.1-8B-Instruct'); model = AutoModelForCausalLM.from_pretrained('meta-llama/Llama-3.1-8B-Instruct'); print('Loaded meta-llama/Llama-3.1-8B-Instruct')"Presumptive Specs:
GPU: 10–16 GB VRAM; System RAM: 16+ GB; Disk: 20+ GB free
Hallucination Score™ (est.)
Community reliability estimate · not official
—
Not yet rated
About this score: Community-estimated based on user reports and publicly available benchmark data (e.g. TruthfulQA). This is not an official score from the model provider. Scores may be inaccurate — always verify with the official leaderboard before making production decisions.
Price History
Not enough historical data yet. Check back after the next pricing sync.
Provider
meta-llama
Community Prompts
Proven prompts shared by the community for this model
Loading prompts…
