modelstop.top
Back to models
meta-llamamodel

Llama-2-7b-hf

Open-source Llama-2-7b-hf model from meta-llama — available for download and self-hosting on Hugging Face.

Best for

General ChatSummarisationContent Generation
Context Window
Input Cost
Free
Output Cost
Latency p50

Pricing Details

Standard PricingFree tier
Input (per 1M tokens)
$0.00
Output (per 1M tokens)
$0.00

Run it locally

Local run command

pip install transformers accelerate && python -c "from transformers import AutoModelForCausalLM, AutoTokenizer; tok = AutoTokenzier.from_pretrained('meta-llama/Llama-2-7b-hf'); model = AutoModelForCausalLM.from_pretrained('meta-llama/Llama-2-7b-hf'); print('Loaded meta-llama/Llama-2-7b-hf')"

Presumptive Specs:

GPU: 10–16 GB VRAM; System RAM: 16+ GB; Disk: 20+ GB free

Hallucination Score™ (est.)

Community reliability estimate · not official

Not yet rated

About this score: Community-estimated based on user reports and publicly available benchmark data (e.g. TruthfulQA). This is not an official score from the model provider. Scores may be inaccurate — always verify with the official leaderboard before making production decisions.

Price History

Not enough historical data yet. Check back after the next pricing sync.

Provider

meta-llama

Community Prompts

Proven prompts shared by the community for this model

Loading prompts…
Llama-2-7b-hf — modelstop.top