meta-llamamodel

Llama-2-7b-hf

Open-source Llama-2-7b-hf model from meta-llama — available for download and self-hosting on Hugging Face.

text free

Best for

General ChatSummarisationContent Generation

Use this model Try in Playground Compare →Alternatives →

Context Window

—

Input Cost

Free

Output Cost

—

Latency p50

—

Pricing Details

Standard PricingFree tier

Input (per 1M tokens)

$0.00

Output (per 1M tokens)

$0.00

Run it locally

Local run command

pip install transformers accelerate && python -c "from transformers import AutoModelForCausalLM, AutoTokenizer; tok = AutoTokenzier.from_pretrained('meta-llama/Llama-2-7b-hf'); model = AutoModelForCausalLM.from_pretrained('meta-llama/Llama-2-7b-hf'); print('Loaded meta-llama/Llama-2-7b-hf')"

Presumptive Specs:

GPU: 10–16 GB VRAM; System RAM: 16+ GB; Disk: 20+ GB free

Reference links

huggingface.co/meta-llama/Llama-2-7b-hf huggingface.co/docs/transformers/installation

Hallucination Score™ (est.)

Community reliability estimate · not official

—

Not yet rated

About this score: Community-estimated based on user reports and publicly available benchmark data (e.g. TruthfulQA). This is not an official score from the model provider. Scores may be inaccurate — always verify with the official leaderboard before making production decisions.

Price History

Not enough historical data yet. Check back after the next pricing sync.

Provider

meta-llama

Usage & Examples

Learn how to use Llama-2-7b-hf

✓Complex reasoning and analysis

✓Customer support automation

✓Content generation and editing

✓Code review and debugging

✓Research summarization

💡 Tip: Start with the sample prompts above to see how Llama-2-7b-hf works best.

Community Prompts

Proven prompts shared by the community for this model

Loading prompts…