modelstop.top
Back to models
meta-llamamodel

llama-2-7b-chat-hf-lora

This is a Llama2 base model that Cloudflare dedicated for inference with LoRA adapters. Llama 2 is a collection of pretrained and fine-tuned generative text models ranging in scale from 7 billion to 70 billion parameters. This is the repository for the 7B fine-tuned model, optimized for dialogue use cases and converted for the Hugging Face Transformers format.

Best for

General ChatSummarisationContent Generation
Context Window
8K tokens ≈ 18 pages of text
Input Cost
Free
Output Cost
Latency p50

Pricing Details

Standard PricingFree tier
Input (per 1M tokens)
$0.00
Output (per 1M tokens)
$0.00

Hallucination Score™ (est.)

Community reliability estimate · not official

Not yet rated

About this score: Community-estimated based on user reports and publicly available benchmark data (e.g. TruthfulQA). This is not an official score from the model provider. Scores may be inaccurate — always verify with the official leaderboard before making production decisions.

Price History

Not enough historical data yet. Check back after the next pricing sync.

Provider

meta-llama

Community Prompts

Proven prompts shared by the community for this model

Loading prompts…