modelstop.top

Compare Models

Run side-by-side checks for pricing, context window, and latency.

llama-2-7b-chat-fp16

Meta

Unquantized fp16 (16-bit, half-precision floating point) generative text model with 7 billion parameters from Meta

Context window
4,096 tokens
Input cost
$0.56 / 1M tokens
Output cost
$6.67 / 1M tokens
Latency (p50)