modelstop.top

Compare Models

Run side-by-side checks for pricing, context window, and latency.

llama-2-7b-chat-int8

Meta

Quantized (int8) generative text model with 7 billion parameters from Meta

Context window: 8,192 tokens
Input cost: $0.00 / 1M tokens
Output cost: $0.00 / 1M tokens
Latency (p50)