modelstop.top

Compare Models

Run side-by-side checks for pricing, context window, and latency.

Meta: Llama 3.1 8B Instruct

meta-llama

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 8B instruct-tuned version is fast and efficient. It has demonstrated strong performance compared to...

Context window
16,384 tokens
Input cost
$0.02 / 1M
Output cost
$0.05 / 1M
Latency (p50)