modelstop.top

Compare Models

Run side-by-side checks for pricing, context window, and latency.

llama-3.1-8b-instruct-fp8

meta

Llama 3.1 8B quantized to FP8 precision

Context window
32,000 tokens
Input cost
$0.15 / 1M
Output cost
$0.29 / 1M
Latency (p50)