modelstop.top

Compare Models

Run side-by-side checks for pricing, context window, and latency.

meta-llama/Meta-Llama-3.1-8B-Instruct

deepinfra

Meta Llama 3.1 8B Instruct on DeepInfra — fast, affordable open-source model with 128K context.

Context window
tokens
Input cost
Output cost
Latency (p50)