modelstop.top

Compare Models

Run side-by-side checks for pricing, context window, and latency.

llama-3.2-3b-instruct

meta

The Llama 3.2 instruction-tuned text only models are optimized for multilingual dialogue use cases, including agentic retrieval and summarization tasks.

Context window
80,000 tokens
Input cost
$0.05 / 1M
Output cost
$0.34 / 1M
Latency (p50)