modelstop.top

Compare Models

Run side-by-side checks for pricing, context window, and latency.

llama-3.2-1b-instruct

Meta

The Llama 3.2 instruction-tuned, text-only models are optimized for multilingual dialogue use cases, including agentic retrieval and summarization tasks.

Context window
60,000 tokens
Input cost
$0.03 / 1M tokens
Output cost
$0.20 / 1M tokens
Latency (p50)