modelstop.top

Compare Models

Run side-by-side checks for pricing, context window, and latency.

Llama-3.2-1B-Instruct-Q8_0-GGUF

hugging-quants

Open-source Llama-3.2-1B-Instruct-Q8_0-GGUF model from hugging-quants — available for download and self-hosting on Hugging Face.

Context window
tokens
Input cost
Output cost
Latency (p50)