Llama-3.2-1B-Instruct-Q8_0-GGUF
hugging-quants
Open-source Llama-3.2-1B-Instruct-Q8_0-GGUF model from hugging-quants — available for download and self-hosting on Hugging Face.
- Context window
- — tokens
- Input cost
- —
- Output cost
- —
- Latency (p50)
- —
Run side-by-side checks for pricing, context window, and latency.
hugging-quants
Open-source Llama-3.2-1B-Instruct-Q8_0-GGUF model from hugging-quants — available for download and self-hosting on Hugging Face.