llama3.1-8b-pruned-60pct
kalebt
llama3.1-8b-pruned-60pct is an open-source model from kalebt, available for download and self-hosting on Hugging Face.
- Context window (tokens): not listed
- Input cost: not listed
- Output cost: not listed
- Latency (p50): not listed