llama-3.1-8b-instruct-fp8
meta
Llama 3.1 8B quantized to FP8 precision
- Context window
- 32,000 tokens
- Input cost
- $0.15 / 1M
- Output cost
- $0.29 / 1M
- Latency (p50)
- —
Run side-by-side checks for pricing, context window, and latency.
meta
Llama 3.1 8B quantized to FP8 precision