llama-2-7b-chat-int8
meta
Quantized (int8) generative text model with 7 billion parameters from Meta
- Context window: 8,192 tokens
- Input cost: $0.00 / 1M tokens
- Output cost: $0.00 / 1M tokens
- Latency (p50): not available