llama-2-7b-chat-fp16
meta
Full precision (fp16) generative text model with 7 billion parameters from Meta
- Context window
- 4,096 tokens
- Input cost
- $0.56 / 1M
- Output cost
- $6.67 / 1M
- Latency (p50)
- —
Run side-by-side checks for pricing, context window, and latency.
meta
Full precision (fp16) generative text model with 7 billion parameters from Meta