Google: Gemma 3n 4B (free)
Gemma 3n E4B-it is optimized for efficient execution on mobile and low-resource devices, such as phones, laptops, and tablets. It supports multimodal inputs—including text, visual data, and audio—enabling diverse tasks...
- Context window
- 8,192 tokens
- Input cost
- $0.06 / 1M
- Output cost
- $0.12 / 1M
- Latency (p50)
- —
