gemma-2b-it-lora
This is a Gemma-2B base model that Cloudflare dedicates for inference with LoRA adapters. Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.
- Context window
- 8,192 tokens
- Input cost
- $0.00 / 1M
- Output cost
- $0.00 / 1M
- Latency (p50)
- —
