modelstop.top
Home/All Models

AI Model Catalogue

Browse 22 models across providers, modalities, and use cases.

πŸ“„ Long Context

22 models Β· Page 1 of 1

gemma-4-26b-a4b-it

google

Gemma 4 is Google's most intelligent family of open models, built from Gemini 3 research to maximize intelligence-per-parameter.

textcheaplong-context
256,000 ctx$0.10/1M in
Explore specs and pricingView details β†’

Gemma 3 4B IT

google

Gemma 3 4B IT β€” available via AWS Bedrock (us-east-1).

textvisionmultimodal
131,072 ctxFree in
Explore specs and pricingView details β†’

Gemma 3 27B PT

google

Gemma 3 27B PT β€” available via AWS Bedrock (us-east-1).

textvisionmultimodal
131,072 ctxFree in
Explore specs and pricingView details β†’

Google: Gemini 2.0 Flash

google

Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while maintaining quality on par with larger models like [Gemini Pro 1.5](/google/gemini-pro-1.5). It...

textvisionmultimodal
1,000,000 ctx$0.10/1M in
Explore specs and pricingView details β†’

Google: Gemini 2.0 Flash Lite

google

Gemini 2.0 Flash Lite offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while maintaining quality on par with larger models like [Gemini Pro 1.5](/google/gemini-pro-1.5),...

textvisionmultimodal
1,048,576 ctx$0.07/1M in
Explore specs and pricingView details β†’

Google: Gemma 3 27B

google

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

textvisionmultimodal
Run locally
131,072 ctx$0.08/1M in
Explore specs and pricingView details β†’

Google: Gemma 3 12B

google

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

textvisionmultimodal
Run locally
131,072 ctx$0.04/1M in
Explore specs and pricingView details β†’

Google: Gemma 3 4B

google

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

textvisionmultimodal
Run locally
131,072 ctx$0.04/1M in
Explore specs and pricingView details β†’

Google: Gemini 2.5 Pro Preview 05-06

google

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs β€œthinking” capabilities, enabling it to reason through responses with enhanced accuracy...

textvisionmultimodal
Run locally
1,048,576 ctx$1.25/1M in
Explore specs and pricingView details β†’

Google: Gemini 2.5 Pro Preview 06-05

google

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs β€œthinking” capabilities, enabling it to reason through responses with enhanced accuracy...

textvisionmultimodal
Run locally
1,048,576 ctx$1.25/1M in
Explore specs and pricingView details β†’

Google: Gemini 2.5 Pro

google

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs β€œthinking” capabilities, enabling it to reason through responses with enhanced accuracy...

textvisionmultimodal
Run locally
1,048,576 ctx$1.25/1M in
Explore specs and pricingView details β†’

Google: Gemini 2.5 Flash

google

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...

textvisionmultimodal
Run locally
1,048,576 ctx$0.30/1M in
Explore specs and pricingView details β†’

Google: Gemini 2.5 Flash Lite

google

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

textvisionimage
1,048,576 ctx$0.10/1M in
Explore specs and pricingView details β†’

Google: Gemini 2.5 Flash Lite Preview 09-2025

google

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

textvisionimage
1,048,576 ctx$0.10/1M in
Explore specs and pricingView details β†’

Google: Gemini 3 Flash Preview

google

Gemini 3 Flash Preview is a high speed, high value thinking model designed for agentic workflows, multi turn chat, and coding assistance. It delivers near Pro level reasoning and tool...

textvisionmultimodal
1,048,576 ctx$0.50/1M in
Explore specs and pricingView details β†’

Google: Gemini 3.1 Pro Preview

google

Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows. Building on the multimodal foundation...

textvisionmultimodal
1,048,576 ctx$2.00/1M in
Explore specs and pricingView details β†’

Google: Gemini 3.1 Pro Preview Custom Tools

google

Gemini 3.1 Pro Preview Custom Tools is a variant of Gemini 3.1 Pro that improves tool selection behavior by preventing overuse of a general bash tool when more efficient third-party...

textvisionmultimodal
1,048,576 ctx$2.00/1M in
Explore specs and pricingView details β†’

Google: Gemini 3.1 Flash Lite Preview

google

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across...

textvisionmultimodal
1,048,576 ctx$0.25/1M in
Explore specs and pricingView details β†’

Google: Lyria 3 Clip Preview

google

30 second duration clips are priced at $0.04 per clip. Lyria 3 is Google's family of music generation models, available through the Gemini API. With Lyria 3, you can generate...

textvisionimage
1,048,576 ctx$0.00/1M in
Explore specs and pricingView details β†’

Google: Lyria 3 Pro Preview

google

Full-length songs are priced at $0.08 per song. Lyria 3 is Google's family of music generation models, available through the Gemini API. With Lyria 3, you can generate high-quality, 48kHz...

textvisionimage
1,048,576 ctx$0.00/1M in
Explore specs and pricingView details β†’

Google: Gemma 4 31B (free)

google

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...

textvisionmultimodal
262,144 ctx$0.14/1M in
Explore specs and pricingView details β†’

Google: Gemma 4 26B A4B (free)

google

Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B activate per token during inference β€” delivering near-31B quality at...

textvisionmultimodal
262,144 ctx$0.12/1M in
Explore specs and pricingView details β†’