modelstop.top
Home/All Models

AI Model Catalogue

Browse 120 models across providers, modalities, and use cases.

🌐 All Models

120 models Β· Page 3 of 4

Google Veo 3.0 Fast + Audio

google

textaudiofree
ctxFree in
Explore specs and pricingView details β†’

imagen-4

google

Google's Imagen 4 flagship model

visionfree
ctxFree in
Explore specs and pricingView details β†’

imagen-4-fast

google

Use this fast version of Imagen 4 when speed and cost are more important than quality

visionfree
ctxFree in
Explore specs and pricingView details β†’

upscaler

google

Upscale images 2x or 4x times

visionfree
Run locally
ctxFree in
Explore specs and pricingView details β†’

gemini-2.5-flash

google

Google’s hybrid β€œthinking” AI model optimized for speed and cost-efficiency

textreasoningfree
ctxFree in
Explore specs and pricingView details β†’

imagen-3-fast

google

A faster and cheaper Imagen 3 model, for when price or speed are more important than final image quality

visionfree
ctxFree in
Explore specs and pricingView details β†’

imagen-3

google

Google's highest quality text-to-image model, capable of generating images with detail, rich lighting and beauty

visionimagefree
ctxFree in
Explore specs and pricingView details β†’

gemini-2.5-flash-image

google

Google's latest image generation model in Gemini 2.5

visionimagefree
ctxFree in
Explore specs and pricingView details β†’

nano-banana

google

Google's latest image editing model in Gemini 2.5

visionfree
ctxFree in
Explore specs and pricingView details β†’

gemini-3.1-pro

google

Google's most intelligent model, with improved reasoning and a new medium thinking level

textreasoningfree
Run locally
ctxFree in
Explore specs and pricingView details β†’

nano-banana-2

google

Google's fast image generation model with conversational editing, multi-image fusion, and character consistency

visionimagefree
ctxFree in
Explore specs and pricingView details β†’

nano-banana-pro

google

Google's state of the art image generation and editing model 🍌🍌

visionimagefree
ctxFree in
Explore specs and pricingView details β†’

lyria-3-pro

google

Generate full-length songs up to 3 minutes from text prompts or images with Lyria 3 Pro, Google's most capable music generation model

visionimagefree
ctxFree in
Explore specs and pricingView details β†’

imagen-4-ultra

google

Use this ultra version of Imagen 4 when quality matters more than speed and cost

visionfree
ctxFree in
Explore specs and pricingView details β†’

lyria-3

google

Generate 30-second music clips from text prompts or images with Lyria 3, Google's music generation model

visionimagefree
ctxFree in
Explore specs and pricingView details β†’

gemini-3.1-flash-tts

google

Google's fast, expressive text-to-speech model with 30 voices and 70+ language support

textfree
ctx$5.00/1M in
Explore specs and pricingView details β†’

veo-3.1-fast

google

New and improved version of Veo 3 Fast, with higher-fidelity video, context-aware audio and last frame support

audiofree
ctxFree in
Explore specs and pricingView details β†’

veo-3.1

google

New and improved version of Veo 3, with higher-fidelity video, context-aware audio, reference image and last frame support

visionaudiofree
ctxFree in
Explore specs and pricingView details β†’

veo-3.1-lite

google

Google's cost-efficient video generation model with native audio, optimized for high-volume applications

audiofree
ctxFree in
Explore specs and pricingView details β†’

gemma-3-1b-it

google

Open-source gemma-3-1b-it model from google β€” available for download and self-hosting on Hugging Face.

textfree
ctxFree in
Explore specs and pricingView details β†’

google/pegasus-xsum

google

google/pegasus-xsum is a summarization model on Hugging Face with ~310,440 monthly downloads. Open access.

open-source
Run locally
ctx$0.00/1M in
Explore specs and pricingView details β†’

google/madlad400-3b-mt

google

google/madlad400-3b-mt is a translation model on Hugging Face with ~316,407 monthly downloads. Open access.

open-source
Run locally
ctx$0.00/1M in
Explore specs and pricingView details β†’

t5gemma-s-s-prefixlm

google

Open-source t5gemma-s-s-prefixlm model from google β€” available for download and self-hosting on Hugging Face.

textfree
Run locally
ctx$0.00/1M in
Explore specs and pricingView details β†’

Google: Gemma 2 9B

google

Gemma 2 9B by Google is an advanced, open-source language model that sets a new standard for efficiency and performance in its size class. Designed for a wide variety of...

cheap
8,192 ctx$0.03/1M in
Explore specs and pricingView details β†’

Google: Gemma 2 27B

google

Gemma 2 27B by Google is an open model built from the same research and technology used to create the [Gemini models](/models?q=gemini). Gemma models are well-suited for a variety of...

textcheap
Run locally
8,192 ctx$0.65/1M in
Explore specs and pricingView details β†’

Google: Gemini 2.0 Flash

google

Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while maintaining quality on par with larger models like [Gemini Pro 1.5](/google/gemini-pro-1.5). It...

textvisionmultimodal
1,000,000 ctx$0.10/1M in
Explore specs and pricingView details β†’

Google: Gemini 2.0 Flash Lite

google

Gemini 2.0 Flash Lite offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while maintaining quality on par with larger models like [Gemini Pro 1.5](/google/gemini-pro-1.5),...

textvisionmultimodal
1,048,576 ctx$0.07/1M in
Explore specs and pricingView details β†’

Google: Gemma 3 27B

google

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

textvisionmultimodal
Run locally
131,072 ctx$0.08/1M in
Explore specs and pricingView details β†’

Google: Gemma 3 12B

google

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

textvisionmultimodal
Run locally
131,072 ctx$0.04/1M in
Explore specs and pricingView details β†’

Google: Gemma 3 4B

google

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

textvisionmultimodal
Run locally
131,072 ctx$0.04/1M in
Explore specs and pricingView details β†’

Google: Gemini 2.5 Pro Preview 05-06

google

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs β€œthinking” capabilities, enabling it to reason through responses with enhanced accuracy...

textvisionmultimodal
Run locally
1,048,576 ctx$1.25/1M in
Explore specs and pricingView details β†’

Google: Gemma 3n 4B

google

Gemma 3n E4B-it is optimized for efficient execution on mobile and low-resource devices, such as phones, laptops, and tablets. It supports multimodal inputsβ€”including text, visual data, and audioβ€”enabling diverse tasks...

textvisionaudio
Run locally
32,768 ctx$0.02/1M in
Explore specs and pricingView details β†’

Google: Gemini 2.5 Pro Preview 06-05

google

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs β€œthinking” capabilities, enabling it to reason through responses with enhanced accuracy...

textvisionmultimodal
Run locally
1,048,576 ctx$1.25/1M in
Explore specs and pricingView details β†’

Google: Gemini 2.5 Pro

google

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs β€œthinking” capabilities, enabling it to reason through responses with enhanced accuracy...

textvisionmultimodal
Run locally
1,048,576 ctx$1.25/1M in
Explore specs and pricingView details β†’

Google: Gemini 2.5 Flash

google

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...

textvisionmultimodal
Run locally
1,048,576 ctx$0.30/1M in
Explore specs and pricingView details β†’

Google: Gemini 2.5 Flash Lite

google

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

textvisionimage
1,048,576 ctx$0.10/1M in
Explore specs and pricingView details β†’