Playground Find a Model ⚡ Pro Tools Pulse API Advertise PricingLoading...

Loading...

The most comprehensive directory of AI models, providers, and agents. Updated daily.

Explore

All Models
Collections
Leaderboard
Compare
Pro Tools
Pulse Feed
API Docs

Categories

Language Models
Inference Providers
Agents & SaaS
Open Source

Stay Updated

Weekly digest of new models and price changes.

Business contact

Support: support@modelstop.top

Enquiries: hello@modelstop.top

Billing: billing@modelstop.top

Privacy: privacy@modelstop.top

Legal: legal@modelstop.top

© 2026 modelstop.top. All rights reserved.Updated daily · 4695+ models indexed

Home/All Models

AI Model Catalogue

Browse 120 models across providers, modalities, and use cases.

🌐All Models 💬Text Generation 💻Code & Reasoning 👁️Vision & Multimodal 🎨Image Generation 🎙️Audio & Speech 🤖Agents & Tools 📄Long Context 🆓Free & Open

🌍Multilingual

Providers:⚡OpenAI 🔷Anthropic 🔍Google 🦙Meta 🌀Mistral ✕xAI 🚀Groq 🐋DeepSeek 🌐Cohere ☁️Amazon

Filter & Sort

🌐 All Models

120 models · Page 3 of 4

Google Imagen 4.0 Fast

Explore specs and pricingView details →

imagen-3-fast

A faster and cheaper Imagen 3 model, for when price or speed are more important than final image quality

Explore specs and pricingView details →

imagen-4

Google's Imagen 4 flagship model

Explore specs and pricingView details →

imagen-3

Google's highest quality text-to-image model, capable of generating images with detail, rich lighting and beauty

visionimagefree

Explore specs and pricingView details →

imagen-4-fast

Use this fast version of Imagen 4 when speed and cost are more important than quality

Explore specs and pricingView details →

gemini-2.5-flash-image

Google's latest image generation model in Gemini 2.5

visionimagefree

Explore specs and pricingView details →

nano-banana

Google's latest image editing model in Gemini 2.5

Explore specs and pricingView details →

upscaler

Upscale images 2x or 4x times

Explore specs and pricingView details →

gemini-2.5-flash

Google’s hybrid “thinking” AI model optimized for speed and cost-efficiency

textreasoningfree

Explore specs and pricingView details →

nano-banana-2

Google's fast image generation model with conversational editing, multi-image fusion, and character consistency

visionimagefree

Explore specs and pricingView details →

gemini-3.1-pro

Google's most intelligent model, with improved reasoning and a new medium thinking level

textreasoningfree

Explore specs and pricingView details →

nano-banana-pro

Google's state of the art image generation and editing model 🍌🍌

visionimagefree

Explore specs and pricingView details →

veo-3.1

New and improved version of Veo 3, with higher-fidelity video, context-aware audio, reference image and last frame support

visionaudiofree

Explore specs and pricingView details →

veo-3.1-fast

New and improved version of Veo 3 Fast, with higher-fidelity video, context-aware audio and last frame support

Explore specs and pricingView details →

lyria-3-pro

Generate full-length songs up to 3 minutes from text prompts or images with Lyria 3 Pro, Google's most capable music generation model

visionimagefree

Explore specs and pricingView details →

veo-3.1-lite

Google's cost-efficient video generation model with native audio, optimized for high-volume applications

Explore specs and pricingView details →

imagen-4-ultra

Use this ultra version of Imagen 4 when quality matters more than speed and cost

Explore specs and pricingView details →

lyria-3

Generate 30-second music clips from text prompts or images with Lyria 3, Google's music generation model

visionimagefree

Explore specs and pricingView details →

gemini-3.1-flash-tts

Google's fast, expressive text-to-speech model with 30 voices and 70+ language support

Input$5.0000/1M

Output$15.0000/1M

⭐1242.0%score

Explore specs and pricingView details →

gemma-3-1b-it

Open-source gemma-3-1b-it model from google — available for download and self-hosting on Hugging Face.

Explore specs and pricingView details →

google/pegasus-xsum

google/pegasus-xsum is a summarization model on Hugging Face with ~310,440 monthly downloads. Open access.

Output$0.0000/1M

Explore specs and pricingView details →

google/madlad400-3b-mt

google/madlad400-3b-mt is a translation model on Hugging Face with ~316,407 monthly downloads. Open access.

Output$0.0000/1M

Explore specs and pricingView details →

t5gemma-s-s-prefixlm

Open-source t5gemma-s-s-prefixlm model from google — available for download and self-hosting on Hugging Face.

Output$0.0000/1M

Explore specs and pricingView details →

Google: Gemma 2 9B

Gemma 2 9B by Google is an advanced, open-source language model that sets a new standard for efficiency and performance in its size class. Designed for a wide variety of...

Input$0.0300/1M

Output$0.0900/1M

Explore specs and pricingView details →

Google: Gemma 2 27B

Gemma 2 27B by Google is an open model built from the same research and technology used to create the [Gemini models](/models?q=gemini). Gemma models are well-suited for a variety of...

Input$0.6500/1M

Output$0.6500/1M

Explore specs and pricingView details →

Google: Gemini 2.0 Flash

Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while maintaining quality on par with larger models like [Gemini Pro 1.5](/google/gemini-pro-1.5). It...

textvisionmultimodal

Input$0.1000/1M

Output$0.4000/1M

📏1000kcontext

Explore specs and pricingView details →

Google: Gemini 2.0 Flash Lite

Gemini 2.0 Flash Lite offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while maintaining quality on par with larger models like [Gemini Pro 1.5](/google/gemini-pro-1.5),...

textvisionmultimodal

Input$0.0750/1M

Output$0.3000/1M

📏1049kcontext

⭐1242.0%score

Explore specs and pricingView details →

Google: Gemma 3 27B

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

textvisionmultimodal

Input$0.0800/1M

Output$0.1600/1M

📏131kcontext

Explore specs and pricingView details →

Google: Gemma 3 12B

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

textvisionmultimodal

Input$0.0400/1M

Output$0.1300/1M

📏131kcontext

Explore specs and pricingView details →

Google: Gemma 3 4B

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

textvisionmultimodal

Input$0.0400/1M

Output$0.0800/1M

📏131kcontext

Explore specs and pricingView details →

Google: Gemini 2.5 Pro Preview 05-06

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

textvisionmultimodal

📏1049kcontext

Explore specs and pricingView details →

Google: Gemma 3n 4B

Gemma 3n E4B-it is optimized for efficient execution on mobile and low-resource devices, such as phones, laptops, and tablets. It supports multimodal inputs—including text, visual data, and audio—enabling diverse tasks...

textvisionaudio

Input$0.0200/1M

Output$0.0400/1M

Explore specs and pricingView details →

Google: Gemini 2.5 Pro Preview 06-05

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

textvisionmultimodal

📏1049kcontext

Explore specs and pricingView details →

Google: Gemini 2.5 Pro

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

textvisionmultimodal

📏1049kcontext

Explore specs and pricingView details →

Google: Gemini 2.5 Flash

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...

textvisionmultimodal

📏1049kcontext

Explore specs and pricingView details →

Google: Gemini 2.5 Flash Lite

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

textvisionimage

Input$0.1000/1M

Output$0.4000/1M

📏1049kcontext

Explore specs and pricingView details →

← Prev 1 2 3 4 Next →