The most comprehensive directory of AI models, providers, and agents. Updated daily.

Business contact

Support: support@modelstop.top

Enquiries: hello@modelstop.top

Billing: billing@modelstop.top

Privacy: privacy@modelstop.top

Legal: legal@modelstop.top

© 2026 modelstop.top. All rights reserved. Updated daily · 4695+ models indexed

AI Model Catalogue

Browse 279 models across providers, modalities, and use cases.

🎨 Image Generation

279 models · Page 8 of 8

stems-separator

Separate stems from a song using Demucs and Spleeter

lyria-3-pro

Generate full-length songs up to 3 minutes from text prompts or images with Lyria 3 Pro, Google's most capable music generation model

vision · image · free

depth-anything-v3-metric-pano

Monocular metric depth estimation for panoramic images

kling-v2.6-motion-control

Enables precise control of character actions and expressions from a reference image.

imagen-4-ultra

Use this ultra version of Imagen 4 when quality matters more than speed and cost

irwin-image-lora

lyria-3

Generate 30-second music clips from text prompts or images with Lyria 3, Google's music generation model

vision · image · free

ernie-image-turbo

ERNIE-Image is an open text-to-image generation model developed by the ERNIE-Image team at Baidu

vision · image · free

q3-turbo

Fast video generation with text-to-video, image-to-video, and start-end-to-video modes. Up to 16 seconds at 1080p with synchronized audio.

vision · image · audio

wan-2.7-i2v

Generate videos from images, with support for first-and-last-frame control, clip continuation, and audio synchronization using Alibaba's Wan 2.7 model

vision · image · audio

kling-v2.5-turbo-pro

Kling 2.5 Turbo Pro: Unlock pro-level text-to-video and image-to-video creation with smooth motion, cinematic depth, and remarkable prompt adherence.

sam3-video

A unified foundation model for prompt-based segmentation in images and videos

p-video

Fast video generation with built-in draft mode for rapid creative iteration. Text-to-video, image-to-video, and audio-to-video in a single endpoint.

vision · image · audio

wan-2.7-image-pro

Generate and edit high-quality images with Alibaba's Wan 2.7 Pro with 4K output, thinking mode, text-to-image, multi-image editing, and image set generation

vision · image · reasoning

seedream-4.5

Seedream 4.5: Upgraded Bytedance image model with stronger spatial understanding and world knowledge

metric3dv2

Metric3D v2 (TPAMI 2024): Monocular metric depth and surface normals from a single image. Predicts real-world depth in meters. Works indoor and outdoor.

flux-fill-pro

black-forest-labs

Professional inpainting and outpainting model with state-of-the-art performance. Edit or extend images with natural, seamless results.

wan-2.7-r2v

Generate videos from reference images or clips while preserving subject identity using Alibaba's Wan 2.7 reference-to-video model

vision · image · free

reframe-image

Change the aspect ratio of any photo using AI (not cropping)

p-image-edit

A sub-1-second, $0.01 multi-image editing model built for production use cases. For image generation, see p-image: https://replicate.com/prunaai/p-image

vision · image · free

lucy-edit-2

Edit and transform videos with text prompts and reference images. Style transfers, object replacement, character transformation, and more.

grok-imagine-r2v

Generate videos guided by reference images using xAI's Grok Imagine Video model

vision · image · free

flux-2-flex

black-forest-labs

Max-quality image generation and editing with support for ten reference images

vision · image · free

p-image-upscale

Fastest image upscaler in the world (<1s) supporting outputs up to 128 MP.

Stable Diffusion 3.5 Large

Stable Diffusion 3.5 Large is Stability AI's most capable text-to-image model, delivering photorealistic and creative imagery with excellent prompt adherence and detail. Features multimodal diffusion transformer architecture.

vision · open-source

Output: $0.0000/1M

Amazon Nova Pro

Amazon Nova Pro is a highly capable multimodal model with the best combination of accuracy, speed, and cost across a wide range of tasks. Supports text, image, and video inputs.

vision · multimodal · long-context

Input: $0.8000/1M

Output: $3.2000/1M

📏 300k context

Amazon Nova Lite

Amazon Nova Lite is a very low-cost multimodal model that can process image, video, and text inputs. Fast and accurate for a wide range of tasks requiring visual and language understanding.

vision · multimodal · cheap

Input: $0.0600/1M

Output: $0.2400/1M

📏 300k context
