modelstop.top

AI Model Catalogue

Browse 103 models across providers, modalities, and use cases.

🎨 Image Generation

103 models · Page 1 of 3

imagen-4-fast

google

Use this fast version of Imagen 4 when speed and cost are more important than quality

vision · free

p-image-trainer

prunaai

Fast LoRA trainer for p-image, a super fast text-to-image model developed by Pruna AI. Use LoRAs here: https://replicate.com/prunaai/p-image-lora. Find or contribute LoRAs here: https://huggingface.co/collections/PrunaAI/p-image

vision · image · free

generate-background

bria

Bria Background Generation allows for efficient swapping of backgrounds in images via text prompts or a reference image, delivering realistic and polished results. Trained exclusively on licensed data for safe and risk-free commercial use.

vision · image · free

riverflow-2.0-pro

sourceful

Agentic image model optimized for robust, high-precision generations supporting font control

vision · image · agents

image-3.2

bria

A commercial-ready text-to-image model trained entirely on licensed data. With only 4B parameters, it provides exceptional aesthetics and text rendering. Evaluated to be on par with other leading models on the market.

vision · image · free

fabric-1.0

veed

VEED Fabric 1.0 is an image-to-video API that turns any image into a talking video

vision · free

imagen-3

google

Google's highest quality text-to-image model, capable of generating images with detail, rich lighting and beauty

vision · image · free

increase-resolution

bria

Bria Increase Resolution upscales any image using a dedicated upscaling method that preserves the original image content without regeneration.

vision · image · free

nano-banana

google

Google's latest image editing model in Gemini 2.5

vision · free

firered-image-edit

prunaai

FireRed-Image-Edit is a general-purpose image editing model that delivers high-fidelity and consistent editing across a wide range of scenarios.

vision · free

fibo

bria

SOTA open-source model trained on licensed data, transforming intent into structured control for precise, high-quality AI image generation in enterprise and agentic workflows.

vision · image · agents

gemini-2.5-flash-image

google

Google's latest image generation model in Gemini 2.5

vision · image · free

p-image-edit-lora

prunaai

Use LoRAs trained with https://replicate.com/prunaai/p-image-edit-trainer. Find or contribute LoRAs here: https://huggingface.co/collections/PrunaAI/p-image-edit-loras.

vision · free

fibo-edit

bria

FIBO-Edit brings the power of structured prompt generation to image editing

vision · image · free

upscaler

google

Upscale images 2x or 4x

vision · free

imagen-4

google

Google's Imagen 4 flagship model

vision · free

dreamactor-m2.0

bytedance

Animate any character (humans, cartoons, animals, even non-humans) from a single image and a driving video

vision · free

image-colorization

topazlabs

Image colorization model from Topaz Labs

vision · free

imagen-3-fast

google

A faster and cheaper Imagen 3 model, for when price or speed is more important than final image quality

vision · free

eraser

bria

SOTA object removal that enables precise removal of unwanted objects from images while maintaining high-quality output. Trained exclusively on licensed data for safe and risk-free commercial use.

vision · free

grok-imagine-image

xai

SOTA image model from xAI

vision · free

wan2.6-i2v-flash

wan-video

Image-to-video generation with optional audio, multi-shot narrative support, and faster inference

vision · image · audio

qwen-image-2

qwen

A next-generation image generation and editing model from Alibaba's Qwen team. Supports text-to-image and image editing with strong text rendering, especially for Chinese.

vision · image · free

recraft-crisp-upscale

recraft-ai

Designed to make images sharper and cleaner, Crisp Upscale increases overall quality, making visuals suitable for web use or print-ready materials.

vision · free

nano-banana-2-transparent

jide

Nano Banana 2 with alpha transparency. Generates images with real RGBA transparency using triangulation matting: clean edges, proper semi-transparency, and accurate colors. Powered by Google Gemini 3.1 Flash Image.

vision · image · free

seedream-5-lite

bytedance

Seedream 5.0 lite: image generation with built-in reasoning, example-based editing, and deep domain knowledge

vision · image · reasoning

kling-v3-omni-video

kwaivgi

Kling Video 3.0 Omni: Unified multimodal video generation with reference images, video editing, native audio, and multi-shot control

vision · image · audio

flux-lora-fashion-plate

vestigia

FLUX LoRA trained on 18th century fashion plates. Hand-coloured engraved illustrations, clean white backgrounds, elegant elongated figures, precise draughtsmanship.

vision · free

recraft-v4-pro

recraft-ai

Recraft's latest image generation model at ~2048px resolution. Same design taste and prompt accuracy as V4, with higher resolution for print-ready and large-scale work.

vision · image · free

flux-lora-first-empire-paintings

vestigia

FLUX LoRA trained on French First Empire paintings. Napoleonic state portraiture: generals in gleaming uniforms, imperial ceremonies, neoclassical grandeur.

vision · free

nano-banana-2

google

Google's fast image generation model with conversational editing, multi-image fusion, and character consistency

vision · image · free

ltx-2.3-pro

lightricks

High-fidelity video generation with portrait support, audio-to-video, retake, and extend. Text, image, and audio-driven creation up to 4K at 50 FPS.

vision · image · audio

p-image

prunaai

A sub-1-second text-to-image model built for production use cases.

vision · image · free

nano-banana-pro

google

Google's state-of-the-art image generation and editing model 🍌🍌

vision · image · free

sharp-ml

kfarr

Apple's SHARP model: single image to 3D Gaussian splats

vision · free

z-image-turbo

prunaai

Z-Image Turbo is a super fast 6B-parameter text-to-image model developed by Tongyi-MAI.

vision · image · free