modelstop.top

AI Model Catalogue

Browse 273 models across providers, modalities, and use cases.

All Models

273 models · Page 7 of 8

flux-lora-bronzino-painting

vestigia

FLUX LoRA trained on paintings by Bronzino. Cold enamel-smooth surfaces, austere aristocratic sitters, jewelled costumes, Florentine Mannerist style.

vision · free

flux-lora-lawrence-painting

vestigia

FLUX LoRA trained on portraits by Thomas Lawrence, the leading British portraitist of the Regency era. Fluid brushwork, romantic lighting, aristocratic ease.

vision · free

floorplan-recognition

ton731

Segment floorplan images into walls, doors, windows, and kitchen zones using a deep learning model, then extract structured contours and center lines as JSON for downstream applications.

vision · free

expand-image

bria

Bria Expand extends images beyond their borders in high quality, generating new pixels to reach the desired aspect ratio. Trained exclusively on licensed data for safe and risk-free commercial use.

vision · image · free

resnet

bfirsh

Classifies images with ResNet-50

vision · free

recraft-creative-upscale

recraft-ai

Creative Upscale focuses on enhancing details and refining complex elements in the image. It doesn't just increase resolution but adds depth by improving textures, fine details, and facial features.

vision · free

kling-v3-motion-control

kwaivgi

Kling 3.0 motion control: transfer motion from a reference video to any character image with improved consistency and quality.

vision · free

qwen-image-2-pro

qwen

The pro version of Qwen Image 2 from Alibaba's Qwen team. Enhanced text rendering, realism, and semantic adherence for high-quality image generation and editing.

vision · image · free

nano-banana-2

google

Google's fast image generation model with conversational editing, multi-image fusion, and character consistency

vision · image · free

nano-banana-2-transparent

jide

Nano Banana 2 with alpha transparency. Generates images with real RGBA transparency using triangulation matting: clean edges, proper semi-transparency, and accurate colors. Powered by Google Gemini 3.1 Flash Image.

vision · image · free

flux-lora-first-empire-paintings

vestigia

FLUX LoRA trained on French First Empire paintings. Napoleonic state portraiture: generals in gleaming uniforms, imperial ceremonies, neoclassical grandeur.

vision · free

flux-lora-nattier-painting

vestigia

FLUX LoRA trained on paintings by Jean-Marc Nattier. Ladies of the court depicted as goddesses, flowing blue drapery, soft pastel tones, Rococo elegance.

vision · free

flux-lora-fashion-plate

vestigia

FLUX LoRA trained on 18th-century fashion plates. Hand-coloured engraved illustrations, clean white backgrounds, elegant elongated figures, precise draughtsmanship.

vision · free

recraft-vectorize

recraft-ai

Convert raster images to high-quality SVG format with precision and clean vector paths, perfect for logos, icons, and scalable graphics.

vision · free

recraft-crisp-upscale

recraft-ai

Designed to make images sharper and cleaner, Crisp Upscale increases overall quality, making visuals suitable for web use or print-ready materials.

vision · free

qwen-image-2

qwen

A next-generation image generation and editing model from Alibaba's Qwen team. Supports text-to-image and image editing with strong text rendering, especially for Chinese.

vision · image · free

kling-v3-omni-video

kwaivgi

Kling Video 3.0 Omni: Unified multimodal video generation with reference images, video editing, native audio, and multi-shot control

vision · image · audio
Run locally

metric3dv2

visionaix

Metric3D v2 (TPAMI 2024): Monocular metric depth and surface normals from a single image. Predicts real-world depth in meters. Works indoor and outdoor.

vision · free

wan-2.7-image-pro

wan-video

Generate and edit high-quality images with Alibaba's Wan 2.7 Pro with 4K output, thinking mode, text-to-image, multi-image editing, and image set generation

vision · image · reasoning

flux-2-flex

black-forest-labs

Max-quality image generation and editing with support for ten reference images

vision · image · free
Run locally

wan-2.7-r2v

wan-video

Generate videos from reference images or clips while preserving subject identity using Alibaba's Wan 2.7 reference-to-video model

vision · image · free
Run locally

seedream-4.5

bytedance

Seedream 4.5: Upgraded ByteDance image model with stronger spatial understanding and world knowledge

vision · free

p-video

prunaai

Fast video generation with built-in draft mode for rapid creative iteration. Text-to-video, image-to-video, and audio-to-video in a single endpoint.

vision · image · audio

sam3-video

lucataco

A unified foundation model for prompt-based segmentation in images and videos

vision · free

kling-v2.5-turbo-pro

kwaivgi

Kling 2.5 Turbo Pro: Unlock pro-level text-to-video and image-to-video creation with smooth motion, cinematic depth, and remarkable prompt adherence.

vision · free

wan-2.7-i2v

wan-video

Generate videos from images, with support for first-and-last-frame control, clip continuation, and audio synchronization using Alibaba's Wan 2.7 model

vision · image · audio

q3-turbo

vidu

Fast video generation with text-to-video, image-to-video, and start-end-to-video modes. Up to 16 seconds at 1080p with synchronized audio.

vision · image · audio

ernie-image-turbo

prunaai

ERNIE-Image is an open text-to-image generation model developed by the ERNIE-Image team at Baidu

vision · image · free

lyria-3

google

Generate 30-second music clips from text prompts or images with Lyria 3, Google's music generation model

vision · image · free

irwin-image-lora

t-irwin-neiu

vision · free

reframe-image

luma

Change the aspect ratio of any photo using AI (not cropping)

vision · free
Run locally

flux-fill-pro

black-forest-labs

Professional inpainting and outpainting model with state-of-the-art performance. Edit or extend images with natural, seamless results.

vision · free
Run locally

imagen-4-ultra

google

Use this ultra version of Imagen 4 when quality matters more than speed and cost

vision · free

kling-v2.6-motion-control

kwaivgi

Enables precise control of character actions and expressions from a reference image.

vision · free

depth-anything-v3-metric-pano

vufinder

Monocular metric depth estimation for panoramic images

vision · free

lyria-3-pro

google

Generate full-length songs up to 3 minutes from text prompts or images with Lyria 3 Pro, Google's most capable music generation model

vision · image · free