๐จ Image Generation
273 models ยท Page 7 of 8
flux-lora-bronzino-painting
FLUX LoRA trained on paintings by Bronzino. Cold enamel-smooth surfaces, austere aristocratic sitters, jewelled costumes, Florentine Mannerist style.
flux-lora-lawrence-painting
FLUX LoRA trained on portraits by Thomas Lawrence, the leading British portraitist of the Regency era. Fluid brushwork, romantic lighting, aristocratic ease.
floorplan-recognition
Segment floorplan images into walls, doors, windows, and kitchen zones using a deep learning model, then extract structured contours and center lines as JSON for downstream applications.
expand-image
Bria Expand expands images beyond their borders in high quality. Resizing the image by generating new pixels to expand to the desired aspect ratio. Trained exclusively on licensed data for safe and risk-free commercial use
resnet
Classifies images with ResNet-50
recraft-creative-upscale
Creative Upscale focuses on enhancing details and refining complex elements in the image. It doesnโt just increase resolution but adds depth by improving textures, fine details, and facial features.
kling-v3-motion-control
Kling 3.0 motion control: transfer motion from a reference video to any character image with improved consistency and quality.
qwen-image-2-pro
The pro version of Qwen Image 2 from Alibaba's Qwen team. Enhanced text rendering, realism, and semantic adherence for high-quality image generation and editing.
nano-banana-2
Google's fast image generation model with conversational editing, multi-image fusion, and character consistency
nano-banana-2-transparent
Nano Banana 2 with alpha transparency. Generates images with real RGBA transparency using triangulation matting โ clean edges, proper semi-transparency, and accurate colors. Powered by Google Gemini 3.1 Flash Image.
flux-lora-first-empire-paintings
FLUX LoRA trained on French First Empire paintings. Napoleonic state portraiture โ generals in gleaming uniforms, imperial ceremonies, neoclassical grandeur.
flux-lora-nattier-painting
FLUX LoRA trained on paintings by Jean-Marc Nattier. Ladies of the court depicted as goddesses, flowing blue drapery, soft pastel tones, Rococo elegance.
flux-lora-fashion-plate
FLUX LoRA trained on 18th century fashion plates. Hand-coloured engraved illustrations, clean white backgrounds, elegant elongated figures, precise draughtsmanship.
recraft-vectorize
Convert raster images to high-quality SVG format with precision and clean vector paths, perfect for logos, icons, and scalable graphics.
recraft-crisp-upscale
Designed to make images sharper and cleaner, Crisp Upscale increases overall quality, making visuals suitable for web use or print-ready materials.
qwen-image-2
A next-generation image generation and editing model from Alibaba's Qwen team. Supports text-to-image and image editing with strong text rendering, especially for Chinese.
kling-v3-omni-video
Kling Video 3.0 Omni: Unified multimodal video generation with reference images, video editing, native audio, and multi-shot control
metric3dv2
Metric3D v2 (TPAMI 2024): Monocular metric depth and surface normals from a single image. Predicts real-world depth in meters. Works indoor and outdoor.
wan-2.7-image-pro
Generate and edit high-quality images with Alibaba's Wan 2.7 Pro with 4K output, thinking mode, text-to-image, multi-image editing, and image set generation
flux-2-flex
Max-quality image generation and editing with support for ten reference images
wan-2.7-r2v
Generate videos from reference images or clips while preserving subject identity using Alibaba's Wan 2.7 reference-to-video model
seedream-4.5
Seedream 4.5: Upgraded Bytedance image model with stronger spatial understanding and world knowledge
p-video
Fast video generation with built-in draft mode for rapid creative iteration. Text-to-video, image-to-video, and audio-to-video in a single endpoint.
sam3-video
A unified foundation model for prompt-based segmentation in images and videos
kling-v2.5-turbo-pro
Kling 2.5 Turbo Pro: Unlock pro-level text-to-video and image-to-video creation with smooth motion, cinematic depth, and remarkable prompt adherence.
wan-2.7-i2v
Generate videos from images, with support for first-and-last-frame control, clip continuation, and audio synchronization using Alibaba's Wan 2.7 model
q3-turbo
Fast video generation with text-to-video, image-to-video, and start-end-to-video modes. Up to 16 seconds at 1080p with synchronized audio.
ernie-image-turbo
ERNIE-Image is an open text-to-image generation model developed by the ERNIE-Image team at Baidu
lyria-3
Generate 30-second music clips from text prompts or images with Lyria 3, Google's music generation model
irwin-image-lora
reframe-image
Change the aspect ratio of any photo using AI (not cropping)
flux-fill-pro
Professional inpainting and outpainting model with state-of-the-art performance. Edit or extend images with natural, seamless results.
imagen-4-ultra
Use this ultra version of Imagen 4 when quality matters more than speed and cost
kling-v2.6-motion-control
Enables precise control of character actions and expressions from a reference image.
depth-anything-v3-metric-pano
Monocular metric depth estimation for panoramic images
lyria-3-pro
Generate full-length songs up to 3 minutes from text prompts or images with Lyria 3 Pro, Google's most capable music generation model
