๐ All Models
1,316 models ยท Page 21 of 37
wan-2.7-r2v
Generate videos from reference images or clips while preserving subject identity using Alibaba's Wan 2.7 reference-to-video model
p-image-upscale
Fast image upscaler in the world (<1s) supporting outputs up to 8 MP. Upscales images to 4 MP in under one second.
kling-v2.5-turbo-pro
Kling 2.5 Turbo Pro: Unlock pro-level text-to-video and image-to-video creation with smooth motion, cinematic depth, and remarkable prompt adherence.
wan-2.7-videoedit
Edit videos with natural language instructions using Alibaba's Wan 2.7 VideoEdit model
snapcook
layerize
Take a flat graphic, remove text, and get structured text layers back for editing and recomposing
metric3dv2
Metric3D v2 (TPAMI 2024): Monocular metric depth and surface normals from a single image. Predicts real-world depth in meters. Works indoor and outdoor.
wikidiki
seedance-2.0-fast
A faster variant of Seedance 2.0 for quicker video generation with multimodal inputs and native audio.
watercolourisa
pixy-yolo
Object Recognition model
ernie-image
ERNIE-Image is an open text-to-image generation model developed by the ERNIE-Image team at Baidu
Phi-3-mini-4k-instruct
Open-source Phi-3-mini-4k-instruct model from microsoft โ available for download and self-hosting on Hugging Face.
reframe-image
Change the aspect ratio of any photo using AI (not cropping)
lucy-edit-2
Edit and transform videos with text prompts and reference images. Style transfers, object replacement, character transformation, and more.
siglip-large-patch16-384
Get embeddings for image using siglip-large-patch16-384
pink-phantom
Inspired by 90s streetwear, graffiti culture, and macabre aesthetics
wan-2.7-image
Generate and edit images with Alibaba's Wan 2.7
gemini-3.1-flash-tts
Google's fast, expressive text-to-speech model with 30 voices and 70+ language support
irwin-image-lora
map-anything
Universal Feed-Forward Metric 3D Reconstruction
flux-2-flex
Max-quality image generation and editing with support for ten reference images
herme
lyria-3-pro
Generate full-length songs up to 3 minutes from text prompts or images with Lyria 3 Pro, Google's most capable music generation model
gemma-4-26b-a4b-fast
This is a version of the MoE Gemma 4 26B optimised by Pruna AI.
music-2.5
Generate full-length songs with vocals, lyrics, and rich instrumentation from a text prompt
sam3-video
A unified foundation model for prompt-based segmentation in images and videos
wan-2.7-i2v
Generate videos from images, with support for first-and-last-frame control, clip continuation, and audio synchronization using Alibaba's Wan 2.7 model
music-cover
Reimagine any song in a different style โ change voice, instruments, genre, and arrangement while keeping the original melody
wan-2.7-t2v
Generate videos with audio from text prompts using Alibaba's Wan 2.7 model. 1080p, up to 15 seconds, with audio synchronization.
facefixer
ML model that detects acne, dark circles, wrinkles and oily skin in one go.
stickman-lora
jacquiedigital
theretroposter01
palomacalazans
p-video
Fast video generation with built-in draft mode for rapid creative iteration. Text-to-video, image-to-video, and audio-to-video in a single endpoint.
