๐ All Models
837 models ยท Page 17 of 24
flux-lora-bronzino-painting
FLUX LoRA trained on paintings by Bronzino. Cold enamel-smooth surfaces, austere aristocratic sitters, jewelled costumes, Florentine Mannerist style.
deepseek-v3.1
Latest hybrid thinking model from Deepseek
recraft-v4-svg
Generate production-ready SVG vector images from text prompts. Recraft V4's design taste applied to vector output โ clean geometry, structured layers, and editable paths.
recraft-remove-background
Automated background removal for images. Tuned for AI-generated content, product photos, portraits, and design workflows
ultimate_rvc
An extension of AiCoverGen, which provides several new features and improvements, enabling users to generate audio-related content using RVC with ease. Ideal for people who want to incorporate singing functionality into their AI assistant/chatbot/vtuber,
recraft-v4
Recraft's latest image generation model, built around design taste. Strong prompt accuracy, art-directed composition, and integrated text rendering. Fast and cost-efficient at standard resolution.
map-anything-pi3x
A feed-forward neural network that offers a novel approach to visual geometry reconstruction.
p-image
A sub 1 second text-to-image model built for production use cases.
ltx-2-pro
Delivers high visual fidelity with fast turnaround. Great for daily content creation, marketing teams, and iterative creative workflows.
studioisatwo
video-agent
Turn a text prompt into a complete, polished video with AI-generated script, avatar presenter, voiceover, visuals, and editing.
nano-banana-pro
Google's state of the art image generation and editing model ๐๐
flux-lora-fashion-plate
FLUX LoRA trained on 18th century fashion plates. Hand-coloured engraved illustrations, clean white backgrounds, elegant elongated figures, precise draughtsmanship.
ace-step-1.5
Ace Step 1.5 open source music generation model
reframe-image
Change the aspect ratio of any photo using AI (not cropping)
pisces-rising-style
Arise V2
wan-2.7-i2v
Generate videos from images, with support for first-and-last-frame control, clip continuation, and audio synchronization using Alibaba's Wan 2.7 model
wan-2.7-t2v
Generate videos with audio from text prompts using Alibaba's Wan 2.7 model. 1080p, up to 15 seconds, with audio synchronization.
jacquiedigital
seedance-2.0
ByteDance's multimodal video generation model with native audio, multimodal reference inputs, and intelligent duration control.
gemini-3.1-flash-tts
Google's fast, expressive text-to-speech model with 30 voices and 70+ language support
gemma-4-26b-a4b-fast
This is a version of the MoE Gemma 4 26B optimised by Pruna AI.
sam3-video
A unified foundation model for prompt-based segmentation in images and videos
lofi
Lo-fi hip-hop music generation with ACE-Step 1.5 + LoRA
logo-marks-v1
snapcook
pink-phantom
Inspired by 90s streetwear, graffiti culture, and macabre aesthetics
q3-pro
High-fidelity video generation with text-to-video, image-to-video, and start-end-to-video modes. Up to 16 seconds at 1080p with synchronized audio.
Phi-3-mini-4k-instruct
Open-source Phi-3-mini-4k-instruct model from microsoft โ available for download and self-hosting on Hugging Face.
layerize
Take a flat graphic, remove text, and get structured text layers back for editing and recomposing
tts-1.5-mini
Ultra-fast, cost-efficient text-to-speech with ~120ms latency and 15-language support
pixy-yolo
Object Recognition model
wan-2.7-videoedit
Edit videos with natural language instructions using Alibaba's Wan 2.7 VideoEdit model
p-image-upscale
Fast image upscaler in the world (<1s) supporting outputs up to 8 MP. Upscales images to 4 MP in under one second.
qwen3guard-gen-4b
A 4B-parameter safety and content moderation model that classifies user prompts and assistant responses as Safe, Unsafe, or Controversial with fine-grained category labels and refusal detection. Supports 119 languages.
qwen3-tts
A unified Text-to-Speech demo featuring three powerful modes: Voice, Clone and Design
