Free & Open
837 models · Page 13 of 24
gpt-4o-mini-audio-preview
o3-mini-2025-01-31
gpt-4.1-nano
firered-image-edit
FireRed-Image-Edit is a general-purpose image editing model that delivers high-fidelity and consistent editing across a wide range of scenarios.
remove-background
Bria AI's remove background model
gemini-2.5-flash
Google's hybrid "thinking" AI model optimized for speed and cost-efficiency
gpt-5.4-mini-2026-03-17
gpt-3.5-turbo-1106
gpt-3.5-turbo-0125
gpt-4o-mini-search-preview-2025-03-11
gpt-4.1-mini-2025-04-14
gpt-4.1-mini
depth-anything-v3-mono
Monocular relative depth estimation
depth-anything-v3-metric
Monocular metric depth estimation
nghitts3
NghiTTS API for Vietnamese
gpt-5.4
gpt-5.4-nano-2026-03-17
gpt-5.4-nano
gpt-image-1
speech-2.8-hd
Minimax Speech 2.8 HD focuses on high-fidelity audio generation with features like studio-grade quality, flexible emotion control, multilingual support, and voice cloning capabilities
wan2.6-i2v-flash
Image-to-video generation with optional audio, multi-shot narrative support, and faster inference
claude-opus-4.6
Anthropic's most intelligent model with state-of-the-art coding, reasoning, and agentic capabilities
gpt-5.4-mini
drapolinar
imagen-3-fast
A faster and cheaper Imagen 3 model, for when price or speed are more important than final image quality
grok-imagine-image
SOTA image model from xAI
p-image-trainer
Fast LoRA trainer for p-image, a super fast text-to-image model developed by Pruna AI. Use LoRAs here: https://replicate.com/prunaai/p-image-lora. Find or contribute LoRAs here: https://huggingface.co/collections/PrunaAI/p-image
p-image-edit-lora
Use LoRAs trained with https://replicate.com/prunaai/p-image-edit-trainer. Find or contribute LoRAs here: https://huggingface.co/collections/PrunaAI/p-image-edit-loras.
fabric-1.0
VEED Fabric 1.0 is an image-to-video API that turns any image into a talking video
prefectillustriousxlv60
Anime illustration model
hassan-face-lora
fibo
SOTA open-source model trained on licensed data, transforming intent into structured control for precise, high-quality AI image generation in enterprise and agentic workflows.
image-3.2
Commercial-ready text-to-image model trained entirely on licensed data. With only 4B parameters, it provides exceptional aesthetics and text rendering, and is evaluated to be on par with other leading models on the market.
sam-audio-base
A foundation model for isolating any sound in audio using text, visual, or temporal prompts
