All Models
1,316 models · Page 16 of 37
sam-audio-large
SAM-Audio is a foundation model for isolating any sound in audio using text, visual, or temporal prompts
gpt-5.1
o3-mini-2025-01-31
gpt-4.1-nano
sam-audio-base
A foundation model for isolating any sound in audio using text, visual, or temporal prompts
o4-mini-2025-04-16
gpt-5.1-codex
riverflow-2.0-pro
Agentic image model optimized for robust, high-precision generations supporting font control
fabric-1.0
VEED Fabric 1.0 is an image-to-video API that turns any image into a talking video
prefectillustriousxlv60
Anime illustration model
hassan-face-lora
speech-2.8-turbo
Minimax Speech 2.8 Turbo: Turn text into natural, expressive speech with voice cloning, emotion control, and support for 40+ languages
fibo
SOTA open-source model trained on licensed data, transforming intent into structured control for precise, high-quality AI image generation in enterprise and agentic workflows.
annanovo
nghitts3
NghiTTS API for Vietnamese
speech-2.8-hd
Minimax Speech 2.8 HD focuses on high-fidelity audio generation with features like studio-grade quality, flexible emotion control, multilingual support, and voice cloning capabilities
gpt-5.4-mini
gpt-4o-mini-search-preview-2025-03-11
depth-anything-v3-mono
Monocular relative depth estimation
depth-anything-v3-metric
Monocular metric depth estimation
image-3.2
Commercial-ready text-to-image model trained entirely on licensed data. With only 4B parameters, it provides exceptional aesthetics and text rendering. Evaluated to be on par with other leading models in the market.
gpt-5.2-2025-12-11
dreamactor-m2.0
Animate any character (humans, cartoons, animals, even non-humans) from a single image plus a driving video
kidstable-illustrator
ltx-2-fast
Ideal for rapid ideation and mobile workflows. Perfect for creators who need instant feedback, real-time previews, or high-throughput content.
ltx-2.3-pro
High-fidelity video generation with portrait support, audio-to-video, retake, and extend. Text, image, and audio-driven creation up to 4K at 50 FPS.
map-anything-pi3x
A feed-forward neural network that offers a novel approach to visual geometry reconstruction.
gpt-oss-20b
20B open-weight language model from OpenAI
demucs-api
Demucs v4 API
parks-illustrations
pokemon-trainers
Creates Pokemon trainers
vintage
kokoro-82m-zh
A small but powerful TTS model.
z-image-turbo
Z-Image Turbo is a very fast 6B-parameter text-to-image model developed by Tongyi-MAI.
stable-diffusion
Private instance of stable-diffusion
nova-anime-xl-14
Nova Anime XL (Illustrious) v14.0
