modelstop.top
Home/All Models

AI Model Catalogue

Browse 1,316 models across providers, modalities, and use cases.

🌐 All Models

1,316 models Β· Page 16 of 37

sam-audio-large

geopti

SAM-Audio is a foundation model for isolating any sound in audio using text

audiofree
ctxFree in
Explore specs and pricingView details β†’

gpt-5.1

openai

textfree
ctxFree in
Explore specs and pricingView details β†’

o3-mini-2025-01-31

openai

textfree
ctxFree in
Explore specs and pricingView details β†’

gpt-4.1-nano

openai

textfree
ctxFree in
Explore specs and pricingView details β†’

sam-audio-base

geopti

A foundation model for isolating any sound in audio using text, visual, or temporal prompts

audiofree
ctxFree in
Explore specs and pricingView details β†’

o4-mini-2025-04-16

openai

textfree
ctxFree in
Explore specs and pricingView details β†’

gpt-5.1-codex

openai

textfree
ctxFree in
Explore specs and pricingView details β†’

riverflow-2.0-pro

sourceful

Agentic image model optimized for robust, high-precision generations supporting font control

visionimageagents
ctxFree in
Explore specs and pricingView details β†’

fabric-1.0

veed

VEED Fabric 1.0 is an image-to-video API that turns any image into a talking video

visionfree
ctxFree in
Explore specs and pricingView details β†’

prefectillustriousxlv60

skullycute

Anime Illust model

textfree
ctxFree in
Explore specs and pricingView details β†’

hassan-face-lora

hassan-a190

textfree
ctxFree in
Explore specs and pricingView details β†’

speech-2.8-turbo

minimax

Minimax Speech 2.8 Turbo: Turn text into natural, expressive speech with voice cloning, emotion control, and support for 40+ languages

textfree
ctxFree in
Explore specs and pricingView details β†’

fibo

bria

SOTA Open source model trained on licensed data, transforming intent into structured control for precise, high-quality AI image generation in enterprise and agentic workflows.

visionimageagents
ctxFree in
Explore specs and pricingView details β†’

annanovo

annaclaradsg20

textfree
ctxFree in
Explore specs and pricingView details β†’

nghitts3

thanhnew2001test

NghiTTS API for Vietnamese

textfree
ctxFree in
Explore specs and pricingView details β†’

speech-2.8-hd

minimax

Minimax Speech 2.8 HD focuses on high-fidelity audio generation with features like studio-grade quality, flexible emotion control, multilingual support, and voice cloning capabilities

audiomultilingualfree
ctxFree in
Explore specs and pricingView details β†’

gpt-5.4-mini

openai

textfree
ctxFree in
Explore specs and pricingView details β†’

gpt-4o-mini-search-preview-2025-03-11

openai

textfree
ctxFree in
Explore specs and pricingView details β†’

depth-anything-v3-mono

vufinder

Monocular relative depth estimation

textfree
ctxFree in
Explore specs and pricingView details β†’

depth-anything-v3-metric

vufinder

Monocular metric depth estimation

textfree
ctxFree in
Explore specs and pricingView details β†’

image-3.2

bria

Commercial-ready, trained entirely on licensed data, text-to-image model. With only 4B parameters provides exceptional aesthetics and text rendering. Evaluated to be on par to other leading models in the market

visionimagefree
ctxFree in
Explore specs and pricingView details β†’

gpt-5.2-2025-12-11

openai

textfree
ctxFree in
Explore specs and pricingView details β†’

dreamactor-m2.0

bytedance

Animate any character, humans, cartoons, animals, even non-humans, from a single image + driving video

visionfree
ctxFree in
Explore specs and pricingView details β†’

kidstable-illustrator

chesterm2022

textfree
ctxFree in
Explore specs and pricingView details β†’

ltx-2-fast

lightricks

Ideal for rapid ideation and mobile workflows. Perfect for creators who need instant feedback, real-time previews, or high-throughput content.

textfree
ctxFree in
Explore specs and pricingView details β†’

ltx-2.3-pro

lightricks

High-fidelity video generation with portrait support, audio-to-video, retake, and extend. Text, image, and audio-driven creation up to 4K at 50 FPS.

visionimageaudio
ctxFree in
Explore specs and pricingView details β†’

map-anything-pi3x

vufinder

A feed-forward neural network that offers a novel approach to visual geometry reconstruction.

textfree
ctxFree in
Explore specs and pricingView details β†’

gpt-oss-20b

openai

20b open-weight language model from OpenAI

textfreelong-context
131,072 ctxFree in
Explore specs and pricingView details β†’

demucs-api

iboostai

Demucs v4 API

textfree
ctxFree in
Explore specs and pricingView details β†’

parks-illustrations

rbiddle50

textfree
ctxFree in
Explore specs and pricingView details β†’

pokemon-trainers

resilientcoders

Creates Pokemon trainers

textfree
ctxFree in
Explore specs and pricingView details β†’

vintage

benmorton74-hash

textfree
ctxFree in
Explore specs and pricingView details β†’

kokoro-82m-zh

echo-the-coder

δΈ€δΈͺδ½“η§―θ™½ε°δ½†εŠŸθƒ½εΌΊε€§ηš„ TTS ζ¨‘εž‹γ€‚

textfree
ctxFree in
Explore specs and pricingView details β†’

z-image-turbo

prunaai

Z-Image Turbo is a super fast text-to-image model of 6B parameters developed by Tongyi-MAI.

visionimagefree
ctxFree in
Explore specs and pricingView details β†’

stable-diffusion

zedge

Private instance of stable-diffusion

textimagefree
ctxFree in
Explore specs and pricingView details β†’

nova-anime-xl-14

kirirururu

Nova Anime XL (Illustrious) v14.0

textfree
ctxFree in
Explore specs and pricingView details β†’