modelstop.top
Home/All Models

AI Model Catalogue

Browse 837 models across providers, modalities, and use cases.

๐ŸŒ All Models

837 models ยท Page 17 of 24

flux-lora-bronzino-painting

vestigia

FLUX LoRA trained on paintings by Bronzino. Cold enamel-smooth surfaces, austere aristocratic sitters, jewelled costumes, Florentine Mannerist style.

visionfree
ctxFree in
Explore specs and pricingView details โ†’

deepseek-v3.1

deepseek-ai

Latest hybrid thinking model from Deepseek

textreasoningfree
ctxFree in
Explore specs and pricingView details โ†’

recraft-v4-svg

recraft-ai

Generate production-ready SVG vector images from text prompts. Recraft V4's design taste applied to vector output โ€” clean geometry, structured layers, and editable paths.

visionimagefree
ctxFree in
Explore specs and pricingView details โ†’

recraft-remove-background

recraft-ai

Automated background removal for images. Tuned for AI-generated content, product photos, portraits, and design workflows

visionimagefree
ctxFree in
Explore specs and pricingView details โ†’

ultimate_rvc

meta-innovation

An extension of AiCoverGen, which provides several new features and improvements, enabling users to generate audio-related content using RVC with ease. Ideal for people who want to incorporate singing functionality into their AI assistant/chatbot/vtuber,

audiofree
ctxFree in
Explore specs and pricingView details โ†’

recraft-v4

recraft-ai

Recraft's latest image generation model, built around design taste. Strong prompt accuracy, art-directed composition, and integrated text rendering. Fast and cost-efficient at standard resolution.

visionimagefree
ctxFree in
Explore specs and pricingView details โ†’

map-anything-pi3x

vufinder

A feed-forward neural network that offers a novel approach to visual geometry reconstruction.

textfree
ctxFree in
Explore specs and pricingView details โ†’

p-image

prunaai

A sub 1 second text-to-image model built for production use cases.

visionimagefree
ctxFree in
Explore specs and pricingView details โ†’

ltx-2-pro

lightricks

Delivers high visual fidelity with fast turnaround. Great for daily content creation, marketing teams, and iterative creative workflows.

textfree
ctxFree in
Explore specs and pricingView details โ†’

studioisatwo

hebhar

textfree
ctxFree in
Explore specs and pricingView details โ†’

video-agent

heygen

Turn a text prompt into a complete, polished video with AI-generated script, avatar presenter, voiceover, visuals, and editing.

agentsfree
ctxFree in
Explore specs and pricingView details โ†’

nano-banana-pro

google

Google's state of the art image generation and editing model ๐ŸŒ๐ŸŒ

visionimagefree
ctxFree in
Explore specs and pricingView details โ†’

flux-lora-fashion-plate

vestigia

FLUX LoRA trained on 18th century fashion plates. Hand-coloured engraved illustrations, clean white backgrounds, elegant elongated figures, precise draughtsmanship.

visionfree
ctxFree in
Explore specs and pricingView details โ†’

ace-step-1.5

fishaudio

Ace Step 1.5 open source music generation model

audiofree
ctxFree in
Explore specs and pricingView details โ†’

reframe-image

luma

Change the aspect ratio of any photo using AI (not cropping)

visionfree
ctxFree in
Explore specs and pricingView details โ†’

pisces-rising-style

deborahwon

Arise V2

textfree
ctxFree in
Explore specs and pricingView details โ†’

wan-2.7-i2v

wan-video

Generate videos from images, with support for first-and-last-frame control, clip continuation, and audio synchronization using Alibaba's Wan 2.7 model

visionimageaudio
ctxFree in
Explore specs and pricingView details โ†’

wan-2.7-t2v

wan-video

Generate videos with audio from text prompts using Alibaba's Wan 2.7 model. 1080p, up to 15 seconds, with audio synchronization.

audiofree
ctxFree in
Explore specs and pricingView details โ†’

jacquiedigital

jacquiedeering

textfree
ctxFree in
Explore specs and pricingView details โ†’

seedance-2.0

bytedance

ByteDance's multimodal video generation model with native audio, multimodal reference inputs, and intelligent duration control.

visionaudiofree
ctxFree in
Explore specs and pricingView details โ†’

gemini-3.1-flash-tts

google

Google's fast, expressive text-to-speech model with 30 voices and 70+ language support

textfree
ctx$5.00/1M in
Explore specs and pricingView details โ†’

gemma-4-26b-a4b-fast

prunaai

This is a version of the MoE Gemma 4 26B optimised by Pruna AI.

textfree
ctxFree in
Explore specs and pricingView details โ†’

sam3-video

lucataco

A unified foundation model for prompt-based segmentation in images and videos

visionfree
ctxFree in
Explore specs and pricingView details โ†’

lofi

frow

Lo-fi hip-hop music generation with ACE-Step 1.5 + LoRA

audiofree
ctxFree in
Explore specs and pricingView details โ†’

logo-marks-v1

stefivanovs

textfree
ctxFree in
Explore specs and pricingView details โ†’

snapcook

cleent06

textfree
ctxFree in
Explore specs and pricingView details โ†’

pink-phantom

tattzy25

Inspired by 90s streetwear, graffiti culture, and macabre aesthetics

textfree
ctxFree in
Explore specs and pricingView details โ†’

q3-pro

vidu

High-fidelity video generation with text-to-video, image-to-video, and start-end-to-video modes. Up to 16 seconds at 1080p with synchronized audio.

visionimageaudio
ctxFree in
Explore specs and pricingView details โ†’

Phi-3-mini-4k-instruct

microsoft

Open-source Phi-3-mini-4k-instruct model from microsoft โ€” available for download and self-hosting on Hugging Face.

textinstructfree
ctxFree in
Explore specs and pricingView details โ†’

layerize

ideogram-ai

Take a flat graphic, remove text, and get structured text layers back for editing and recomposing

textfree
ctxFree in
Explore specs and pricingView details โ†’

tts-1.5-mini

inworld

Ultra-fast, cost-efficient text-to-speech with ~120ms latency and 15-language support

textfree
ctxFree in
Explore specs and pricingView details โ†’

pixy-yolo

cynthiachehayeb

Object Recognition model

textfree
ctxFree in
Explore specs and pricingView details โ†’

wan-2.7-videoedit

wan-video

Edit videos with natural language instructions using Alibaba's Wan 2.7 VideoEdit model

free
ctxFree in
Explore specs and pricingView details โ†’

p-image-upscale

prunaai

Fast image upscaler in the world (<1s) supporting outputs up to 8 MP. Upscales images to 4 MP in under one second.

visionfree
ctxFree in
Explore specs and pricingView details โ†’

qwen3guard-gen-4b

ditto--ai

A 4B-parameter safety and content moderation model that classifies user prompts and assistant responses as Safe, Unsafe, or Controversial with fine-grained category labels and refusal detection. Supports 119 languages.

textfree
ctxFree in
Explore specs and pricingView details โ†’

qwen3-tts

qwen

A unified Text-to-Speech demo featuring three powerful modes: Voice, Clone and Design

textfree
ctxFree in
Explore specs and pricingView details โ†’