modelstop.top
Home/All Models

AI Model Catalogue

Browse 837 models across providers, modalities, and use cases.

πŸ†“ Free & Open

837 models Β· Page 18 of 24

geocalib

visionaix

GeoCalib (ECCV 2024): Single-image camera calibration. Estimates focal length, FoV, distortion, roll and pitch from one image using a deep net + Levenberg-Marquardt optimizer. Works on both outdoor and indoor scenes.

visionfree
ctxFree in
Explore specs and pricingView details β†’

video-super-resolution-rife-pro

bitflow

Super video quality enhancement featuring fast upscaling with TensorRT and frame interpolation with RIFE.

free
ctxFree in
Explore specs and pricingView details β†’

android-dream-v4

interfaceconjurer

A custom Flux LoRA model trained on painterly illustrated poster art inspired by Blade Runner 2049. The style features atmospheric cyberpunk cityscapes with dramatic scale β€” tiny silhouetted figures dwarfed by massive holographic projections and towering

visionfree
ctxFree in
Explore specs and pricingView details β†’

veo-3.1-fast

google

New and improved version of Veo 3 Fast, with higher-fidelity video, context-aware audio and last frame support

audiofree
ctxFree in
Explore specs and pricingView details β†’

p-image-edit

prunaai

A sub 1 second 0.01$ multi-image editing model built for production use cases. For image generation, check out p-image here: https://replicate.com/prunaai/p-image

visionimagefree
ctxFree in
Explore specs and pricingView details β†’

product-photo-studio

i-tokyo

Generate professional e-commerce product photos from a single image. Automatically removes background, creates realistic studio scenes, and adds natural shadows.

visionimagefree
ctxFree in
Explore specs and pricingView details β†’

flux-2-max

black-forest-labs

The highest fidelity image model from Black Forest Labs

visionfree
ctxFree in
Explore specs and pricingView details β†’

qwen-3.5-35b-a3b-fast

prunaai

This is a version of the MoE Qwen 3.5 35B optimised by Pruna AI.

textfree
ctxFree in
Explore specs and pricingView details β†’

dotted-waveform-visualizer

lucataco

Create a dotted waveform video from an audio file

audiofree
ctxFree in
Explore specs and pricingView details β†’

ffhqdat-4x-upscaler

supersambat

4x face image upscaler trained on FFHQ dataset using DAT (Dual Aggregation Transformer) architecture. Optimized for portrait and face photos.

visionfree
ctxFree in
Explore specs and pricingView details β†’

video-color-filter-lut

bitflow

This LUT-based color filter is ideal for color grading user-generated AI videos or short videos shot on smartphones.

free
ctxFree in
Explore specs and pricingView details β†’

fineline

futuranota

textfree
ctxFree in
Explore specs and pricingView details β†’

rvm

hcolde

RobustVideoMatting on Replicate: input mp4 video, output black-and-white alpha-mask.mp4.

free
ctxFree in
Explore specs and pricingView details β†’

kling-v2.5-turbo-pro

kwaivgi

Kling 2.5 Turbo Pro: Unlock pro-level text-to-video and image-to-video creation with smooth motion, cinematic depth, and remarkable prompt adherence.

visionfree
ctxFree in
Explore specs and pricingView details β†’

wan-2.7-image

wan-video

Generate and edit images with Alibaba's Wan 2.7

visionimagefree
ctxFree in
Explore specs and pricingView details β†’

metric3dv2

visionaix

Metric3D v2 (TPAMI 2024): Monocular metric depth and surface normals from a single image. Predicts real-world depth in meters. Works indoor and outdoor.

visionfree
ctxFree in
Explore specs and pricingView details β†’

flux-2-flex

black-forest-labs

Max-quality image generation and editing with support for ten reference images

visionimagefree
ctxFree in
Explore specs and pricingView details β†’

stickman-lora

ibasusta-2025

textfree
ctxFree in
Explore specs and pricingView details β†’

facefixer

dobariyz

ML model that detects acne, dark circles, wrinkles and oily skin in one go.

textfree
ctxFree in
Explore specs and pricingView details β†’

q3-turbo

vidu

Fast video generation with text-to-video, image-to-video, and start-end-to-video modes. Up to 16 seconds at 1080p with synchronized audio.

visionimageaudio
ctxFree in
Explore specs and pricingView details β†’

sdxl-cheetah

prunaai

textfree
ctxFree in
Explore specs and pricingView details β†’

wan-2.7-r2v

wan-video

Generate videos from reference images or clips while preserving subject identity using Alibaba's Wan 2.7 reference-to-video model

visionimagefree
ctxFree in
Explore specs and pricingView details β†’

yolov8s-worldv2

ultralytics

Ultralytics YOLOv8s worldv2 Real-Time Open-Vocabulary Object Detection model with 12.7M parameters. Achieves 37.7 mAP50-95 on COCO dataset. Optimized for real-time inference

textfree
ctxFree in
Explore specs and pricingView details β†’

depth-anything-v3-metric-pano

vufinder

Monocular metric depth estimation for panoramic images

visionfree
ctxFree in
Explore specs and pricingView details β†’

seedream-4.5

bytedance

Seedream 4.5: Upgraded Bytedance image model with stronger spatial understanding and world knowledge

visionfree
ctxFree in
Explore specs and pricingView details β†’

op-replay-clipper-beta

nelsonjchen

Beta/RFC version of https://replicate.com/nelsonjchen/op-replay-clipper

textfree
ctxFree in
Explore specs and pricingView details β†’

enterprise-glass-v1

evancnavarro

A visual style focused on modern enterprise architectureβ€”reflective blue glass skyscrapers captured from upward perspectives, emphasizing symmetry, scale, and a clean, premium corporate aesthetic.

textfree
ctxFree in
Explore specs and pricingView details β†’

music-cover

minimax

Reimagine any song in a different style β€” change voice, instruments, genre, and arrangement while keeping the original melody

audiofree
ctxFree in
Explore specs and pricingView details β†’

kling-v2.6-motion-control

kwaivgi

Enables precise control of character actions and expressions from a reference image.

visionfree
ctxFree in
Explore specs and pricingView details β†’

isawatercolour

hebhar

textfree
ctxFree in
Explore specs and pricingView details β†’

stems-separator

triadmusic

Image to separate stems from a song, using demucs and spleeter

visionfree
ctxFree in
Explore specs and pricingView details β†’

lyria-3-pro

google

Generate full-length songs up to 3 minutes from text prompts or images with Lyria 3 Pro, Google's most capable music generation model

visionimagefree
ctxFree in
Explore specs and pricingView details β†’

lucy-edit-2

decart

Edit and transform videos with text prompts and reference images. Style transfers, object replacement, character transformation, and more.

visionfree
ctxFree in
Explore specs and pricingView details β†’

veo-3.1

google

New and improved version of Veo 3, with higher-fidelity video, context-aware audio, reference image and last frame support

visionaudiofree
ctxFree in
Explore specs and pricingView details β†’

isaindia

hebhar

textfree
ctxFree in
Explore specs and pricingView details β†’

hidream-l1-fast

prunaai

This is an optimised version of the hidream-l1 model using the pruna ai optimisation toolkit!

textfree
ctxFree in
Explore specs and pricingView details β†’