๐ All Models
850 models ยท Page 17 of 24
granite-4.0-h-small
Granite-4.0-H-Small is a 32B parameter long-context instruct model finetuned from Granite-4.0-H-Small-Base using a combination of open source instruction datasets with permissive license and internally collected synthetic datasets.
90s-photographs-bjork
90's Photographs in the style of Bjorks promo material
gpt-oss-20b-fast
Advanced 20B open-weight reasoning models to customize for any use case and run anywhere.
grok-4
Grok 4 is xAIโs most advanced reasoning model. Excels at logical thinking and in-depth analysis. Ideal for insightful discussions and complex problem-solving.
tts-1.5-max
Highest-quality text-to-speech with <200ms latency, emotion control, and 15-language support
nano-banana-2-bg-remove
Remove backgrounds with real alpha transparency using Nano Banana 2. Triangulation matting produces clean edges, proper semi-transparency, and accurate colors โ superior to traditional background removal. Optionally specify what to isolate.
isawatercolour
Phi-3-mini-4k-instruct
Open-source Phi-3-mini-4k-instruct model from microsoft โ available for download and self-hosting on Hugging Face.
yolo11n
Ultralytics YOLO11n object detection model with 2.6M parameters. Achieves 39.5 mAP50-95 on COCO dataset. Optimized for real-time inference with 1.55 ms speed on T4 GPU..
hidream-l1-fast
This is an optimised version of the hidream-l1 model using the pruna ai optimisation toolkit!
yolov8s-worldv2
Ultralytics YOLOv8s worldv2 Real-Time Open-Vocabulary Object Detection model with 12.7M parameters. Achieves 37.7 mAP50-95 on COCO dataset. Optimized for real-time inference
op-replay-clipper-beta
Beta/RFC version of https://replicate.com/nelsonjchen/op-replay-clipper
sdxl-cheetah
llada2.1-flash
The smartest diffusion language model up to ~800+ tps
platmoji-2.0
This is Platmoji 2, trained more to mimic emojis in an extremely similar way. (Realism in emojis go to Platmoji 1)
yoloe-11s
Ultralytics YOLOE-L Real-Time Seeing Anything model with 26.2M parameters. Achieves 52.6 mAP50-95 on COCO dataset. Optimized for real-time inference with 6.2 ms speed on T4 GPU..
watercolourisa
gemma-4-26b-a4b-fast
This is a version of the MoE Gemma 4 26B optimised by Pruna AI.
snapcook
gemini-3.1-flash-tts
Google's fast, expressive text-to-speech model with 30 voices and 70+ language support
isaindia
pisces-rising-style
Arise V2
theretroposter01
palomacalazans
pink-phantom
Inspired by 90s streetwear, graffiti culture, and macabre aesthetics
wikidiki
sebastian
pixy-yolo
Object Recognition model
bgogo-temp
Temporary publish target for bgogo deployment validation.
isaarchsketch
logo-marks-v1
qwen-3.5-27b-fast
This is a version of Qwen 3.5 27B optimised by Pruna AI.
qwen3guard-gen-4b
A 4B-parameter safety and content moderation model that classifies user prompts and assistant responses as Safe, Unsafe, or Controversial with fine-grained category labels and refusal detection. Supports 119 languages.
map-anything
Universal Feed-Forward Metric 3D Reconstruction
