๐ All Models
1,750 models ยท Page 34 of 49
intfloat/e5-base-v2
BAAI/bge-large-en-v1.5
Bria/enhance
Qwen/Qwen3-VL-235B-A22B-Instruct
Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text generation with visual understanding across images and video. The Instruct model targets general vision-language use (VQA, document parsing, chart/table...
Qwen/Qwen3-Embedding-0.6B-batch
allenai/Olmo-3.1-32B-Instruct
meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo
intfloat/multilingual-e5-large
ByteDance/Seedream-4.5
Qwen/Qwen2.5-VL-32B-Instruct
embed-english-v3.0-image
Gryphe/MythoMax-L2-13b
One of the highest performing and most popular fine-tunes of Llama 2 13B, with rich descriptions and roleplay. #merge
stabilityai/sdxl-turbo
ClarityAI/creative
BAAI/bge-m3-multi
mistralai/Mistral-Small-3.2-24B-Instruct-2506
Bria/fibo_edit
Bria/gen_fill
Bria/gen_fill โ served on DeepInfra's GPU cloud for scalable, cost-efficient inference.
embed-multilingual-v3.0
Cohere's multilingual embeddings supporting 100+ languages for global semantic search.
black-forest-labs/FLUX-2-dev
embed-multilingual-v3.0-image
embed-multilingual-light-v3.0
embed-english-v3.0
State-of-the-art English text embedding model for semantic search, clustering, and classification.
deepseek-ai/DeepSeek-V3.1
deepseek-ai/DeepSeek-OCR
Qwen/Qwen3-Max-Thinking
Qwen3-Max-Thinking is the flagship reasoning model in the Qwen3 series, designed for high-stakes cognitive tasks that require deep, multi-step reasoning. By significantly scaling model capacity and reinforcement learning compute, it...
nvidia/Nemotron-3-Nano-30B-A3B
NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...
mistralai/Mistral-Small-24B-Instruct-2501
Mistral Small 3 is a 24B-parameter language model optimized for low-latency performance across common AI tasks. Released under the Apache 2.0 license, it features both pre-trained and instruction-tuned versions designed...
embed-english-light-v3.0
PrunaAI/p-image-Edit
google/gemma-3-27b-it
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...
Qwen/Qwen3.5-27B
Qwen/Qwen3.5-0.8B
Qwen/Qwen2.5-72B-Instruct
Alibaba Qwen2.5 72B Instruct on DeepInfra โ top open-source model with multilingual and coding strengths.
google/gemini-2.5-flash
openai/gpt-oss-20b
gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 license. It uses a Mixture-of-Experts (MoE) architecture with 3.6B active parameters per forward pass, optimized for...
