๐ All Models
1,316 models ยท Page 9 of 37
Sao10K/L3-8B-Lunaris-v1-Turbo
Qwen/Qwen3.5-2B
embed-multilingual-light-v3.0
Qwen/Qwen3-VL-235B-A22B-Instruct
Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text generation with visual understanding across images and video. The Instruct model targets general vision-language use (VQA, document parsing, chart/table...
BAAI/bge-m3-multi
anthropic/claude-4-opus
zai-org/GLM-4.7
Gryphe/MythoMax-L2-13b
One of the highest performing and most popular fine-tunes of Llama 2 13B, with rich descriptions and roleplay. #merge
embed-english-light-v3.0
Qwen/Qwen3.5-35B-A3B
stabilityai/sdxl-turbo
nvidia/Nemotron-3-Nano-30B-A3B
NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...
microsoft/phi-4
Microsoft Phi-4 14B โ small language model achieving state-of-the-art results on reasoning tasks.
mistralai/Mistral-Nemo-Instruct-2407
anthropic/claude-3-7-sonnet-latest
google/gemini-2.5-flash
intfloat/e5-base-v2
PrunaAI/p-image-Edit
mistralai/Mixtral-8x7B-Instruct-v0.1
Mixtral 8ร7B Instruct on DeepInfra โ popular MoE model with 32K context and strong multilingual performance.
Qwen/Qwen3.5-397B-A17B
Qwen/Qwen3-235B-A22B-Thinking-2507
Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tasks. It activates 22B of its 235B parameters per forward pass and natively supports up to 262,144...
Qwen/Qwen3-14B
Qwen3-14B is a dense 14.8B parameter causal language model from the Qwen3 series, designed for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for...
embed-english-v3.0
State-of-the-art English text embedding model for semantic search, clustering, and classification.
Qwen/Qwen3-Embedding-4B-batch
Bria/fibo_edit
black-forest-labs/FLUX-2-pro
embed-v4.0
Cohere's latest multimodal embedding model supporting text and images for advanced semantic search.
Qwen/Qwen3-Embedding-0.6B
thenlper/gte-large
Qwen/Qwen3-Max-Thinking
Qwen3-Max-Thinking is the flagship reasoning model in the Qwen3 series, designed for high-stakes cognitive tasks that require deep, multi-step reasoning. By significantly scaling model capacity and reinforcement learning compute, it...
