All Models
194 models · Page 1 of 6
Qwen3.6-35B-A3B-Claude-4.7-Opus-Reasoning-Distilled-GGUF
Open-source Qwen3.6-35B-A3B-Claude-4.7-Opus-Reasoning-Distilled-GGUF model from mradermacher, available for download and self-hosting on Hugging Face.
qwen3-30b-a3b-fp8
Qwen3 is the latest generation of large language models in the Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groundbreaking advancements in reasoning, instruction-following, agent capabilities, and multilingual support.
qwq-32b
QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ, which is capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks, especially hard problems. QwQ-32B is the medium-sized reasoning model, which is capable of achieving competitive performance against state-of-the-art reasoning models, e.g., DeepSeek-R1, o1-mini.
gemma-3-12b-it
Gemma 3 models are well-suited for a variety of text generation and image understanding tasks, including question answering, summarization, and reasoning. Gemma 3 models are multimodal, handling text and image input and generating text output, with a large 128K context window and multilingual support in over 140 languages, and are available in more sizes than previous versions.
gpt-oss-20b
OpenAI's open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. gpt-oss-20b is for lower latency and for local or specialized use cases.
llama-3.2-11b-vision-instruct
The Llama 3.2-Vision instruction-tuned models are optimized for visual recognition, image reasoning, captioning, and answering general questions about an image.
llama-3-8b-instruct
Generation over generation, Meta Llama 3 demonstrates state-of-the-art performance on a wide range of industry benchmarks and offers new capabilities, including improved reasoning.
gpt-oss-120b
OpenAI's open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. gpt-oss-120b is for production, general-purpose, high-reasoning use cases.
Olmo-3-7B-Think
Open-source Olmo-3-7B-Think model from allenai, available for download and self-hosting on Hugging Face.
Olmo-3-7B-Think-DPO
Open-source Olmo-3-7B-Think-DPO model from allenai, available for download and self-hosting on Hugging Face.
Phi-4-mini-flash-reasoning
Open-source Phi-4-mini-flash-reasoning model from microsoft, available for download and self-hosting on Hugging Face.
Phi-4-reasoning-plus
Open-source Phi-4-reasoning-plus model from microsoft, available for download and self-hosting on Hugging Face.
Phi-4-reasoning
Open-source Phi-4-reasoning model from microsoft, available for download and self-hosting on Hugging Face.
Phi-4-mini-reasoning
Open-source Phi-4-mini-reasoning model from microsoft, available for download and self-hosting on Hugging Face.
Ministral-3-8B-Reasoning-2512-GGUF
Open-source Ministral-3-8B-Reasoning-2512-GGUF model from mistralai, available for download and self-hosting on Hugging Face.
Ministral-3-3B-Reasoning-2512
Open-source Ministral-3-3B-Reasoning-2512 model from mistralai, available for download and self-hosting on Hugging Face.
Ministral-3-14B-Reasoning-2512
Open-source Ministral-3-14B-Reasoning-2512 model from mistralai, available for download and self-hosting on Hugging Face.
Ministral-3-8B-Reasoning-2512
Open-source Ministral-3-8B-Reasoning-2512 model from mistralai, available for download and self-hosting on Hugging Face.
Qwen3.5-0.8B-SFT-Claude-Opus-Reasoning-GGUF
Open-source Qwen3.5-0.8B-SFT-Claude-Opus-Reasoning-GGUF model from mradermacher, available for download and self-hosting on Hugging Face.
Kimi K2 Thinking
Kimi K2 Thinking is Moonshot AI's most advanced open reasoning model to date, extending the K2 series into agentic, long-horizon reasoning. Built on the trillion-parameter Mixture-of-Experts (MoE) architecture introduced in...
Qwen3 8B
Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math,...
Qwen3 Next 80B A3b Instruct
Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without "thinking" traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual...
Qwen3 30B A3b
Qwen3, the latest generation in the Qwen large language model series, features both dense and mixture-of-experts (MoE) architectures to excel in reasoning, multilingual support, and advanced agent tasks. Its unique...
Qwen QwQ-32B
QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ, which is capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks,...
Qwen3 Next 80B A3b Thinking
Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured "thinking" traces by default. It's designed for hard multi-step problems: math proofs, code synthesis/debugging, logic, and agentic...
Qwen3-VL-8B-Instruct
Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interleaved-MRoPE for long-horizon...
Gemma 3 4b it
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...
Qwen3 235B A22B Thinking 2507 FP8
Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tasks. It activates 22B of its 235B parameters per forward pass and natively supports up to 262,144...
Qwen3-VL-32B-Instruct
Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...
EssentialAI Rnj-1 Instruct
Rnj-1 is an 8B-parameter, dense, open-weight model family developed by Essential AI and trained from scratch with a focus on programming, math, and scientific reasoning. The model demonstrates strong performance...
Qwen/Qwen3-32B
Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for...
Qwen/Qwen3-30B-A3B
Qwen3, the latest generation in the Qwen large language model series, features both dense and mixture-of-experts (MoE) architectures to excel in reasoning, multilingual support, and advanced agent tasks. Its unique...
google/gemma-3-12b-it
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...
Qwen/Qwen3-Max
Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage compared to the January 2025 version. It...
Qwen/Qwen3-Next-80B-A3B-Instruct
Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without "thinking" traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual...
meta-llama/Meta-Llama-3.1-70B-Instruct
Meta Llama 3.1 70B Instruct on DeepInfra: a powerful open-source model for complex reasoning tasks.
