🧠 Reasoning
194 models · Page 2 of 6
Qwen/Qwen3-Max
Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage compared to the January 2025 version. It...
google/gemma-4-31B-it
Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...
Qwen/Qwen3-14B
Qwen3-14B is a dense 14.8B parameter causal language model from the Qwen3 series, designed for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for...
google/gemma-3-27b-it
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...
Qwen/Qwen3-Max-Thinking
Qwen3-Max-Thinking is the flagship reasoning model in the Qwen3 series, designed for high-stakes cognitive tasks that require deep, multi-step reasoning. By significantly scaling model capacity and reinforcement learning compute, it...
Qwen/Qwen3-235B-A22B-Thinking-2507
Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tasks. It activates 22B of its 235B parameters per forward pass and natively supports up to 262,144...
openai/gpt-oss-120b
gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized...
microsoft/phi-4
Microsoft Phi-4 14B: a small language model achieving state-of-the-art results on reasoning tasks.
qwen/qwen3-32b
Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for...
mistral-large-2411
Our top-tier reasoning model for high-complexity tasks, with the latest version released in November 2024.
magistral-medium-2509
Our frontier-class reasoning model (release candidate, September 2025).
openai/gpt-oss-safeguard-20b
gpt-oss-safeguard-20b is a safety reasoning model from OpenAI built upon gpt-oss-20b. This open-weight, 21B-parameter Mixture-of-Experts (MoE) model offers lower latency for safety tasks like content classification, LLM filtering, and trust...
command-a-reasoning-08-2025
magistral-small-2509
Our efficient reasoning model released September 2025.
claude-opus-4.6
Anthropic's most intelligent model with state-of-the-art coding, reasoning, and agentic capabilities
gemini-2.5-flash
Google's hybrid "thinking" AI model optimized for speed and cost-efficiency
seedream-5-lite
Seedream 5.0 lite: image generation with built-in reasoning, example-based editing, and deep domain knowledge
deepseek-r1
A reasoning model trained with reinforcement learning, on par with OpenAI o1
kimi-k2-thinking
Kimi K2 Thinking is the latest, most capable version of an open-source thinking model.
deepseek-v3
DeepSeek-V3-0324 is the leading non-reasoning model, a milestone for open source
gpt-oss-20b-fast
An advanced 20B open-weight reasoning model to customize for any use case and run anywhere.
deepseek-v3.1
Latest hybrid thinking model from DeepSeek
grok-4
Grok 4 is xAI's most advanced reasoning model. Excels at logical thinking and in-depth analysis. Ideal for insightful discussions and complex problem-solving.
gemini-3.1-pro
Google's most intelligent model, with improved reasoning and a new medium thinking level
gpt-5.4
OpenAI's most capable frontier model for complex professional work, coding, and multi-step reasoning.
wan-2.7-image-pro
Generate and edit high-quality images with Alibaba's Wan 2.7 Pro with 4K output, thinking mode, text-to-image, multi-image editing, and image set generation
HyperCLOVAX-SEED-Think-14B-GPTQ
Open-source HyperCLOVAX-SEED-Think-14B-GPTQ model from k-compression, available for download and self-hosting on Hugging Face.
kimi-k2-thinking
kimi-k2-thinking, available to run locally via Ollama on CPU and GPU hardware.
Qwen3-4B-Thinking-2507
Open-source Qwen3-4B-Thinking-2507 model from qwen, available for download and self-hosting on Hugging Face.
Qwen3 235B A22B
Qwen3 235B A22B is Alibaba's flagship mixture-of-experts model with 235B total parameters and 22B active per token. Delivers frontier-level performance on coding, reasoning, and multilingual tasks at significantly lower inference cost.
Falcon 180B
Falcon 180B is one of the largest openly available language models, trained on 3.5 trillion tokens with TII's custom RefinedWeb dataset. Excels at reasoning, summarization, and generation tasks at state-of-the-art quality for open models.
Databricks DBRX Instruct
DBRX Instruct is an open, general-purpose LLM from Databricks. Built with a fine-grained mixture-of-experts (MoE) architecture, it was the most capable open LLM at launch and excels at code, math, and language tasks.
NVIDIA Nemotron-4 340B Instruct
NVIDIA Nemotron-4 340B Instruct is a large open language model trained to generate diverse synthetic data for training other LLMs. Strong at following instructions, classification, and generating reward model training data.
Microsoft Phi-4 Mini
Microsoft Phi-4 Mini is a 3.8B parameter compact model from Microsoft. Delivers impressive reasoning capabilities for edge and mobile deployment scenarios, with strong performance on math and coding tasks relative to its size.
