modelstop.top
Home/All Models

AI Model Catalogue

Browse 194 models across providers, modalities, and use cases.

🌐 All Models

194 models Β· Page 1 of 6

Qwen3.6-35B-A3B-Claude-4.7-Opus-Reasoning-Distilled-GGUF

mradermacher

Open-source Qwen3.6-35B-A3B-Claude-4.7-Opus-Reasoning-Distilled-GGUF model from mradermacher β€” available for download and self-hosting on Hugging Face.

textreasoningfree
ctx$0.00/1M in
Explore specs and pricingView details β†’

qwen3-30b-a3b-fp8

qwen

Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groundbreaking advancements in reasoning, instruction-following, agent capabilities, and multilingual support.

textreasoningagents
32,768 ctx$0.05/1M in
Explore specs and pricingView details β†’

qwq-32b

qwen

QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ, which is capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks, especially hard problems. QwQ-32B is the medium-sized reasoning model, which is capable of achieving competitive performance against state-of-the-art reasoning models, e.g., DeepSeek-R1, o1-mini.

textreasoningcheap
24,000 ctx$0.66/1M in
Explore specs and pricingView details β†’

gemma-3-12b-it

google

Gemma 3 models are well-suited for a variety of text generation and image understanding tasks, including question answering, summarization, and reasoning. Gemma 3 models are multimodal, handling text and image input and generating text output, with a large, 128K context window, multilingual support in over 140 languages, and is available in more sizes than previous versions.

textvisionreasoning
80,000 ctx$0.35/1M in
Explore specs and pricingView details β†’

gpt-oss-20b

openai

OpenAI’s open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases – gpt-oss-20b is for lower latency, and local or specialized use-cases.

textreasoningagents
128,000 ctx$0.20/1M in
Explore specs and pricingView details β†’

llama-3.2-11b-vision-instruct

meta

The Llama 3.2-Vision instruction-tuned models are optimized for visual recognition, image reasoning, captioning, and answering general questions about an image.

textvisionreasoning
128,000 ctx$0.05/1M in
Explore specs and pricingView details β†’

llama-3-8b-instruct

meta

Generation over generation, Meta Llama 3 demonstrates state-of-the-art performance on a wide range of industry benchmarks and offers new capabilities, including improved reasoning.

textreasoninginstruct
7,968 ctx$0.28/1M in
Explore specs and pricingView details β†’

gpt-oss-120b

openai

OpenAI’s open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases – gpt-oss-120b is for production, general purpose, high reasoning use-cases.

textreasoningagents
128,000 ctx$0.35/1M in
Explore specs and pricingView details β†’

Olmo-3-7B-Think

allenai

Open-source Olmo-3-7B-Think model from allenai β€” available for download and self-hosting on Hugging Face.

textreasoningfree
ctxFree in
Explore specs and pricingView details β†’

Olmo-3-7B-Think-DPO

allenai

Open-source Olmo-3-7B-Think-DPO model from allenai β€” available for download and self-hosting on Hugging Face.

textreasoningfree
ctxFree in
Explore specs and pricingView details β†’

Phi-4-mini-flash-reasoning

microsoft

Open-source Phi-4-mini-flash-reasoning model from microsoft β€” available for download and self-hosting on Hugging Face.

textreasoningfree
Run locally
ctxFree in
Explore specs and pricingView details β†’

Phi-4-reasoning-plus

microsoft

Open-source Phi-4-reasoning-plus model from microsoft β€” available for download and self-hosting on Hugging Face.

textreasoningfree
ctxFree in
Explore specs and pricingView details β†’

Phi-4-reasoning

microsoft

Open-source Phi-4-reasoning model from microsoft β€” available for download and self-hosting on Hugging Face.

textreasoningfree
Run locally
ctxFree in
Explore specs and pricingView details β†’

Phi-4-mini-reasoning

microsoft

Open-source Phi-4-mini-reasoning model from microsoft β€” available for download and self-hosting on Hugging Face.

textreasoningfree
Run locally
ctxFree in
Explore specs and pricingView details β†’

Ministral-3-8B-Reasoning-2512-GGUF

mistralai

Open-source Ministral-3-8B-Reasoning-2512-GGUF model from mistralai β€” available for download and self-hosting on Hugging Face.

textreasoningfree
ctxFree in
Explore specs and pricingView details β†’

Ministral-3-3B-Reasoning-2512

mistralai

Open-source Ministral-3-3B-Reasoning-2512 model from mistralai β€” available for download and self-hosting on Hugging Face.

textreasoningfree
Run locally
ctxFree in
Explore specs and pricingView details β†’

Ministral-3-14B-Reasoning-2512

mistralai

Open-source Ministral-3-14B-Reasoning-2512 model from mistralai β€” available for download and self-hosting on Hugging Face.

textreasoningfree
ctxFree in
Explore specs and pricingView details β†’

Ministral-3-8B-Reasoning-2512

mistralai

Open-source Ministral-3-8B-Reasoning-2512 model from mistralai β€” available for download and self-hosting on Hugging Face.

textreasoningfree
ctxFree in
Explore specs and pricingView details β†’

Qwen3.5-0.8B-SFT-Claude-Opus-Reasoning-GGUF

mradermacher

Open-source Qwen3.5-0.8B-SFT-Claude-Opus-Reasoning-GGUF model from mradermacher β€” available for download and self-hosting on Hugging Face.

textreasoningfree
ctxFree in
Explore specs and pricingView details β†’

Kimi K2 Thinking

moonshotai

Kimi K2 Thinking is Moonshot AI’s most advanced open reasoning model to date, extending the K2 series into agentic, long-horizon reasoning. Built on the trillion-parameter Mixture-of-Experts (MoE) architecture introduced in...

textreasoningagents
262,144 ctx$1.20/1M in
Explore specs and pricingView details β†’

Qwen3 8B

qwen

Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math,...

textreasoningcheap
40,960 ctxFree in
Explore specs and pricingView details β†’

Qwen3 Next 80B A3b Instruct

qwen

Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without β€œthinking” traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual...

textcodereasoning
262,144 ctx$0.15/1M in
Explore specs and pricingView details β†’

Qwen3 30B A3b

qwen

Qwen3, the latest generation in the Qwen large language model series, features both dense and mixture-of-experts (MoE) architectures to excel in reasoning, multilingual support, and advanced agent tasks. Its unique...

textreasoningagents
40,960 ctxFree in
Explore specs and pricingView details β†’

Qwen QwQ-32B

qwen

QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ, which is capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks,...

textreasoninglong-context
131,072 ctx$1.20/1M in
Explore specs and pricingView details β†’

Qwen3 Next 80B A3b Thinking

qwen

Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured β€œthinking” traces by default. It’s designed for hard multi-step problems; math proofs, code synthesis/debugging, logic, and agentic...

textcodereasoning
262,144 ctx$0.15/1M in
Explore specs and pricingView details β†’

Qwen3-VL-8B-Instruct

qwen

Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interleaved-MRoPE for long-horizon...

textvisionreasoning
262,144 ctx$0.18/1M in
Explore specs and pricingView details β†’

Gemma 3 4b it

google

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

textvisionreasoning
65,536 ctxFree in
Explore specs and pricingView details β†’

Qwen3 235B A22B Thinking 2507 FP8

qwen

Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tasks. It activates 22B of its 235B parameters per forward pass and natively supports up to 262,144...

textreasoningcheap
262,144 ctx$0.65/1M in
Explore specs and pricingView details β†’

Qwen3-VL-32B-Instruct

qwen

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...

textvisionreasoning
262,144 ctx$0.50/1M in
Explore specs and pricingView details β†’

EssentialAI Rnj-1 Instruct

essentialai

Rnj-1 is an 8B-parameter, dense, open-weight model family developed by Essential AI and trained from scratch with a focus on programming, math, and scientific reasoning. The model demonstrates strong performance...

textreasoninginstruct
32,768 ctx$0.15/1M in
Explore specs and pricingView details β†’

Qwen/Qwen3-32B

deepinfra

Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for...

textreasoningcheap
40,960 ctxFree in
Explore specs and pricingView details β†’

Qwen/Qwen3-30B-A3B

deepinfra

Qwen3, the latest generation in the Qwen large language model series, features both dense and mixture-of-experts (MoE) architectures to excel in reasoning, multilingual support, and advanced agent tasks. Its unique...

textreasoningagents
40,960 ctxFree in
Explore specs and pricingView details β†’

google/gemma-3-12b-it

deepinfra

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

textvisionreasoning
32,768 ctxFree in
Explore specs and pricingView details β†’

Qwen/Qwen3-Max

deepinfra

Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage compared to the January 2025 version. It...

textreasoningmultilingual
262,144 ctxFree in
Explore specs and pricingView details β†’

Qwen/Qwen3-Next-80B-A3B-Instruct

deepinfra

Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without β€œthinking” traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual...

textcodereasoning
Run locally
262,144 ctxFree in
Explore specs and pricingView details β†’

meta-llama/Meta-Llama-3.1-70B-Instruct

deepinfra

Meta Llama 3.1 70B Instruct on DeepInfra β€” powerful open-source model for complex reasoning tasks.

textreasoninginstruct
ctxFree in
Explore specs and pricingView details β†’