modelstop.top
Home/All Models

AI Model Catalogue

Browse 174 models across providers, modalities, and use cases.

🌐 All Models

174 models · Page 4 of 5

Qwen: Qwen3 Coder 480B A35B (free)

qwen

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model developed by the Qwen team. It is optimized for agentic coding tasks such as function calling, tool use, and long-context reasoning over...

textcodereasoning
262,000 ctx$0.22/1M in
Explore specs and pricingView details →

Qwen: Qwen3 235B A22B Thinking 2507

qwen

Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tasks. It activates 22B of its 235B parameters per forward pass and natively supports up to 262,144...

textreasoningcheap
262,144 ctx$0.15/1M in
Explore specs and pricingView details →

Qwen: Qwen3 30B A3B Instruct 2507

qwen

Qwen3-30B-A3B-Instruct-2507 is a 30.5B-parameter mixture-of-experts language model from Qwen, with 3.3B active parameters per inference. It operates in non-thinking mode and is designed for high-quality instruction following, multilingual understanding, and...

textreasoningmultilingual
262,144 ctx$0.09/1M in
Explore specs and pricingView details →

Anthropic: Claude Opus 4.1

anthropic

Claude Opus 4.1 is an updated version of Anthropic’s flagship model, offering improved performance in coding, reasoning, and agentic tasks. It achieves 74.5% on SWE-bench Verified and shows notable gains...

textvisionmultimodal
200,000 ctx$15.00/1M in
Explore specs and pricingView details →

OpenAI: gpt-oss-120b (free)

openai

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized...

textreasoningagents
131,072 ctx$0.04/1M in
Explore specs and pricingView details →

OpenAI: GPT-5 Nano

openai

GPT-5-Nano is the smallest and fastest variant in the GPT-5 system, optimized for developer tools, rapid interactions, and ultra-low latency environments. While limited in reasoning depth compared to its larger...

textvisionmultimodal
400,000 ctx$0.05/1M in
Explore specs and pricingView details →

OpenAI: GPT-5 Mini

openai

GPT-5 Mini is a compact version of GPT-5, designed to handle lighter-weight reasoning tasks. It provides the same instruction-following and safety-tuning benefits as GPT-5, but with reduced latency and cost....

textvisionmultimodal
400,000 ctx$0.25/1M in
Explore specs and pricingView details →

OpenAI: GPT-5

openai

GPT-5 is OpenAI’s most advanced model, offering major improvements in reasoning, code quality, and user experience. It is optimized for complex tasks that require step-by-step reasoning, instruction following, and accuracy...

textvisionmultimodal
400,000 ctx$1.25/1M in
Explore specs and pricingView details →

DeepSeek: DeepSeek V3.1

deepseek

DeepSeek-V3.1 is a large hybrid reasoning model (671B parameters, 37B active) that supports both thinking and non-thinking modes via prompt templates. It extends the DeepSeek-V3 base with a two-phase long-context...

textreasoningcheap
32,768 ctx$0.15/1M in
Explore specs and pricingView details →

Nous: Hermes 4 405B

nousresearch

Hermes 4 is a large-scale reasoning model built on Meta-Llama-3.1-405B and released by Nous Research. It introduces a hybrid reasoning mode, where the model can choose to deliberate internally with...

textreasoningcheap
131,072 ctx$1.00/1M in
Explore specs and pricingView details →

Nous: Hermes 4 70B

nousresearch

Hermes 4 70B is a hybrid reasoning model from Nous Research, built on Meta-Llama-3.1-70B. It introduces the same hybrid mode as the larger 405B release, allowing the model to either...

textreasoningcheap
131,072 ctx$0.13/1M in
Explore specs and pricingView details →

xAI: Grok Code Fast 1

x-ai

Grok Code Fast 1 is a speedy and economical reasoning model that excels at agentic coding. With reasoning traces visible in the response, developers can steer Grok Code for high-quality...

textcodereasoning
256,000 ctx$0.20/1M in
Explore specs and pricingView details →

Qwen: Qwen3 30B A3B Thinking 2507

qwen

Qwen3-30B-A3B-Thinking-2507 is a 30B parameter Mixture-of-Experts reasoning model optimized for complex tasks requiring extended multi-step thinking. The model is designed specifically for “thinking mode,” where internal reasoning traces are separated...

textreasoningcheap
131,072 ctx$0.08/1M in
Explore specs and pricingView details →

NVIDIA: Nemotron Nano 9B V2 (free)

nvidia

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and...

textreasoningfree
128,000 ctx$0.04/1M in
Explore specs and pricingView details →

Qwen: Qwen Plus 0728 (thinking)

qwen

Qwen Plus 0728, based on the Qwen3 foundation model, is a 1 million context hybrid reasoning model with a balanced performance, speed, and cost combination.

textreasoningcheap
1,000,000 ctx$0.26/1M in
Explore specs and pricingView details →

Qwen: Qwen3 Next 80B A3B Instruct (free)

qwen

Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without “thinking” traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual...

textcodereasoning
262,144 ctx$0.09/1M in
Explore specs and pricingView details →

Qwen: Qwen3 Next 80B A3B Thinking

qwen

Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured “thinking” traces by default. It’s designed for hard multi-step problems; math proofs, code synthesis/debugging, logic, and agentic...

textcodereasoning
131,072 ctx$0.10/1M in
Explore specs and pricingView details →

xAI: Grok 4 Fast

x-ai

Grok 4 Fast is xAI's latest multimodal model with SOTA cost-efficiency and a 2M token context window. It comes in two flavors: non-reasoning and reasoning. Read more about the model...

textvisionmultimodal
2,000,000 ctx$0.20/1M in
Explore specs and pricingView details →

Qwen: Qwen3 Max

qwen

Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage compared to the January 2025 version. It...

textreasoningmultilingual
262,144 ctx$0.78/1M in
Explore specs and pricingView details →

Qwen: Qwen3 VL 235B A22B Thinking

qwen

Qwen3-VL-235B-A22B Thinking is a multimodal model that unifies strong text generation with visual understanding across images and video. The Thinking model is optimized for multimodal reasoning in STEM and math....

textvisionimage
131,072 ctx$0.26/1M in
Explore specs and pricingView details →

Google: Gemini 2.5 Flash Lite Preview 09-2025

google

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

textvisionimage
1,048,576 ctx$0.10/1M in
Explore specs and pricingView details →

OpenAI: GPT-5 Pro

openai

GPT-5 Pro is OpenAI’s most advanced model, offering major improvements in reasoning, code quality, and user experience. It is optimized for complex tasks that require step-by-step reasoning, instruction following, and...

textvisionmultimodal
400,000 ctx$15.00/1M in
Explore specs and pricingView details →

Qwen: Qwen3 VL 30B A3B Thinking

qwen

Qwen3-VL-30B-A3B-Thinking is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Thinking variant enhances reasoning in STEM, math, and complex tasks. It excels...

textvisionimage
131,072 ctx$0.13/1M in
Explore specs and pricingView details →

Baidu: ERNIE 4.5 21B A3B Thinking

baidu

ERNIE-4.5-21B-A3B-Thinking is Baidu's upgraded lightweight MoE model, refined to boost reasoning depth and quality for top-tier performance in logical puzzles, math, science, coding, text generation, and expert-level academic benchmarks.

textcodereasoning
131,072 ctx$0.07/1M in
Explore specs and pricingView details →

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

nvidia

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...

textcodereasoning
131,072 ctx$0.10/1M in
Explore specs and pricingView details →

OpenAI: GPT-5 Image

openai

[GPT-5](https://openrouter.ai/openai/gpt-5) Image combines OpenAI's GPT-5 model with state-of-the-art image generation capabilities. It offers major improvements in reasoning, code quality, and user experience while incorporating GPT Image 1's superior instruction following,...

textvisionimage
400,000 ctx$10.00/1M in
Explore specs and pricingView details →

Qwen: Qwen3 VL 8B Instruct

qwen

Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interleaved-MRoPE for long-horizon...

textvisionmultimodal
131,072 ctx$0.08/1M in
Explore specs and pricingView details →

Qwen: Qwen3 VL 8B Thinking

qwen

Qwen3-VL-8B-Thinking is the reasoning-optimized variant of the Qwen3-VL-8B multimodal model, designed for advanced visual and textual reasoning across complex scenes, documents, and temporal sequences. It integrates enhanced multimodal alignment and...

textvisionmultimodal
131,072 ctx$0.12/1M in
Explore specs and pricingView details →

Qwen: Qwen3 VL 32B Instruct

qwen

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...

textvisionmultimodal
131,072 ctx$0.10/1M in
Explore specs and pricingView details →

MiniMax: MiniMax M2

minimax

MiniMax-M2 is a compact, high-efficiency large language model optimized for end-to-end coding and agentic workflows. With 10 billion activated parameters (230 billion total), it delivers near-frontier intelligence across general reasoning,...

textcodereasoning
196,608 ctx$0.26/1M in
Explore specs and pricingView details →

NVIDIA: Nemotron Nano 12B 2 VL (free)

nvidia

NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s...

textvisionmultimodal
128,000 ctx$0.20/1M in
Explore specs and pricingView details →

OpenAI: gpt-oss-safeguard-20b

openai

gpt-oss-safeguard-20b is a safety reasoning model from OpenAI built upon gpt-oss-20b. This open-weight, 21B-parameter Mixture-of-Experts (MoE) model offers lower latency for safety tasks like content classification, LLM filtering, and trust...

textreasoningcheap
131,072 ctx$0.07/1M in
Explore specs and pricingView details →

Perplexity: Sonar Pro Search

perplexity

Exclusively available on the OpenRouter API, Sonar Pro's new Pro Search mode is Perplexity's most advanced agentic search system. It is designed for deeper reasoning and analysis. Pricing is based...

textvisionmultimodal
200,000 ctx$3.00/1M in
Explore specs and pricingView details →

Amazon: Nova Premier 1.0

amazon

Amazon Nova Premier is the most capable of Amazon’s multimodal models for complex reasoning tasks and for use as the best teacher for distilling custom models.

textvisionmultimodal
1,000,000 ctx$2.50/1M in
Explore specs and pricingView details →

MoonshotAI: Kimi K2 Thinking

moonshotai

Kimi K2 Thinking is Moonshot AI’s most advanced open reasoning model to date, extending the K2 series into agentic, long-horizon reasoning. Built on the trillion-parameter Mixture-of-Experts (MoE) architecture introduced in...

textreasoningagents
262,144 ctx$0.60/1M in
Explore specs and pricingView details →

OpenAI: GPT-5.1 Chat

openai

GPT-5.1 Chat (AKA Instant is the fast, lightweight member of the 5.1 family, optimized for low-latency chat while retaining strong general intelligence. It uses adaptive reasoning to selectively “think” on...

textvisionmultimodal
128,000 ctx$1.25/1M in
Explore specs and pricingView details →