modelstop.top
AI Model Catalogue

Browse 408 models across providers, modalities, and use cases.

📄 Long Context

408 models · Page 9 of 12

Qwen: Qwen3 Next 80B A3B Thinking

qwen

Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured “thinking” traces by default. It’s designed for hard multi-step problems: math proofs, code synthesis/debugging, logic, and agentic...

text · code · reasoning
131,072 ctx · $0.10/1M in

Qwen: Qwen3 Coder Flash

qwen

Qwen3 Coder Flash is Alibaba's fast and cost-efficient version of their proprietary Qwen3 Coder Plus. It is a powerful coding agent model specializing in autonomous programming via tool calling...

text · code · agents
1,000,000 ctx · $0.20/1M in

Tongyi DeepResearch 30B A3B

alibaba

Tongyi DeepResearch is an agentic large language model developed by Tongyi Lab, with 30 billion total parameters, of which only 3 billion are activated per token. It's optimized for long-horizon, deep information-seeking tasks...

text · agents · cheap
131,072 ctx · $0.09/1M in

xAI: Grok 4 Fast

x-ai

Grok 4 Fast is xAI's latest multimodal model with SOTA cost-efficiency and a 2M token context window. It comes in two flavors: non-reasoning and reasoning. Read more about the model...

text · vision · multimodal
2,000,000 ctx · $0.20/1M in

DeepSeek: DeepSeek V3.1 Terminus

deepseek

DeepSeek-V3.1 Terminus is an update to [DeepSeek V3.1](/deepseek/deepseek-chat-v3.1) that maintains the model's original capabilities while addressing issues reported by users, including language consistency and agent capabilities, further optimizing the model's...

text · agents · cheap
163,840 ctx · $0.21/1M in

OpenAI: GPT-5 Codex

openai

GPT-5-Codex is a specialized version of GPT-5 optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering tasks....

text · vision · multimodal
400,000 ctx · $1.25/1M in

Qwen: Qwen3 Coder Plus

qwen

Qwen3 Coder Plus is Alibaba's proprietary version of the open-source Qwen3 Coder 480B A35B. It is a powerful coding agent model specializing in autonomous programming via tool calling and...

text · code · agents
1,000,000 ctx · $0.65/1M in

Qwen: Qwen3 Max

qwen

Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage compared to the January 2025 version. It...

text · reasoning · multilingual
262,144 ctx · $0.78/1M in

Qwen: Qwen3 VL 235B A22B Instruct

qwen

Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text generation with visual understanding across images and video. The Instruct model targets general vision-language use (VQA, document parsing, chart/table...

text · vision · image
262,144 ctx · $0.20/1M in

Qwen: Qwen3 VL 235B A22B Thinking

qwen

Qwen3-VL-235B-A22B Thinking is a multimodal model that unifies strong text generation with visual understanding across images and video. The Thinking model is optimized for multimodal reasoning in STEM and math....

text · vision · image
131,072 ctx · $0.26/1M in

Google: Gemini 2.5 Flash Lite Preview 09-2025

google

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

text · vision · image
1,048,576 ctx · $0.10/1M in

Relace: Relace Apply 3

relace

Relace Apply 3 is a specialized code-patching LLM that merges AI-suggested edits straight into your source files. It can apply updates from GPT-4o, Claude, and others into your files at...

text · code · cheap
256,000 ctx · $0.85/1M in

TheDrummer: Cydonia 24B V4.1

thedrummer

Uncensored and creative writing model based on Mistral Small 3.2 24B with good recall, prompt adherence, and intelligence.

text · cheap · long-context
131,072 ctx · $0.30/1M in

DeepSeek: DeepSeek V3.2 Exp

deepseek

DeepSeek-V3.2-Exp is an experimental large language model released by DeepSeek as an intermediate step between V3.1 and future architectures. It introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism...

text · cheap · long-context
163,840 ctx · $0.27/1M in

Anthropic: Claude Sonnet 4.5

anthropic

Claude Sonnet 4.5 is Anthropic’s most advanced Sonnet model to date, optimized for real-world agents and coding workflows. It delivers state-of-the-art performance on coding benchmarks such as SWE-bench Verified, with...

text · vision · multimodal
1,000,000 ctx · $3.00/1M in

Z.ai: GLM 4.6

z-ai

Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex...

text · cheap · long-context
204,800 ctx · $0.39/1M in

OpenAI: GPT-5 Pro

openai

GPT-5 Pro is OpenAI’s most advanced model, offering major improvements in reasoning, code quality, and user experience. It is optimized for complex tasks that require step-by-step reasoning, instruction following, and...

text · vision · multimodal
400,000 ctx · $15.00/1M in

Qwen: Qwen3 VL 30B A3B Instruct

qwen

Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It excels in perception...

text · vision · image
131,072 ctx · $0.13/1M in

Qwen: Qwen3 VL 30B A3B Thinking

qwen

Qwen3-VL-30B-A3B-Thinking is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Thinking variant enhances reasoning in STEM, math, and complex tasks. It excels...

text · vision · image
131,072 ctx · $0.13/1M in

Baidu: ERNIE 4.5 21B A3B Thinking

baidu

ERNIE-4.5-21B-A3B-Thinking is Baidu's upgraded lightweight MoE model, refined to boost reasoning depth and quality for top-tier performance in logical puzzles, math, science, coding, text generation, and expert-level academic benchmarks.

text · code · reasoning
131,072 ctx · $0.07/1M in

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

nvidia

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...

text · code · reasoning
131,072 ctx · $0.10/1M in

OpenAI: o4 Mini Deep Research

openai

o4-mini-deep-research is OpenAI's faster, more affordable deep research model, ideal for tackling complex, multi-step research tasks. Note: This model always uses the 'web_search' tool, which adds cost.

text · vision · multimodal
200,000 ctx · $2.00/1M in

OpenAI: o3 Deep Research

openai

o3-deep-research is OpenAI's advanced model for deep research, designed to tackle complex, multi-step research tasks. Note: This model always uses the 'web_search' tool, which adds cost.

text · vision · multimodal
200,000 ctx · $10.00/1M in

OpenAI: GPT-5 Image

openai

[GPT-5](https://openrouter.ai/openai/gpt-5) Image combines OpenAI's GPT-5 model with state-of-the-art image generation capabilities. It offers major improvements in reasoning, code quality, and user experience while incorporating GPT Image 1's superior instruction following,...

text · vision · image
400,000 ctx · $10.00/1M in

Qwen: Qwen3 VL 8B Instruct

qwen

Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interleaved-MRoPE for long-horizon...

text · vision · multimodal
131,072 ctx · $0.08/1M in

Qwen: Qwen3 VL 8B Thinking

qwen

Qwen3-VL-8B-Thinking is the reasoning-optimized variant of the Qwen3-VL-8B multimodal model, designed for advanced visual and textual reasoning across complex scenes, documents, and temporal sequences. It integrates enhanced multimodal alignment and...

text · vision · multimodal
131,072 ctx · $0.12/1M in

Anthropic: Claude Haiku 4.5

anthropic

Claude Haiku 4.5 is Anthropic’s fastest and most efficient model, delivering near-frontier intelligence at a fraction of the cost and latency of larger Claude models. Matching Claude Sonnet 4’s performance...

text · vision · multimodal
200,000 ctx · $1.00/1M in

OpenAI: GPT-5 Image Mini

openai

GPT-5 Image Mini combines OpenAI's advanced language capabilities, powered by [GPT-5 Mini](https://openrouter.ai/openai/gpt-5-mini), with GPT Image 1 Mini for efficient image generation. This natively multimodal model features superior instruction following, text...

text · vision · image
400,000 ctx · $2.50/1M in

IBM: Granite 4.0 Micro

ibm-granite

Granite-4.0-H-Micro is a 3B-parameter model from the Granite 4 family, the latest in a series of models released by IBM. These models are fine-tuned for long...

text · cheap · long-context
131,000 ctx · $0.02/1M in

Qwen: Qwen3 VL 32B Instruct

qwen

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...

text · vision · multimodal
131,072 ctx · $0.10/1M in

MiniMax: MiniMax M2

minimax

MiniMax-M2 is a compact, high-efficiency large language model optimized for end-to-end coding and agentic workflows. With 10 billion activated parameters (230 billion total), it delivers near-frontier intelligence across general reasoning,...

text · code · reasoning
196,608 ctx · $0.26/1M in

NVIDIA: Nemotron Nano 12B 2 VL (free)

nvidia

NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s...

text · vision · multimodal
128,000 ctx · $0.20/1M in

OpenAI: gpt-oss-safeguard-20b

openai

gpt-oss-safeguard-20b is a safety reasoning model from OpenAI built upon gpt-oss-20b. This open-weight, 21B-parameter Mixture-of-Experts (MoE) model offers lower latency for safety tasks like content classification, LLM filtering, and trust...

text · reasoning · cheap
131,072 ctx · $0.07/1M in

Perplexity: Sonar Pro Search

perplexity

Exclusively available on the OpenRouter API, Sonar Pro's new Pro Search mode is Perplexity's most advanced agentic search system. It is designed for deeper reasoning and analysis. Pricing is based...

text · vision · multimodal
200,000 ctx · $3.00/1M in

Amazon: Nova Premier 1.0

amazon

Amazon Nova Premier is the most capable of Amazon’s multimodal models for complex reasoning tasks and for use as the best teacher for distilling custom models.

text · vision · multimodal
1,000,000 ctx · $2.50/1M in

MoonshotAI: Kimi K2 Thinking

moonshotai

Kimi K2 Thinking is Moonshot AI’s most advanced open reasoning model to date, extending the K2 series into agentic, long-horizon reasoning. Built on the trillion-parameter Mixture-of-Experts (MoE) architecture introduced in...

text · reasoning · agents
262,144 ctx · $0.60/1M in