modelstop.top
Home/All Models

AI Model Catalogue

Browse 408 models across providers, modalities, and use cases.

πŸ“„ Long Context

408 models Β· Page 8 of 12

xAI: Grok 4

x-ai

Grok 4 is xAI's latest reasoning model with a 256k context window. It supports parallel tool calling, structured outputs, and both image and text inputs. Note that reasoning is not...

textvisionmultimodal
256,000 ctx$3.00/1M in
Explore specs and pricingView details β†’

Mistral: Devstral Small 1.1

mistralai

Devstral Small 1.1 is a 24B parameter open-weight language model for software engineering agents, developed by Mistral AI in collaboration with All Hands AI. Finetuned from Mistral Small 3.1 and...

textagentscheap
131,072 ctx$0.10/1M in
Explore specs and pricingView details β†’

Mistral: Devstral Medium

mistralai

Devstral Medium is a high-performance code generation and agentic reasoning model developed jointly by Mistral AI and All Hands AI. Positioned as a step up from Devstral Small, it achieves...

textcodereasoning
131,072 ctx$0.40/1M in
Explore specs and pricingView details β†’

MoonshotAI: Kimi K2 0711

moonshotai

Kimi K2 Instruct is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, featuring 1 trillion total parameters with 32 billion active per forward pass. It is optimized for...

textinstructcheap
131,072 ctx$0.57/1M in
Explore specs and pricingView details β†’

Switchpoint Router

switchpoint

Switchpoint AI's router instantly analyzes your request and directs it to the optimal AI from an ever-evolving library. As the world of LLMs advances, our router gets smarter, ensuring you...

textcheaplong-context
131,072 ctx$0.85/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 235B A22B Instruct 2507

qwen

Qwen3-235B-A22B-Instruct-2507 is a multilingual, instruction-tuned mixture-of-experts language model based on the Qwen3-235B architecture, with 22B active parameters per forward pass. It is optimized for general-purpose text generation, including instruction following,...

textmultilingualinstruct
262,144 ctx$0.07/1M in
Explore specs and pricingView details β†’

Google: Gemini 2.5 Flash Lite

google

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

textvisionimage
1,048,576 ctx$0.10/1M in
Explore specs and pricingView details β†’

ByteDance: UI-TARS 7B

bytedance

UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement...

textvisionmultimodal
128,000 ctx$0.10/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 Coder 480B A35B (free)

qwen

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model developed by the Qwen team. It is optimized for agentic coding tasks such as function calling, tool use, and long-context reasoning over...

textcodereasoning
262,000 ctx$0.22/1M in
Explore specs and pricingView details β†’

Z.ai: GLM 4 32B

z-ai

GLM 4 32B is a cost-effective foundation language model. It can efficiently perform complex tasks and has significantly enhanced capabilities in tool use, online search, and code-related intelligent tasks. It...

textcodeagents
128,000 ctx$0.10/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 235B A22B Thinking 2507

qwen

Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tasks. It activates 22B of its 235B parameters per forward pass and natively supports up to 262,144...

textreasoningcheap
262,144 ctx$0.15/1M in
Explore specs and pricingView details β†’

Z.ai: GLM 4.5 Air (free)

z-ai

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter...

textagentsfree
131,072 ctx$0.13/1M in
Explore specs and pricingView details β†’

Z.ai: GLM 4.5

z-ai

GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128k tokens. GLM-4.5 delivers significantly...

textagentscheap
131,072 ctx$0.60/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 30B A3B Instruct 2507

qwen

Qwen3-30B-A3B-Instruct-2507 is a 30.5B-parameter mixture-of-experts language model from Qwen, with 3.3B active parameters per inference. It operates in non-thinking mode and is designed for high-quality instruction following, multilingual understanding, and...

textreasoningmultilingual
262,144 ctx$0.09/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 Coder 30B A3B Instruct

qwen

Qwen3-Coder-30B-A3B-Instruct is a 30.5B parameter Mixture-of-Experts (MoE) model with 128 experts (8 active per forward pass), designed for advanced code generation, repository-scale understanding, and agentic tool use. Built on the...

textcodeagents
160,000 ctx$0.07/1M in
Explore specs and pricingView details β†’

Mistral: Codestral 2508

mistralai

Mistral's cutting-edge language model for coding released end of July 2025. Codestral specializes in low-latency, high-frequency tasks such as fill-in-the-middle (FIM), code correction and test generation. [Blog Post](https://mistral.ai/news/codestral-25-08)

textcodecheap
256,000 ctx$0.30/1M in
Explore specs and pricingView details β†’

Anthropic: Claude Opus 4.1

anthropic

Claude Opus 4.1 is an updated version of Anthropic’s flagship model, offering improved performance in coding, reasoning, and agentic tasks. It achieves 74.5% on SWE-bench Verified and shows notable gains...

textvisionmultimodal
200,000 ctx$15.00/1M in
Explore specs and pricingView details β†’

OpenAI: gpt-oss-20b (free)

openai

gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 license. It uses a Mixture-of-Experts (MoE) architecture with 3.6B active parameters per forward pass, optimized for...

textfreelong-context
131,072 ctx$0.03/1M in
Explore specs and pricingView details β†’

OpenAI: gpt-oss-120b (free)

openai

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized...

textreasoningagents
131,072 ctx$0.04/1M in
Explore specs and pricingView details β†’

OpenAI: GPT-5 Nano

openai

GPT-5-Nano is the smallest and fastest variant in the GPT-5 system, optimized for developer tools, rapid interactions, and ultra-low latency environments. While limited in reasoning depth compared to its larger...

textvisionmultimodal
400,000 ctx$0.05/1M in
Explore specs and pricingView details β†’

OpenAI: GPT-5 Mini

openai

GPT-5 Mini is a compact version of GPT-5, designed to handle lighter-weight reasoning tasks. It provides the same instruction-following and safety-tuning benefits as GPT-5, but with reduced latency and cost....

textvisionmultimodal
400,000 ctx$0.25/1M in
Explore specs and pricingView details β†’

OpenAI: GPT-5

openai

GPT-5 is OpenAI’s most advanced model, offering major improvements in reasoning, code quality, and user experience. It is optimized for complex tasks that require step-by-step reasoning, instruction following, and accuracy...

textvisionmultimodal
400,000 ctx$1.25/1M in
Explore specs and pricingView details β†’

OpenAI: GPT-5 Chat

openai

GPT-5 Chat is designed for advanced, natural, multimodal, and context-aware conversations for enterprise applications.

textvisionmultimodal
128,000 ctx$1.25/1M in
Explore specs and pricingView details β†’

AI21: Jamba Large 1.7

ai21

Jamba Large 1.7 is the latest model in the Jamba open family, offering improvements in grounding, instruction-following, and overall efficiency. Built on a hybrid SSM-Transformer architecture with a 256K context...

textlong-context
256,000 ctx$2.00/1M in
Explore specs and pricingView details β†’

Baidu: ERNIE 4.5 21B A3B

baidu

A sophisticated text-based Mixture-of-Experts (MoE) model featuring 21B total parameters with 3B activated per token, delivering exceptional multimodal understanding and generation through heterogeneous MoE structures and modality-isolated routing. Supporting an...

textvisioncheap
120,000 ctx$0.07/1M in
Explore specs and pricingView details β†’

Mistral: Mistral Medium 3.1

mistralai

Mistral Medium 3.1 is an updated version of Mistral Medium 3, which is a high-performance enterprise-grade language model designed to deliver frontier-level capabilities at significantly reduced operational cost. It balances...

textvisionmultimodal
131,072 ctx$0.40/1M in
Explore specs and pricingView details β†’

OpenAI: GPT-4o Audio

openai

The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs...

textaudiolong-context
128,000 ctx$2.50/1M in
Explore specs and pricingView details β†’

Nous: Hermes 4 405B

nousresearch

Hermes 4 is a large-scale reasoning model built on Meta-Llama-3.1-405B and released by Nous Research. It introduces a hybrid reasoning mode, where the model can choose to deliberate internally with...

textreasoningcheap
131,072 ctx$1.00/1M in
Explore specs and pricingView details β†’

Nous: Hermes 4 70B

nousresearch

Hermes 4 70B is a hybrid reasoning model from Nous Research, built on Meta-Llama-3.1-70B. It introduces the same hybrid mode as the larger 405B release, allowing the model to either...

textreasoningcheap
131,072 ctx$0.13/1M in
Explore specs and pricingView details β†’

xAI: Grok Code Fast 1

x-ai

Grok Code Fast 1 is a speedy and economical reasoning model that excels at agentic coding. With reasoning traces visible in the response, developers can steer Grok Code for high-quality...

textcodereasoning
256,000 ctx$0.20/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 30B A3B Thinking 2507

qwen

Qwen3-30B-A3B-Thinking-2507 is a 30B parameter Mixture-of-Experts reasoning model optimized for complex tasks requiring extended multi-step thinking. The model is designed specifically for β€œthinking mode,” where internal reasoning traces are separated...

textreasoningcheap
131,072 ctx$0.08/1M in
Explore specs and pricingView details β†’

MoonshotAI: Kimi K2 0905

moonshotai

Kimi K2 0905 is the September update of [Kimi K2 0711](moonshotai/kimi-k2). It is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, featuring 1 trillion total parameters with 32...

textcheaplong-context
262,144 ctx$0.40/1M in
Explore specs and pricingView details β†’

NVIDIA: Nemotron Nano 9B V2 (free)

nvidia

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and...

textreasoningfree
128,000 ctx$0.04/1M in
Explore specs and pricingView details β†’

Qwen: Qwen Plus 0728 (thinking)

qwen

Qwen Plus 0728, based on the Qwen3 foundation model, is a 1 million context hybrid reasoning model with a balanced performance, speed, and cost combination.

textreasoningcheap
1,000,000 ctx$0.26/1M in
Explore specs and pricingView details β†’

Meituan: LongCat Flash Chat

meituan

LongCat-Flash-Chat is a large-scale Mixture-of-Experts (MoE) model with 560B total parameters, of which 18.6B–31.3B (β‰ˆ27B on average) are dynamically activated per input. It introduces a shortcut-connected MoE design to reduce...

cheaplong-context
131,072 ctx$0.20/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 Next 80B A3B Instruct (free)

qwen

Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without β€œthinking” traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual...

textcodereasoning
262,144 ctx$0.09/1M in
Explore specs and pricingView details β†’