modelstop.top
Home/All Models

AI Model Catalogue

Browse 454 models across providers, modalities, and use cases.

πŸ“„ Long Context

454 models Β· Page 9 of 13

Google: Gemini 2.5 Flash

google

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...

textvisionmultimodal
Run locally
1,048,576 ctx$0.30/1M in
Explore specs and pricingView details β†’

MiniMax: MiniMax M1

minimax

MiniMax-M1 is a large-scale, open-weight reasoning model designed for extended context and high-efficiency inference. It leverages a hybrid Mixture-of-Experts (MoE) architecture paired with a custom "lightning attention" mechanism, allowing it...

textreasoningcheap
Run locally
1,000,000 ctx$0.40/1M in
Explore specs and pricingView details β†’

Mistral: Mistral Small 3.2 24B

mistralai

Mistral-Small-3.2-24B-Instruct-2506 is an updated 24B parameter model from Mistral optimized for instruction following, repetition reduction, and improved function calling. Compared to the 3.1 release, version 3.2 significantly improves accuracy on...

textvisionmultimodal
Run locally
128,000 ctx$0.07/1M in
Explore specs and pricingView details β†’

Inception: Mercury

inception

Mercury is the first diffusion large language model (dLLM). Applying a breakthrough discrete diffusion approach, the model runs 5-10x faster than even speed optimized models like GPT-4.1 Nano and Claude...

cheaplong-context
128,000 ctx$0.25/1M in
Explore specs and pricingView details β†’

Baidu: ERNIE 4.5 300B A47B

baidu

ERNIE-4.5-300B-A47B is a 300B parameter Mixture-of-Experts (MoE) language model developed by Baidu as part of the ERNIE 4.5 series. It activates 47B parameters per token and supports text generation in...

textcheaplong-context
131,072 ctx$0.28/1M in
Explore specs and pricingView details β†’

Baidu: ERNIE 4.5 VL 424B A47B

baidu

ERNIE-4.5-VL-424B-A47B is a multimodal Mixture-of-Experts (MoE) model from Baidu’s ERNIE 4.5 series, featuring 424B total parameters with 47B active per token. It is trained jointly on text and image data...

textvisionmultimodal
Run locally
131,072 ctx$0.42/1M in
Explore specs and pricingView details β†’

Morph: Morph V3 Large

morph

Morph's high-accuracy apply model for complex code edits. ~4,500 tokens/sec with 98% accuracy for precise code transformations. The model requires the prompt to be in the following format: <instruction>{instruction}</instruction> <code>{initial_code}</code>...

textcodecheap
Run locally
262,144 ctx$0.90/1M in
Explore specs and pricingView details β†’

TNG: DeepSeek R1T2 Chimera

tngtech

DeepSeek-TNG-R1T2-Chimera is the second-generation Chimera model from TNG Tech. It is a 671 B-parameter mixture-of-experts text-generation model assembled from DeepSeek-AI’s R1-0528, R1, and V3-0324 checkpoints with an Assembly-of-Experts merge. The...

textcheaplong-context
163,840 ctx$0.30/1M in
Explore specs and pricingView details β†’

Tencent: Hunyuan A13B Instruct

tencent

Hunyuan-A13B is a 13B active parameter Mixture-of-Experts (MoE) language model developed by Tencent, with a total parameter count of 80B and support for reasoning via Chain-of-Thought. It offers competitive benchmark...

textreasoninginstruct
Run locally
131,072 ctx$0.14/1M in
Explore specs and pricingView details β†’

xAI: Grok 4

x-ai

Grok 4 is xAI's latest reasoning model with a 256k context window. It supports parallel tool calling, structured outputs, and both image and text inputs. Note that reasoning is not...

textvisionmultimodal
256,000 ctx$3.00/1M in
Explore specs and pricingView details β†’

Mistral: Devstral Small 1.1

mistralai

Devstral Small 1.1 is a 24B parameter open-weight language model for software engineering agents, developed by Mistral AI in collaboration with All Hands AI. Finetuned from Mistral Small 3.1 and...

textagentscheap
131,072 ctx$0.10/1M in
Explore specs and pricingView details β†’

Mistral: Devstral Medium

mistralai

Devstral Medium is a high-performance code generation and agentic reasoning model developed jointly by Mistral AI and All Hands AI. Positioned as a step up from Devstral Small, it achieves...

textcodereasoning
131,072 ctx$0.40/1M in
Explore specs and pricingView details β†’

MoonshotAI: Kimi K2 0711

moonshotai

Kimi K2 Instruct is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, featuring 1 trillion total parameters with 32 billion active per forward pass. It is optimized for...

textinstructcheap
Run locally
131,072 ctx$0.57/1M in
Explore specs and pricingView details β†’

Switchpoint Router

switchpoint

Switchpoint AI's router instantly analyzes your request and directs it to the optimal AI from an ever-evolving library. As the world of LLMs advances, our router gets smarter, ensuring you...

textcheaplong-context
Run locally
131,072 ctx$0.85/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 235B A22B Instruct 2507

qwen

Qwen3-235B-A22B-Instruct-2507 is a multilingual, instruction-tuned mixture-of-experts language model based on the Qwen3-235B architecture, with 22B active parameters per forward pass. It is optimized for general-purpose text generation, including instruction following,...

textmultilingualinstruct
262,144 ctx$0.07/1M in
Explore specs and pricingView details β†’

Google: Gemini 2.5 Flash Lite

google

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

textvisionimage
1,048,576 ctx$0.10/1M in
Explore specs and pricingView details β†’

ByteDance: UI-TARS 7B

bytedance

UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement...

textvisionmultimodal
128,000 ctx$0.10/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 Coder 480B A35B (free)

qwen

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model developed by the Qwen team. It is optimized for agentic coding tasks such as function calling, tool use, and long-context reasoning over...

textcodereasoning
Run locally
1,048,576 ctx$0.22/1M in
Explore specs and pricingView details β†’

Z.ai: GLM 4 32B

z-ai

GLM 4 32B is a cost-effective foundation language model. It can efficiently perform complex tasks and has significantly enhanced capabilities in tool use, online search, and code-related intelligent tasks. It...

textcodeagents
128,000 ctx$0.10/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 235B A22B Thinking 2507

qwen

Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tasks. It activates 22B of its 235B parameters per forward pass and natively supports up to 262,144...

textreasoningcheap
262,144 ctx$0.15/1M in
Explore specs and pricingView details β†’

Z.ai: GLM 4.5 Air

z-ai

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter...

textagentscheap
Run locally
131,072 ctx$0.13/1M in
Explore specs and pricingView details β†’

Z.ai: GLM 4.5

z-ai

GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128k tokens. GLM-4.5 delivers significantly...

textagentscheap
131,072 ctx$0.60/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 30B A3B Instruct 2507

qwen

Qwen3-30B-A3B-Instruct-2507 is a 30.5B-parameter mixture-of-experts language model from Qwen, with 3.3B active parameters per inference. It operates in non-thinking mode and is designed for high-quality instruction following, multilingual understanding, and...

textreasoningmultilingual
262,144 ctx$0.09/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 Coder 30B A3B Instruct

qwen

Qwen3-Coder-30B-A3B-Instruct is a 30.5B parameter Mixture-of-Experts (MoE) model with 128 experts (8 active per forward pass), designed for advanced code generation, repository-scale understanding, and agentic tool use. Built on the...

textcodeagents
160,000 ctx$0.07/1M in
Explore specs and pricingView details β†’

Mistral: Codestral 2508

mistralai

Mistral's cutting-edge language model for coding released end of July 2025. Codestral specializes in low-latency, high-frequency tasks such as fill-in-the-middle (FIM), code correction and test generation. [Blog Post](https://mistral.ai/news/codestral-25-08)

textcodecheap
256,000 ctx$0.30/1M in
Explore specs and pricingView details β†’

Anthropic: Claude Opus 4.1

anthropic

Claude Opus 4.1 is an updated version of Anthropic’s flagship model, offering improved performance in coding, reasoning, and agentic tasks. It achieves 74.5% on SWE-bench Verified and shows notable gains...

textvisionmultimodal
Run locally
200,000 ctx$15.00/1M in
Explore specs and pricingView details β†’

OpenAI: gpt-oss-20b (free)

openai

gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 license. It uses a Mixture-of-Experts (MoE) architecture with 3.6B active parameters per forward pass, optimized for...

textfreelong-context
131,072 ctx$0.03/1M in
Explore specs and pricingView details β†’

OpenAI: gpt-oss-120b (free)

openai

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized...

textreasoningagents
131,072 ctx$0.04/1M in
Explore specs and pricingView details β†’

OpenAI: GPT-5 Nano

openai

GPT-5-Nano is the smallest and fastest variant in the GPT-5 system, optimized for developer tools, rapid interactions, and ultra-low latency environments. While limited in reasoning depth compared to its larger...

textvisionmultimodal
Run locally
400,000 ctx$0.05/1M in
Explore specs and pricingView details β†’

OpenAI: GPT-5 Mini

openai

GPT-5 Mini is a compact version of GPT-5, designed to handle lighter-weight reasoning tasks. It provides the same instruction-following and safety-tuning benefits as GPT-5, but with reduced latency and cost....

textvisionmultimodal
400,000 ctx$0.25/1M in
Explore specs and pricingView details β†’

OpenAI: GPT-5

openai

GPT-5 is OpenAI’s most advanced model, offering major improvements in reasoning, code quality, and user experience. It is optimized for complex tasks that require step-by-step reasoning, instruction following, and accuracy...

textvisionmultimodal
400,000 ctx$1.25/1M in
Explore specs and pricingView details β†’

OpenAI: GPT-5 Chat

openai

GPT-5 Chat is designed for advanced, natural, multimodal, and context-aware conversations for enterprise applications.

textvisionmultimodal
128,000 ctx$1.25/1M in
Explore specs and pricingView details β†’

AI21: Jamba Large 1.7

ai21

Jamba Large 1.7 is the latest model in the Jamba open family, offering improvements in grounding, instruction-following, and overall efficiency. Built on a hybrid SSM-Transformer architecture with a 256K context...

textlong-context
Run locally
256,000 ctx$2.00/1M in
Explore specs and pricingView details β†’

Baidu: ERNIE 4.5 VL 28B A3B

baidu

A powerful multimodal Mixture-of-Experts chat model featuring 28B total parameters with 3B activated per token, delivering exceptional text and vision understanding through its innovative heterogeneous MoE structure with modality-isolated routing....

textvisionmultimodal
131,072 ctx$0.14/1M in
Explore specs and pricingView details β†’

Baidu: ERNIE 4.5 21B A3B

baidu

A sophisticated text-based Mixture-of-Experts (MoE) model featuring 21B total parameters with 3B activated per token, delivering exceptional multimodal understanding and generation through heterogeneous MoE structures and modality-isolated routing. Supporting an...

textvisioncheap
120,000 ctx$0.07/1M in
Explore specs and pricingView details β†’

Mistral: Mistral Medium 3.1

mistralai

Mistral Medium 3.1 is an updated version of Mistral Medium 3, which is a high-performance enterprise-grade language model designed to deliver frontier-level capabilities at significantly reduced operational cost. It balances...

textvisionmultimodal
Run locally
131,072 ctx$0.40/1M in
Explore specs and pricingView details β†’