modelstop.top

AI Model Catalogue

Browse 287 models across providers, modalities, and use cases.


287 models · Page 6 of 8

Qwen: Qwen3 235B A22B Thinking 2507

qwen

Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tasks. It activates 22B of its 235B parameters per forward pass and natively supports up to 262,144...

text · reasoning · cheap
262,144 ctx · $0.15/1M in

Z.ai: GLM 4.5

z-ai

GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128k tokens. GLM-4.5 delivers significantly...

text · agents · cheap
131,072 ctx · $0.60/1M in

Qwen: Qwen3 30B A3B Instruct 2507

qwen

Qwen3-30B-A3B-Instruct-2507 is a 30.5B-parameter mixture-of-experts language model from Qwen, with 3.3B active parameters per inference. It operates in non-thinking mode and is designed for high-quality instruction following, multilingual understanding, and...

text · reasoning · multilingual
262,144 ctx · $0.09/1M in

Qwen: Qwen3 Coder 30B A3B Instruct

qwen

Qwen3-Coder-30B-A3B-Instruct is a 30.5B parameter Mixture-of-Experts (MoE) model with 128 experts (8 active per forward pass), designed for advanced code generation, repository-scale understanding, and agentic tool use. Built on the...

text · code · agents
160,000 ctx · $0.07/1M in

Mistral: Codestral 2508

mistralai

Mistral's cutting-edge language model for coding, released at the end of July 2025. Codestral specializes in low-latency, high-frequency tasks such as fill-in-the-middle (FIM), code correction, and test generation. [Blog Post](https://mistral.ai/news/codestral-25-08)

text · code · cheap
256,000 ctx · $0.30/1M in

OpenAI: GPT-5 Nano

openai

GPT-5-Nano is the smallest and fastest variant in the GPT-5 system, optimized for developer tools, rapid interactions, and ultra-low latency environments. While limited in reasoning depth compared to its larger...

text · vision · multimodal
400,000 ctx · $0.05/1M in

OpenAI: GPT-5 Mini

openai

GPT-5 Mini is a compact version of GPT-5, designed to handle lighter-weight reasoning tasks. It provides the same instruction-following and safety-tuning benefits as GPT-5, but with reduced latency and cost....

text · vision · multimodal
400,000 ctx · $0.25/1M in

Z.ai: GLM 4.5V

z-ai

GLM-4.5V is a vision-language foundation model for multimodal agent applications. Built on a Mixture-of-Experts (MoE) architecture with 106B parameters and 12B activated parameters, it achieves state-of-the-art results in video understanding,...

text · vision · multimodal
65,536 ctx · $0.60/1M in

Baidu: ERNIE 4.5 VL 28B A3B

baidu

A powerful multimodal Mixture-of-Experts chat model featuring 28B total parameters with 3B activated per token, delivering exceptional text and vision understanding through its innovative heterogeneous MoE structure with modality-isolated routing....

text · vision · multimodal
30,000 ctx · $0.14/1M in

Baidu: ERNIE 4.5 21B A3B

baidu

A sophisticated text-based Mixture-of-Experts (MoE) model featuring 21B total parameters with 3B activated per token, delivering exceptional multimodal understanding and generation through heterogeneous MoE structures and modality-isolated routing. Supporting an...

text · vision · cheap
120,000 ctx · $0.07/1M in

Mistral: Mistral Medium 3.1

mistralai

Mistral Medium 3.1 is an updated version of Mistral Medium 3, which is a high-performance enterprise-grade language model designed to deliver frontier-level capabilities at significantly reduced operational cost. It balances...

text · vision · multimodal
131,072 ctx · $0.40/1M in

DeepSeek: DeepSeek V3.1

deepseek

DeepSeek-V3.1 is a large hybrid reasoning model (671B parameters, 37B active) that supports both thinking and non-thinking modes via prompt templates. It extends the DeepSeek-V3 base with a two-phase long-context...

text · reasoning · cheap
32,768 ctx · $0.15/1M in

Nous: Hermes 4 405B

nousresearch

Hermes 4 is a large-scale reasoning model built on Meta-Llama-3.1-405B and released by Nous Research. It introduces a hybrid reasoning mode, where the model can choose to deliberate internally with...

text · reasoning · cheap
131,072 ctx · $1.00/1M in

Nous: Hermes 4 70B

nousresearch

Hermes 4 70B is a hybrid reasoning model from Nous Research, built on Meta-Llama-3.1-70B. It introduces the same hybrid mode as the larger 405B release, allowing the model to either...

text · reasoning · cheap
131,072 ctx · $0.13/1M in

xAI: Grok Code Fast 1

x-ai

Grok Code Fast 1 is a speedy and economical reasoning model that excels at agentic coding. With reasoning traces visible in the response, developers can steer Grok Code for high-quality...

text · code · reasoning
256,000 ctx · $0.20/1M in

Qwen: Qwen3 30B A3B Thinking 2507

qwen

Qwen3-30B-A3B-Thinking-2507 is a 30B parameter Mixture-of-Experts reasoning model optimized for complex tasks requiring extended multi-step thinking. The model is designed specifically for "thinking mode," where internal reasoning traces are separated...

text · reasoning · cheap
131,072 ctx · $0.08/1M in

MoonshotAI: Kimi K2 0905

moonshotai

Kimi K2 0905 is the September update of [Kimi K2 0711](moonshotai/kimi-k2). It is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, featuring 1 trillion total parameters with 32...

text · cheap · long-context
262,144 ctx · $0.40/1M in

Qwen: Qwen Plus 0728 (thinking)

qwen

Qwen Plus 0728, based on the Qwen3 foundation model, is a hybrid reasoning model with a 1 million token context window that balances performance, speed, and cost.

text · reasoning · cheap
1,000,000 ctx · $0.26/1M in

Meituan: LongCat Flash Chat

meituan

LongCat-Flash-Chat is a large-scale Mixture-of-Experts (MoE) model with 560B total parameters, of which 18.6B–31.3B (≈27B on average) are dynamically activated per input. It introduces a shortcut-connected MoE design to reduce...

cheap · long-context
131,072 ctx · $0.20/1M in

Qwen: Qwen3 Next 80B A3B Thinking

qwen

Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured "thinking" traces by default. It's designed for hard multi-step problems: math proofs, code synthesis/debugging, logic, and agentic...

text · code · reasoning
131,072 ctx · $0.10/1M in

Qwen: Qwen3 Coder Flash

qwen

Qwen3 Coder Flash is Alibaba's fast and cost-efficient version of its proprietary Qwen3 Coder Plus. It is a powerful coding agent model specializing in autonomous programming via tool calling...

text · code · agents
1,000,000 ctx · $0.20/1M in

Tongyi DeepResearch 30B A3B

alibaba

Tongyi DeepResearch is an agentic large language model developed by Tongyi Lab, with 30 billion total parameters activating only 3 billion per token. It's optimized for long-horizon, deep information-seeking tasks...

text · agents · cheap
131,072 ctx · $0.09/1M in

xAI: Grok 4 Fast

x-ai

Grok 4 Fast is xAI's latest multimodal model with SOTA cost-efficiency and a 2M token context window. It comes in two flavors: non-reasoning and reasoning. Read more about the model...

text · vision · multimodal
2,000,000 ctx · $0.20/1M in

DeepSeek: DeepSeek V3.1 Terminus

deepseek

DeepSeek-V3.1 Terminus is an update to [DeepSeek V3.1](/deepseek/deepseek-chat-v3.1) that maintains the model's original capabilities while addressing issues reported by users, including language consistency and agent capabilities, further optimizing the model's...

text · agents · cheap
163,840 ctx · $0.21/1M in

Qwen: Qwen3 Coder Plus

qwen

Qwen3 Coder Plus is Alibaba's proprietary version of the open-source Qwen3 Coder 480B A35B. It is a powerful coding agent model specializing in autonomous programming via tool calling and...

text · code · agents
1,000,000 ctx · $0.65/1M in

Qwen: Qwen3 Max

qwen

Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage compared to the January 2025 version. It...

text · reasoning · multilingual
262,144 ctx · $0.78/1M in

Qwen: Qwen3 VL 235B A22B Instruct

qwen

Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text generation with visual understanding across images and video. The Instruct model targets general vision-language use (VQA, document parsing, chart/table...

text · vision · image
262,144 ctx · $0.20/1M in

Qwen: Qwen3 VL 235B A22B Thinking

qwen

Qwen3-VL-235B-A22B Thinking is a multimodal model that unifies strong text generation with visual understanding across images and video. The Thinking model is optimized for multimodal reasoning in STEM and math....

text · vision · image
131,072 ctx · $0.26/1M in

Google: Gemini 2.5 Flash Lite Preview 09-2025

google

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

text · vision · image
1,048,576 ctx · $0.10/1M in

Relace: Relace Apply 3

relace

Relace Apply 3 is a specialized code-patching LLM that merges AI-suggested edits straight into your source files. It can apply updates from GPT-4o, Claude, and others into your files at...

text · code · cheap
256,000 ctx · $0.85/1M in

TheDrummer: Cydonia 24B V4.1

thedrummer

Uncensored and creative writing model based on Mistral Small 3.2 24B with good recall, prompt adherence, and intelligence.

text · cheap · long-context
131,072 ctx · $0.30/1M in

DeepSeek: DeepSeek V3.2 Exp

deepseek

DeepSeek-V3.2-Exp is an experimental large language model released by DeepSeek as an intermediate step between V3.1 and future architectures. It introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism...

text · cheap · long-context
163,840 ctx · $0.27/1M in

Z.ai: GLM 4.6

z-ai

Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex...

text · cheap · long-context
204,800 ctx · $0.39/1M in

Qwen: Qwen3 VL 30B A3B Instruct

qwen

Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It excels in perception...

text · vision · image
131,072 ctx · $0.13/1M in

Qwen: Qwen3 VL 30B A3B Thinking

qwen

Qwen3-VL-30B-A3B-Thinking is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Thinking variant enhances reasoning in STEM, math, and complex tasks. It excels...

text · vision · image
131,072 ctx · $0.13/1M in

Google: Nano Banana (Gemini 2.5 Flash Image)

google

Gemini 2.5 Flash Image, a.k.a. "Nano Banana," is now generally available. It is a state-of-the-art image generation model with contextual understanding. It is capable of image generation,...

text · vision · image
32,768 ctx · $0.30/1M in