modelstop.top
Home/All Models

AI Model Catalogue

Browse 91 models across providers, modalities, and use cases.

🌐 All Models

91 models Β· Page 2 of 3

ByteDance: UI-TARS 7B

bytedance

UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement...

textvisionmultimodal
128,000 ctx$0.10/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 Coder 480B A35B (free)

qwen

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model developed by the Qwen team. It is optimized for agentic coding tasks such as function calling, tool use, and long-context reasoning over...

textcodereasoning
Run locally
1,048,576 ctx$0.22/1M in
Explore specs and pricingView details β†’

Z.ai: GLM 4 32B

z-ai

GLM 4 32B is a cost-effective foundation language model. It can efficiently perform complex tasks and has significantly enhanced capabilities in tool use, online search, and code-related intelligent tasks. It...

textcodeagents
128,000 ctx$0.10/1M in
Explore specs and pricingView details β†’

Z.ai: GLM 4.5 Air

z-ai

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter...

textagentscheap
Run locally
131,072 ctx$0.13/1M in
Explore specs and pricingView details β†’

Z.ai: GLM 4.5

z-ai

GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128k tokens. GLM-4.5 delivers significantly...

textagentscheap
131,072 ctx$0.60/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 Coder 30B A3B Instruct

qwen

Qwen3-Coder-30B-A3B-Instruct is a 30.5B parameter Mixture-of-Experts (MoE) model with 128 experts (8 active per forward pass), designed for advanced code generation, repository-scale understanding, and agentic tool use. Built on the...

textcodeagents
160,000 ctx$0.07/1M in
Explore specs and pricingView details β†’

Anthropic: Claude Opus 4.1

anthropic

Claude Opus 4.1 is an updated version of Anthropic’s flagship model, offering improved performance in coding, reasoning, and agentic tasks. It achieves 74.5% on SWE-bench Verified and shows notable gains...

textvisionmultimodal
Run locally
200,000 ctx$15.00/1M in
Explore specs and pricingView details β†’

OpenAI: gpt-oss-120b (free)

openai

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized...

textreasoningagents
131,072 ctx$0.04/1M in
Explore specs and pricingView details β†’

Z.ai: GLM 4.5V

z-ai

GLM-4.5V is a vision-language foundation model for multimodal agent applications. Built on a Mixture-of-Experts (MoE) architecture with 106B parameters and 12B activated parameters, it achieves state-of-the-art results in video understanding,...

textvisionmultimodal
65,536 ctx$0.60/1M in
Explore specs and pricingView details β†’

xAI: Grok Code Fast 1

x-ai

Grok Code Fast 1 is a speedy and economical reasoning model that excels at agentic coding. With reasoning traces visible in the response, developers can steer Grok Code for high-quality...

textcodereasoning
256,000 ctx$0.20/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 Next 80B A3B Thinking

qwen

Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured β€œthinking” traces by default. It’s designed for hard multi-step problems; math proofs, code synthesis/debugging, logic, and agentic...

textcodereasoning
131,072 ctx$0.10/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 Coder Flash

qwen

Qwen3 Coder Flash is Alibaba's fast and cost efficient version of their proprietary Qwen3 Coder Plus. It is a powerful coding agent model specializing in autonomous programming via tool calling...

textcodeagents
1,000,000 ctx$0.20/1M in
Explore specs and pricingView details β†’

Tongyi DeepResearch 30B A3B

alibaba

Tongyi DeepResearch is an agentic large language model developed by Tongyi Lab, with 30 billion total parameters activating only 3 billion per token. It's optimized for long-horizon, deep information-seeking tasks...

textagentscheap
131,072 ctx$0.09/1M in
Explore specs and pricingView details β†’

DeepSeek: DeepSeek V3.1 Terminus

deepseek

DeepSeek-V3.1 Terminus is an update to [DeepSeek V3.1](/deepseek/deepseek-chat-v3.1) that maintains the model's original capabilities while addressing issues reported by users, including language consistency and agent capabilities, further optimizing the model's...

textagentscheap
163,840 ctx$0.21/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 Coder Plus

qwen

Qwen3 Coder Plus is Alibaba's proprietary version of the Open Source Qwen3 Coder 480B A35B. It is a powerful coding agent model specializing in autonomous programming via tool calling and...

textcodeagents
1,000,000 ctx$0.65/1M in
Explore specs and pricingView details β†’

Anthropic: Claude Sonnet 4.5

anthropic

Claude Sonnet 4.5 is Anthropic’s most advanced Sonnet model to date, optimized for real-world agents and coding workflows. It delivers state-of-the-art performance on coding benchmarks such as SWE-bench Verified, with...

textvisionmultimodal
Run locally
1,000,000 ctx$3.00/1M in
Explore specs and pricingView details β†’

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

nvidia

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...

textcodereasoning
131,072 ctx$0.10/1M in
Explore specs and pricingView details β†’

MiniMax: MiniMax M2

minimax

MiniMax-M2 is a compact, high-efficiency large language model optimized for end-to-end coding and agentic workflows. With 10 billion activated parameters (230 billion total), it delivers near-frontier intelligence across general reasoning,...

textcodereasoning
Run locally
204,800 ctx$0.26/1M in
Explore specs and pricingView details β†’

Perplexity: Sonar Pro Search

perplexity

Exclusively available on the OpenRouter API, Sonar Pro's new Pro Search mode is Perplexity's most advanced agentic search system. It is designed for deeper reasoning and analysis. Pricing is based...

textvisionmultimodal
200,000 ctx$3.00/1M in
Explore specs and pricingView details β†’

MoonshotAI: Kimi K2 Thinking

moonshotai

Kimi K2 Thinking is Moonshot AI’s most advanced open reasoning model to date, extending the K2 series into agentic, long-horizon reasoning. Built on the trillion-parameter Mixture-of-Experts (MoE) architecture introduced in...

textreasoningagents
262,144 ctx$0.60/1M in
Explore specs and pricingView details β†’

xAI: Grok 4.1 Fast

x-ai

Grok 4.1 Fast is xAI's best agentic tool calling model that shines in real-world use cases like customer support and deep research. 2M context window. Reasoning can be enabled/disabled using...

textvisionmultimodal
2,000,000 ctx$0.20/1M in
Explore specs and pricingView details β†’

Anthropic: Claude Opus 4.5

anthropic

Claude Opus 4.5 is Anthropic’s frontier reasoning model optimized for complex software engineering, agentic workflows, and long-horizon computer use. It offers strong multimodal capabilities, competitive performance across real-world coding and...

textvisionmultimodal
200,000 ctx$5.00/1M in
Explore specs and pricingView details β†’

DeepSeek: DeepSeek V3.2

deepseek

DeepSeek-V3.2 is a large language model designed to harmonize high computational efficiency with strong reasoning and agentic tool-use performance. It introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism...

textreasoningagents
163,840 ctx$0.26/1M in
Explore specs and pricingView details β†’

DeepSeek: DeepSeek V3.2 Speciale

deepseek

DeepSeek-V3.2-Speciale is a high-compute variant of DeepSeek-V3.2 optimized for maximum reasoning and agentic performance. It builds on DeepSeek Sparse Attention (DSA) for efficient long-context processing, then scales post-training reinforcement learning...

textreasoningagents
163,840 ctx$0.40/1M in
Explore specs and pricingView details β†’

OpenAI: GPT-5.1-Codex-Max

openai

GPT-5.1-Codex-Max is OpenAI’s latest agentic coding model, designed for long-running, high-context software development tasks. It is based on an updated version of the 5.1 reasoning stack and trained on agentic...

textvisionmultimodal
400,000 ctx$1.25/1M in
Explore specs and pricingView details β†’

Nex AGI: DeepSeek V3.1 Nex N1

nex-agi

DeepSeek V3.1 Nex-N1 is the flagship release of the Nex-N1 series β€” a post-trained model designed to highlight agent autonomy, tool use, and real-world productivity. Nex-N1 demonstrates competitive performance across...

textagentscheap
131,072 ctx$0.14/1M in
Explore specs and pricingView details β†’

Relace: Relace Search

relace

The relace-search model uses 4-12 `view_file` and `grep` tools in parallel to explore a codebase and return relevant files to the user request. In contrast to RAG, relace-search performs agentic...

textagentscheap
256,000 ctx$1.00/1M in
Explore specs and pricingView details β†’

Mistral: Devstral 2 2512

mistralai

Devstral 2 is a state-of-the-art open-source model by Mistral AI specializing in agentic coding. It is a 123B-parameter dense transformer model supporting a 256K context window. Devstral 2 supports exploring...

textcodeagents
262,144 ctx$0.40/1M in
Explore specs and pricingView details β†’

OpenAI: GPT-5.2

openai

GPT-5.2 is the latest frontier-grade model in the GPT-5 series, offering stronger agentic and long context perfomance compared to GPT-5.1. It uses adaptive reasoning to allocate computation dynamically, responding quickly...

textvisionmultimodal
400,000 ctx$1.75/1M in
Explore specs and pricingView details β†’

OpenAI: GPT-5.2 Pro

openai

GPT-5.2 Pro is OpenAI’s most advanced model, offering major improvements in agentic coding and long context performance over GPT-5 Pro. It is optimized for complex tasks that require step-by-step reasoning,...

textvisionmultimodal
400,000 ctx$21.00/1M in
Explore specs and pricingView details β†’

NVIDIA: Nemotron 3 Nano 30B A3B (free)

nvidia

NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...

textagentsfree
256,000 ctx$0.05/1M in
Explore specs and pricingView details β†’

Mistral: Mistral Small Creative

mistralai

Mistral Small Creative is an experimental small model designed for creative writing, narrative generation, roleplay and character-driven dialogue, general-purpose instruction following, and conversational agents.

textagentscheap
32,768 ctx$0.10/1M in
Explore specs and pricingView details β†’

Google: Gemini 3 Flash Preview

google

Gemini 3 Flash Preview is a high speed, high value thinking model designed for agentic workflows, multi turn chat, and coding assistance. It delivers near Pro level reasoning and tool...

textvisionmultimodal
1,048,576 ctx$0.50/1M in
Explore specs and pricingView details β†’

Z.ai: GLM 4.7

z-ai

GLM-4.7 is Z.ai’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution. It demonstrates significant improvements in executing complex agent tasks while...

textreasoningagents
202,752 ctx$0.39/1M in
Explore specs and pricingView details β†’

MiniMax: MiniMax M2.1

minimax

MiniMax-M2.1 is a lightweight, state-of-the-art large language model optimized for coding, agentic workflows, and modern application development. With only 10 billion activated parameters, it delivers a major jump in real-world...

textcodeagents
196,608 ctx$0.29/1M in
Explore specs and pricingView details β†’

Z.ai: GLM 4.7 Flash

z-ai

As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning,...

textcodeagents
202,752 ctx$0.06/1M in
Explore specs and pricingView details β†’