Playground Find a Model ⚡ Pro Tools Pulse API Advertise PricingLoading...

Loading...

The most comprehensive directory of AI models, providers, and agents. Updated daily.

Explore

All Models
Collections
Leaderboard
Compare
Pro Tools
Pulse Feed
API Docs

Categories

Language Models
Inference Providers
Agents & SaaS
Open Source

Stay Updated

Weekly digest of new models and price changes.

Business contact

Support: support@modelstop.top

Enquiries: hello@modelstop.top

Billing: billing@modelstop.top

Privacy: privacy@modelstop.top

Legal: legal@modelstop.top

© 2026 modelstop.top. All rights reserved.Updated daily · 4695+ models indexed

Home/All Models

AI Model Catalogue

Browse 353 models across providers, modalities, and use cases.

🌐All Models 💬Text Generation 💻Code & Reasoning 👁️Vision & Multimodal 🎨Image Generation 🎙️Audio & Speech 🤖Agents & Tools 📄Long Context 🆓Free & Open

🧠

Reasoning

🌍Multilingual

Providers:⚡OpenAI 🔷Anthropic 🔍Google 🦙Meta 🌀Mistral ✕xAI 🚀Groq 🐋DeepSeek 🌐Cohere ☁️Amazon

Filter & Sort

🌐 All Models

353 models · Page 8 of 10

Baidu: ERNIE 4.5 VL 28B A3B

A powerful multimodal Mixture-of-Experts chat model featuring 28B total parameters with 3B activated per token, delivering exceptional text and vision understanding through its innovative heterogeneous MoE structure with modality-isolated routing....

textvisionmultimodal

Input$0.1400/1M

Output$0.5600/1M

📏131kcontext

Explore specs and pricingView details →

Baidu: ERNIE 4.5 21B A3B

A sophisticated text-based Mixture-of-Experts (MoE) model featuring 21B total parameters with 3B activated per token, delivering exceptional multimodal understanding and generation through heterogeneous MoE structures and modality-isolated routing. Supporting an...

textvisioncheap

Input$0.0700/1M

Output$0.2800/1M

📏120kcontext

Explore specs and pricingView details →

Mistral: Mistral Medium 3.1

Mistral Medium 3.1 is an updated version of Mistral Medium 3, which is a high-performance enterprise-grade language model designed to deliver frontier-level capabilities at significantly reduced operational cost. It balances...

textvisionmultimodal

Input$0.4000/1M

Output$2.0000/1M

📏131kcontext

Explore specs and pricingView details →

DeepSeek: DeepSeek V3.1

DeepSeek-V3.1 is a large hybrid reasoning model (671B parameters, 37B active) that supports both thinking and non-thinking modes via prompt templates. It extends the DeepSeek-V3 base with a two-phase long-context...

textreasoningcheap

Input$0.1500/1M

Output$0.7500/1M

📏164kcontext

Explore specs and pricingView details →

Nous: Hermes 4 405B

Hermes 4 is a large-scale reasoning model built on Meta-Llama-3.1-405B and released by Nous Research. It introduces a hybrid reasoning mode, where the model can choose to deliberate internally with...

textreasoningcheap

Input$1.0000/1M

Output$3.0000/1M

📏131kcontext

Explore specs and pricingView details →

Nous: Hermes 4 70B

Hermes 4 70B is a hybrid reasoning model from Nous Research, built on Meta-Llama-3.1-70B. It introduces the same hybrid mode as the larger 405B release, allowing the model to either...

textreasoningcheap

Input$0.1300/1M

Output$0.4000/1M

📏131kcontext

Explore specs and pricingView details →

xAI: Grok Code Fast 1

Grok Code Fast 1 is a speedy and economical reasoning model that excels at agentic coding. With reasoning traces visible in the response, developers can steer Grok Code for high-quality...

textcodereasoning

Input$0.2000/1M

Output$1.5000/1M

📏256kcontext

Explore specs and pricingView details →

Qwen: Qwen3 30B A3B Thinking 2507

Qwen3-30B-A3B-Thinking-2507 is a 30B parameter Mixture-of-Experts reasoning model optimized for complex tasks requiring extended multi-step thinking. The model is designed specifically for “thinking mode,” where internal reasoning traces are separated...

textreasoningcheap

Input$0.0800/1M

Output$0.4000/1M

📏131kcontext

Explore specs and pricingView details →

MoonshotAI: Kimi K2 0905

Kimi K2 0905 is the September update of [Kimi K2 0711](moonshotai/kimi-k2). It is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, featuring 1 trillion total parameters with 32...

textcheaplong-context

Input$0.4000/1M

Output$2.0000/1M

📏262kcontext

Explore specs and pricingView details →

Qwen: Qwen Plus 0728 (thinking)

Qwen Plus 0728, based on the Qwen3 foundation model, is a 1 million context hybrid reasoning model with a balanced performance, speed, and cost combination.

textreasoningcheap

📏1000kcontext

Explore specs and pricingView details →

Meituan: LongCat Flash Chat

LongCat-Flash-Chat is a large-scale Mixture-of-Experts (MoE) model with 560B total parameters, of which 18.6B–31.3B (≈27B on average) are dynamically activated per input. It introduces a shortcut-connected MoE design to reduce...

cheaplong-context

Input$0.2000/1M

Output$0.8000/1M

📏131kcontext

Explore specs and pricingView details →

Qwen: Qwen3 Next 80B A3B Thinking

Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured “thinking” traces by default. It’s designed for hard multi-step problems; math proofs, code synthesis/debugging, logic, and agentic...

textcodereasoning

📏262kcontext

Explore specs and pricingView details →

Qwen: Qwen3 Coder Flash

Qwen3 Coder Flash is Alibaba's fast and cost efficient version of their proprietary Qwen3 Coder Plus. It is a powerful coding agent model specializing in autonomous programming via tool calling...

📏1000kcontext

Explore specs and pricingView details →

Tongyi DeepResearch 30B A3B

Tongyi DeepResearch is an agentic large language model developed by Tongyi Lab, with 30 billion total parameters activating only 3 billion per token. It's optimized for long-horizon, deep information-seeking tasks...

textagentscheap

Input$0.0900/1M

Output$0.4500/1M

📏131kcontext

Explore specs and pricingView details →

xAI: Grok 4 Fast

Grok 4 Fast is xAI's latest multimodal model with SOTA cost-efficiency and a 2M token context window. It comes in two flavors: non-reasoning and reasoning. Read more about the model...

textvisionmultimodal

Input$0.2000/1M

Output$0.5000/1M

📏2000kcontext

Explore specs and pricingView details →

DeepSeek: DeepSeek V3.1 Terminus

DeepSeek-V3.1 Terminus is an update to [DeepSeek V3.1](/deepseek/deepseek-chat-v3.1) that maintains the model's original capabilities while addressing issues reported by users, including language consistency and agent capabilities, further optimizing the model's...

textagentscheap

Input$0.2100/1M

Output$0.7900/1M

📏164kcontext

Explore specs and pricingView details →

Qwen: Qwen3 Coder Plus

Qwen3 Coder Plus is Alibaba's proprietary version of the Open Source Qwen3 Coder 480B A35B. It is a powerful coding agent model specializing in autonomous programming via tool calling and...

📏1000kcontext

Explore specs and pricingView details →

Qwen: Qwen3 Max

Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage compared to the January 2025 version. It...

textreasoningmultilingual

📏262kcontext

Explore specs and pricingView details →

Qwen: Qwen3 VL 235B A22B Instruct

Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text generation with visual understanding across images and video. The Instruct model targets general vision-language use (VQA, document parsing, chart/table...

textvisionimage

Input$0.2000/1M

Output$0.8800/1M

📏262kcontext

Explore specs and pricingView details →

Qwen: Qwen3 VL 235B A22B Thinking

Qwen3-VL-235B-A22B Thinking is a multimodal model that unifies strong text generation with visual understanding across images and video. The Thinking model is optimized for multimodal reasoning in STEM and math....

textvisionimage

📏131kcontext

Explore specs and pricingView details →

Google: Gemini 2.5 Flash Lite Preview 09-2025

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

textvisionimage

Input$0.1000/1M

Output$0.4000/1M

📏1049kcontext

Explore specs and pricingView details →

Relace: Relace Apply 3

Relace Apply 3 is a specialized code-patching LLM that merges AI-suggested edits straight into your source files. It can apply updates from GPT-4o, Claude, and others into your files at...

📏256kcontext

Explore specs and pricingView details →

TheDrummer: Cydonia 24B V4.1

Uncensored and creative writing model based on Mistral Small 3.2 24B with good recall, prompt adherence, and intelligence.

textcheaplong-context

📏131kcontext

Explore specs and pricingView details →

DeepSeek: DeepSeek V3.2 Exp

DeepSeek-V3.2-Exp is an experimental large language model released by DeepSeek as an intermediate step between V3.1 and future architectures. It introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism...

textcheaplong-context

📏164kcontext

Explore specs and pricingView details →

Z.ai: GLM 4.6

Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex...

textcheaplong-context

Input$0.3900/1M

Output$1.9000/1M

📏203kcontext

Explore specs and pricingView details →

Qwen: Qwen3 VL 30B A3B Instruct

Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It excels in perception...

textvisionimage

📏262kcontext

Explore specs and pricingView details →

Qwen: Qwen3 VL 30B A3B Thinking

Qwen3-VL-30B-A3B-Thinking is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Thinking variant enhances reasoning in STEM, math, and complex tasks. It excels...

textvisionimage

📏131kcontext

Explore specs and pricingView details →

Google: Nano Banana (Gemini 2.5 Flash Image)

Gemini 2.5 Flash Image, a.k.a. "Nano Banana," is now generally available. It is a state of the art image generation model with contextual understanding. It is capable of image generation,...

textvisionimage

Explore specs and pricingView details →

Baidu: ERNIE 4.5 21B A3B Thinking

ERNIE-4.5-21B-A3B-Thinking is Baidu's upgraded lightweight MoE model, refined to boost reasoning depth and quality for top-tier performance in logical puzzles, math, science, coding, text generation, and expert-level academic benchmarks.

textcodereasoning

Input$0.0700/1M

Output$0.2800/1M

📏131kcontext

Explore specs and pricingView details →

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...

textcodereasoning

Input$0.1000/1M

Output$0.4000/1M

📏131kcontext

Explore specs and pricingView details →

Qwen: Qwen3 VL 8B Instruct

Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interleaved-MRoPE for long-horizon...

textvisionmultimodal

Input$0.0800/1M

Output$0.5000/1M

📏256kcontext

Explore specs and pricingView details →

Qwen: Qwen3 VL 8B Thinking

Qwen3-VL-8B-Thinking is the reasoning-optimized variant of the Qwen3-VL-8B multimodal model, designed for advanced visual and textual reasoning across complex scenes, documents, and temporal sequences. It integrates enhanced multimodal alignment and...

textvisionmultimodal

📏256kcontext

Explore specs and pricingView details →

Anthropic: Claude Haiku 4.5

Claude Haiku 4.5 is Anthropic’s fastest and most efficient model, delivering near-frontier intelligence at a fraction of the cost and latency of larger Claude models. Matching Claude Sonnet 4’s performance...

textvisionmultimodal

📏200kcontext

Explore specs and pricingView details →

IBM: Granite 4.0 Micro

Granite-4.0-H-Micro is a 3B parameter from the Granite 4 family of models. These models are the latest in a series of models released by IBM. They are fine-tuned for long...

textcheaplong-context

Input$0.0170/1M

Output$0.1100/1M

📏131kcontext

Explore specs and pricingView details →

Qwen: Qwen3 VL 32B Instruct

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...

textvisionmultimodal

Input$0.1040/1M

Output$0.4160/1M

📏262kcontext

Explore specs and pricingView details →

MiniMax: MiniMax M2

MiniMax-M2 is a compact, high-efficiency large language model optimized for end-to-end coding and agentic workflows. With 10 billion activated parameters (230 billion total), it delivers near-frontier intelligence across general reasoning,...

textcodereasoning

Input$0.2550/1M

Output$1.0000/1M

📏205kcontext

Explore specs and pricingView details →

← Prev 4 5 6 7 8 9 10 Next →