Playground Find a Model ⚡ Pro Tools Pulse API Advertise PricingLoading...

Loading...

The most comprehensive directory of AI models, providers, and agents. Updated daily.

Explore

All Models
Collections
Leaderboard
Compare
Pro Tools
Pulse Feed
API Docs

Categories

Language Models
Inference Providers
Agents & SaaS
Open Source

Stay Updated

Weekly digest of new models and price changes.

Business contact

Support: support@modelstop.top

Enquiries: hello@modelstop.top

Billing: billing@modelstop.top

Privacy: privacy@modelstop.top

Legal: legal@modelstop.top

© 2026 modelstop.top. All rights reserved.Updated daily · 4695+ models indexed

Home/All Models

AI Model Catalogue

Browse 464 models across providers, modalities, and use cases.

🌐All Models 💬Text Generation 💻Code & Reasoning 👁️Vision & Multimodal 🎨Image Generation 🎙️Audio & Speech 🤖Agents & Tools 📄Long Context 🆓Free & Open

🧠

Reasoning

🌍Multilingual

Providers:⚡OpenAI 🔷Anthropic 🔍Google 🦙Meta 🌀Mistral ✕xAI 🚀Groq 🐋DeepSeek 🌐Cohere ☁️Amazon

Filter & Sort

📄 Long Context

464 models · Page 8 of 13

Perplexity: Sonar Pro

Note: Sonar Pro pricing includes Perplexity search pricing. See [details here](https://docs.perplexity.ai/guides/pricing#detailed-pricing-breakdown-for-sonar-reasoning-pro-and-sonar-pro) For enterprises seeking more advanced capabilities, the Sonar Pro API can handle in-depth, multi-step queries with added extensibility, like...

textvisionmultimodal

📏200kcontext

Explore specs and pricingView details →

Perplexity: Sonar Reasoning Pro

Note: Sonar Pro pricing includes Perplexity search pricing. See [details here](https://docs.perplexity.ai/guides/pricing#detailed-pricing-breakdown-for-sonar-reasoning-pro-and-sonar-pro) Sonar Reasoning Pro is a premier reasoning model powered by DeepSeek R1 with Chain of Thought (CoT). Designed for...

textvisionmultimodal

📏128kcontext

Explore specs and pricingView details →

Google: Gemma 3 27B

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

textvisionmultimodal

Input$0.0800/1M

Output$0.1600/1M

📏131kcontext

Explore specs and pricingView details →

OpenAI: GPT-4o Search Preview

GPT-4o Search Previewis a specialized model for web search in Chat Completions. It is trained to understand and execute web search queries.

textlong-context

Input$2.5000/1M

Output$10.0000/1M

📏128kcontext

Explore specs and pricingView details →

OpenAI: GPT-4o-mini Search Preview

GPT-4o mini Search Preview is a specialized model for web search in Chat Completions. It is trained to understand and execute web search queries.

textcheaplong-context

📏128kcontext

Explore specs and pricingView details →

Cohere: Command A

Command A is an open-weights 111B parameter model with a 256k context window focused on delivering great performance across agentic, multilingual, and coding use cases. Compared to other leading proprietary...

📏256kcontext

Explore specs and pricingView details →

Google: Gemma 3 12B

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

textvisionmultimodal

Input$0.0400/1M

Output$0.1300/1M

📏131kcontext

Explore specs and pricingView details →

Google: Gemma 3 4B

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

textvisionmultimodal

Input$0.0400/1M

Output$0.0800/1M

📏131kcontext

Explore specs and pricingView details →

AllenAI: Olmo 2 32B Instruct

OLMo-2 32B Instruct is a supervised instruction-finetuned variant of the OLMo-2 32B March 2025 base model. It excels in complex reasoning and instruction-following tasks across diverse benchmarks such as GSM8K,...

textreasoninginstruct

Input$0.0500/1M

Output$0.2000/1M

📏128kcontext

Explore specs and pricingView details →

Mistral: Mistral Small 3.1 24B

Mistral Small 3.1 24B Instruct is an upgraded variant of Mistral Small 3 (2501), featuring 24 billion parameters with advanced multimodal capabilities. It provides state-of-the-art performance in text-based reasoning and...

textvisionmultimodal

Input$0.0300/1M

Output$0.1100/1M

📏128kcontext

Explore specs and pricingView details →

OpenAI: o1-pro

The o1 series of models are trained with reinforcement learning to think before they answer and perform complex reasoning. The o1-pro model uses more compute to think harder and provide...

textvisionmultimodal

Input$150.0000/1M

Output$600.0000/1M

📏200kcontext

Explore specs and pricingView details →

DeepSeek: DeepSeek V3 0324

DeepSeek V3, a 685B-parameter, mixture-of-experts model, is the latest iteration of the flagship chat model family from the DeepSeek team. It succeeds the [DeepSeek V3](/deepseek/deepseek-chat-v3) model and performs really well...

textcheaplong-context

Input$0.2000/1M

Output$0.7700/1M

📏164kcontext

Explore specs and pricingView details →

Qwen: Qwen2.5 VL 32B Instruct

Qwen2.5-VL-32B is a multimodal vision-language model fine-tuned through reinforcement learning for enhanced mathematical reasoning, structured outputs, and visual problem-solving capabilities. It excels at visual analysis tasks, including object recognition, textual...

textvisionmultimodal

Input$0.2000/1M

Output$0.6000/1M

📏128kcontext

Explore specs and pricingView details →

Meta: Llama 4 Scout

Llama 4 Scout 17B Instruct (16E) is a mixture-of-experts (MoE) language model developed by Meta, activating 17 billion parameters out of a total of 109B. It supports native multimodal input...

textvisionmultimodal

Input$0.0800/1M

Output$0.3000/1M

📏328kcontext

Explore specs and pricingView details →

Meta: Llama 4 Maverick

Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward...

textvisionmultimodal

Input$0.1500/1M

Output$0.6000/1M

📏1049kcontext

Explore specs and pricingView details →

NVIDIA: Llama 3.1 Nemotron Ultra 253B v1

Llama-3.1-Nemotron-Ultra-253B-v1 is a large language model (LLM) optimized for advanced reasoning, human-interactive chat, retrieval-augmented generation (RAG), and tool-calling tasks. Derived from Meta’s Llama-3.1-405B-Instruct, it has been significantly customized using Neural...

reasoninginstructcheap

Input$0.6000/1M

Output$1.8000/1M

📏131kcontext

Explore specs and pricingView details →

xAI: Grok 3 Beta

Grok 3 is the latest model from xAI. It's their flagship model that excels at enterprise use cases like data extraction, coding, and text summarization. Possesses deep domain knowledge in...

textcodelong-context

Input$3.0000/1M

Output$15.0000/1M

📏131kcontext

Explore specs and pricingView details →

xAI: Grok 3 Mini Beta

Grok 3 Mini is a lightweight, smaller thinking model. Unlike traditional models that generate answers immediately, Grok 3 Mini thinks before responding. It’s ideal for reasoning-heavy tasks that don’t demand...

textreasoningcheap

Input$0.3000/1M

Output$0.5000/1M

📏131kcontext

Explore specs and pricingView details →

OpenAI: GPT-4.1 Nano

For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1 million...

textvisionmultimodal

Input$0.1000/1M

Output$0.4000/1M

📏1048kcontext

⭐1270.0%score

Explore specs and pricingView details →

OpenAI: GPT-4.1 Mini

GPT-4.1 Mini is a mid-sized model delivering performance competitive with GPT-4o at substantially lower latency and cost. It retains a 1 million token context window and scores 45.1% on hard...

textvisionmultimodal

Input$0.4000/1M

Output$1.6000/1M

📏1048kcontext

Explore specs and pricingView details →

OpenAI: GPT-4.1

GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and...

textvisionmultimodal

📏1048kcontext

Explore specs and pricingView details →

OpenAI: o4 Mini

OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-efficient performance while retaining strong multimodal and agentic capabilities. It supports tool use and demonstrates competitive reasoning...

textvisionmultimodal

📏200kcontext

Explore specs and pricingView details →

OpenAI: o3

o3 is a well-rounded and powerful model across domains. It sets a new standard for math, science, coding, and visual reasoning tasks. It also excels at technical writing and instruction-following....

textvisionmultimodal

Input$2.0000/1M

Output$8.0000/1M

📏200kcontext

Explore specs and pricingView details →

OpenAI: o4 Mini High

OpenAI o4-mini-high is the same model as [o4-mini](/openai/o4-mini) with reasoning_effort set to high. OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-efficient performance while retaining...

textvisionmultimodal

📏200kcontext

Explore specs and pricingView details →

Qwen: Qwen3 235B A22B

Qwen3-235B-A22B is a 235B parameter mixture-of-experts (MoE) model developed by Qwen, activating 22B parameters per forward pass. It supports seamless switching between a "thinking" mode for complex reasoning, math, and...

textreasoningcheap

Input$0.4550/1M

Output$1.8200/1M

📏131kcontext

Explore specs and pricingView details →

Qwen: Qwen3 32B

Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for...

textreasoningcheap

Input$0.0800/1M

Output$0.2400/1M

📏131kcontext

Explore specs and pricingView details →

Qwen: Qwen3 30B A3B

Qwen3, the latest generation in the Qwen large language model series, features both dense and mixture-of-experts (MoE) architectures to excel in reasoning, multilingual support, and advanced agent tasks. Its unique...

textreasoningagents

Input$0.0800/1M

Output$0.2800/1M

📏131kcontext

Explore specs and pricingView details →

Meta: Llama Guard 4 12B

Llama Guard 4 is a Llama 4 Scout-derived multimodal pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classify content in both LLM...

textvisionmultimodal

📏164kcontext

Explore specs and pricingView details →

Inception: Mercury Coder

Mercury Coder is the first diffusion large language model (dLLM). Applying a breakthrough discrete diffusion approach, the model runs 5-10x faster than even speed optimized models like Claude 3.5 Haiku...

codecheaplong-context

Input$0.2500/1M

Output$0.7500/1M

📏128kcontext

Explore specs and pricingView details →

Arcee AI: Virtuoso Large

Virtuoso‑Large is Arcee's top‑tier general‑purpose LLM at 72 B parameters, tuned to tackle cross‑domain reasoning, creative writing and enterprise QA. Unlike many 70 B peers, it retains the 128 k...

textreasoningcheap

📏131kcontext

Explore specs and pricingView details →

Arcee AI: Maestro Reasoning

Maestro Reasoning is Arcee's flagship analysis model: a 32 B‑parameter derivative of Qwen 2.5‑32 B tuned with DPO and chain‑of‑thought RL for step‑by‑step logic. Compared to the earlier 7 B...

textreasoningcheap

Input$0.9000/1M

Output$3.3000/1M

📏131kcontext

Explore specs and pricingView details →

Arcee AI: Spotlight

Spotlight is a 7‑billion‑parameter vision‑language model derived from Qwen 2.5‑VL and fine‑tuned by Arcee AI for tight image‑text grounding tasks. It offers a 32 k‑token context window, enabling rich multimodal...

textvisionmultimodal

Input$0.1800/1M

Output$0.1800/1M

📏131kcontext

Explore specs and pricingView details →

Google: Gemini 2.5 Pro Preview 05-06

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

textvisionmultimodal

📏1049kcontext

Explore specs and pricingView details →

Mistral: Mistral Medium 3

Mistral Medium 3 is a high-performance enterprise-grade language model designed to deliver frontier-level capabilities at significantly reduced operational cost. It balances state-of-the-art reasoning and multimodal performance with 8× lower cost...

textvisionmultimodal

Input$0.4000/1M

Output$2.0000/1M

📏131kcontext

Explore specs and pricingView details →

Anthropic: Claude Sonnet 4

Claude Sonnet 4 significantly enhances the capabilities of its predecessor, Sonnet 3.7, excelling in both coding and reasoning tasks with improved precision and controllability. Achieving state-of-the-art performance on SWE-bench (72.7%),...

textvisionmultimodal

📏1000kcontext

Explore specs and pricingView details →

Anthropic: Claude Opus 4

Claude Opus 4 is benchmarked as the world’s best coding model, at time of release, bringing sustained performance on complex, long-running tasks and agent workflows. It sets new benchmarks in...

textvisionmultimodal

📏200kcontext

Explore specs and pricingView details →

← Prev 5 6 7 8 9 10 11 Next →