modelstop.top

AI Model Catalogue

Browse 454 models across providers, modalities, and use cases.

📄 Long Context

454 models · Page 8 of 13

Cohere: Command A

cohere

Command A is an open-weights 111B parameter model with a 256k context window focused on delivering great performance across agentic, multilingual, and coding use cases. Compared to other leading proprietary...

text · code · agents
Run locally
256,000 ctx · $2.50/1M in

Google: Gemma 3 12B

google

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

text · vision · multimodal
Run locally
131,072 ctx · $0.04/1M in

Google: Gemma 3 4B

google

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

text · vision · multimodal
Run locally
131,072 ctx · $0.04/1M in

AllenAI: Olmo 2 32B Instruct

allenai

OLMo-2 32B Instruct is a supervised instruction-finetuned variant of the OLMo-2 32B March 2025 base model. It excels in complex reasoning and instruction-following tasks across diverse benchmarks such as GSM8K,...

text · reasoning · instruct
128,000 ctx · $0.05/1M in

Mistral: Mistral Small 3.1 24B

mistralai

Mistral Small 3.1 24B Instruct is an upgraded variant of Mistral Small 3 (2501), featuring 24 billion parameters with advanced multimodal capabilities. It provides state-of-the-art performance in text-based reasoning and...

text · vision · multimodal
128,000 ctx · $0.03/1M in

OpenAI: o1-pro

openai

The o1 series of models are trained with reinforcement learning to think before they answer and perform complex reasoning. The o1-pro model uses more compute to think harder and provide...

text · vision · multimodal
200,000 ctx · $150.00/1M in

DeepSeek: DeepSeek V3 0324

deepseek

DeepSeek V3, a 685B-parameter, mixture-of-experts model, is the latest iteration of the flagship chat model family from the DeepSeek team. It succeeds the DeepSeek V3 model and performs really well...

text · cheap · long-context
163,840 ctx · $0.20/1M in

Qwen: Qwen2.5 VL 32B Instruct

qwen

Qwen2.5-VL-32B is a multimodal vision-language model fine-tuned through reinforcement learning for enhanced mathematical reasoning, structured outputs, and visual problem-solving capabilities. It excels at visual analysis tasks, including object recognition, textual...

text · vision · multimodal
128,000 ctx · $0.20/1M in

Meta: Llama 4 Scout

meta-llama

Llama 4 Scout 17B Instruct (16E) is a mixture-of-experts (MoE) language model developed by Meta, activating 17 billion parameters out of a total of 109B. It supports native multimodal input...

text · vision · multimodal
327,680 ctx · $0.08/1M in

Meta: Llama 4 Maverick

meta-llama

Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward...

text · vision · multimodal
Run locally
1,048,576 ctx · $0.15/1M in

NVIDIA: Llama 3.1 Nemotron Ultra 253B v1

nvidia

Llama-3.1-Nemotron-Ultra-253B-v1 is a large language model (LLM) optimized for advanced reasoning, human-interactive chat, retrieval-augmented generation (RAG), and tool-calling tasks. Derived from Meta’s Llama-3.1-405B-Instruct, it has been significantly customized using Neural...

reasoning · instruct · cheap
131,072 ctx · $0.60/1M in

xAI: Grok 3 Beta

x-ai

Grok 3 is the latest model from xAI. It's their flagship model that excels at enterprise use cases like data extraction, coding, and text summarization. Possesses deep domain knowledge in...

text · code · long-context
131,072 ctx · $3.00/1M in

xAI: Grok 3 Mini Beta

x-ai

Grok 3 Mini is a lightweight, smaller thinking model. Unlike traditional models that generate answers immediately, Grok 3 Mini thinks before responding. It’s ideal for reasoning-heavy tasks that don’t demand...

text · reasoning · cheap
131,072 ctx · $0.30/1M in

OpenAI: GPT-4.1 Nano

openai

For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1 million...

text · vision · multimodal
Run locally
1,047,576 ctx · $0.10/1M in

OpenAI: GPT-4.1 Mini

openai

GPT-4.1 Mini is a mid-sized model delivering performance competitive with GPT-4o at substantially lower latency and cost. It retains a 1 million token context window and scores 45.1% on hard...

text · vision · multimodal
1,047,576 ctx · $0.40/1M in

OpenAI: GPT-4.1

openai

GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and...

text · vision · multimodal
Run locally
1,047,576 ctx · $2.00/1M in

OpenAI: o4 Mini

openai

OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-efficient performance while retaining strong multimodal and agentic capabilities. It supports tool use and demonstrates competitive reasoning...

text · vision · multimodal
200,000 ctx · $1.10/1M in

OpenAI: o3

openai

o3 is a well-rounded and powerful model across domains. It sets a new standard for math, science, coding, and visual reasoning tasks. It also excels at technical writing and instruction-following....

text · vision · multimodal
Run locally
200,000 ctx · $2.00/1M in

OpenAI: o4 Mini High

openai

OpenAI o4-mini-high is the same model as o4-mini with reasoning_effort set to high. OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-efficient performance while retaining...

text · vision · multimodal
200,000 ctx · $1.10/1M in

Qwen: Qwen3 235B A22B

qwen

Qwen3-235B-A22B is a 235B parameter mixture-of-experts (MoE) model developed by Qwen, activating 22B parameters per forward pass. It supports seamless switching between a "thinking" mode for complex reasoning, math, and...

text · reasoning · cheap
131,072 ctx · $0.46/1M in

Qwen: Qwen3 30B A3B

qwen

Qwen3, the latest generation in the Qwen large language model series, features both dense and mixture-of-experts (MoE) architectures to excel in reasoning, multilingual support, and advanced agent tasks. Its unique...

text · reasoning · agents
Run locally
131,072 ctx · $0.08/1M in

Meta: Llama Guard 4 12B

meta-llama

Llama Guard 4 is a Llama 4 Scout-derived multimodal pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classify content in both LLM...

text · vision · multimodal
163,840 ctx · $0.18/1M in

Inception: Mercury Coder

inception

Mercury Coder is the first diffusion large language model (dLLM). Applying a breakthrough discrete diffusion approach, the model runs 5-10x faster than even speed-optimized models like Claude 3.5 Haiku...

code · cheap · long-context
128,000 ctx · $0.25/1M in

Arcee AI: Virtuoso Large

arcee-ai

Virtuoso‑Large is Arcee's top‑tier general‑purpose LLM at 72B parameters, tuned to tackle cross‑domain reasoning, creative writing and enterprise QA. Unlike many 70B peers, it retains the 128k...

text · reasoning · cheap
131,072 ctx · $0.75/1M in

Arcee AI: Maestro Reasoning

arcee-ai

Maestro Reasoning is Arcee's flagship analysis model: a 32B‑parameter derivative of Qwen 2.5‑32B tuned with DPO and chain‑of‑thought RL for step‑by‑step logic. Compared to the earlier 7B...

text · reasoning · cheap
131,072 ctx · $0.90/1M in

Arcee AI: Spotlight

arcee-ai

Spotlight is a 7‑billion‑parameter vision‑language model derived from Qwen 2.5‑VL and fine‑tuned by Arcee AI for tight image‑text grounding tasks. It offers a 32 k‑token context window, enabling rich multimodal...

text · vision · multimodal
131,072 ctx · $0.18/1M in

Google: Gemini 2.5 Pro Preview 05-06

google

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

text · vision · multimodal
Run locally
1,048,576 ctx · $1.25/1M in

Mistral: Mistral Medium 3

mistralai

Mistral Medium 3 is a high-performance enterprise-grade language model designed to deliver frontier-level capabilities at significantly reduced operational cost. It balances state-of-the-art reasoning and multimodal performance with 8× lower cost...

text · vision · multimodal
131,072 ctx · $0.40/1M in

Anthropic: Claude Sonnet 4

anthropic

Claude Sonnet 4 significantly enhances the capabilities of its predecessor, Sonnet 3.7, excelling in both coding and reasoning tasks with improved precision and controllability. Achieving state-of-the-art performance on SWE-bench (72.7%),...

text · vision · multimodal
1,000,000 ctx · $3.00/1M in

Anthropic: Claude Opus 4

anthropic

Claude Opus 4 is benchmarked as the world’s best coding model, at time of release, bringing sustained performance on complex, long-running tasks and agent workflows. It sets new benchmarks in...

text · vision · multimodal
Run locally
200,000 ctx · $15.00/1M in

DeepSeek: R1 0528

deepseek

May 28th update to the original DeepSeek R1. Performance on par with OpenAI o1, but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active...

text · reasoning · cheap
Run locally
163,840 ctx · $0.45/1M in

Google: Gemini 2.5 Pro Preview 06-05

google

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

text · vision · multimodal
Run locally
1,048,576 ctx · $1.25/1M in

xAI: Grok 3

x-ai

Grok 3 is the latest model from xAI. It's their flagship model that excels at enterprise use cases like data extraction, coding, and text summarization. Possesses deep domain knowledge in...

text · code · long-context
131,072 ctx · $3.00/1M in

xAI: Grok 3 Mini

x-ai

A lightweight model that thinks before responding. Fast, smart, and great for logic-based tasks that do not require deep domain knowledge. The raw thinking traces are accessible.

text · reasoning · cheap
131,072 ctx · $0.30/1M in

OpenAI: o3 Pro

openai

The o-series of models are trained with reinforcement learning to think before they answer and perform complex reasoning. The o3-pro model uses more compute to think harder and provide consistently...

text · vision · multimodal
Run locally
200,000 ctx · $20.00/1M in

Google: Gemini 2.5 Pro

google

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

text · vision · multimodal
Run locally
1,048,576 ctx · $1.25/1M in
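
Each listing gives a context limit in tokens and an input price per 1M tokens. As a rough illustration of how those two figures combine, the sketch below estimates the input-side cost of a single prompt. The `CATALOGUE` table and the `input_cost` helper are hypothetical names introduced here, not part of any provider API. The figures are copied from a few of the entries on this page. Output-token pricing is not shown in the listings, so it is left out.

```python
from decimal import Decimal

# (context limit in tokens, input price in USD per 1M tokens),
# copied from a few listings on this page. Hypothetical table for illustration.
CATALOGUE = {
    "Cohere: Command A": (256_000, Decimal("2.50")),
    "Google: Gemma 3 12B": (131_072, Decimal("0.04")),
    "OpenAI: o1-pro": (200_000, Decimal("150.00")),
    "OpenAI: GPT-4.1": (1_047_576, Decimal("2.00")),
}


def input_cost(model: str, prompt_tokens: int) -> Decimal:
    """Estimate the input-side cost in USD of one prompt (illustrative helper)."""
    context_limit, price_per_million = CATALOGUE[model]
    if prompt_tokens > context_limit:
        # A prompt longer than the listed context window cannot be sent as-is.
        raise ValueError(
            f"{prompt_tokens} tokens exceeds the {context_limit}-token context of {model}"
        )
    return price_per_million * prompt_tokens / Decimal(1_000_000)


if __name__ == "__main__":
    for name in CATALOGUE:
        print(f"{name}: ${input_cost(name, 100_000)} for a 100,000-token prompt")
```

Using `Decimal` rather than floats keeps small per-token prices exact. A fuller estimate would also need output prices, which this page does not list.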