modelstop.top
Home/All Models

AI Model Catalogue

Browse 408 models across providers, modalities, and use cases.

📄 Long Context

408 models · Page 7 of 12

Meta: Llama 4 Scout

meta-llama

Llama 4 Scout 17B Instruct (16E) is a mixture-of-experts (MoE) language model developed by Meta, activating 17 billion parameters out of a total of 109B. It supports native multimodal input...

textvisionmultimodal
327,680 ctx$0.08/1M in
Explore specs and pricingView details →

Meta: Llama 4 Maverick

meta-llama

Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward...

textvisionmultimodal
1,048,576 ctx$0.15/1M in
Explore specs and pricingView details →

NVIDIA: Llama 3.1 Nemotron Ultra 253B v1

nvidia

Llama-3.1-Nemotron-Ultra-253B-v1 is a large language model (LLM) optimized for advanced reasoning, human-interactive chat, retrieval-augmented generation (RAG), and tool-calling tasks. Derived from Meta’s Llama-3.1-405B-Instruct, it has been significantly customized using Neural...

reasoninginstructcheap
131,072 ctx$0.60/1M in
Explore specs and pricingView details →

xAI: Grok 3 Beta

x-ai

Grok 3 is the latest model from xAI. It's their flagship model that excels at enterprise use cases like data extraction, coding, and text summarization. Possesses deep domain knowledge in...

textcodelong-context
131,072 ctx$3.00/1M in
Explore specs and pricingView details →

xAI: Grok 3 Mini Beta

x-ai

Grok 3 Mini is a lightweight, smaller thinking model. Unlike traditional models that generate answers immediately, Grok 3 Mini thinks before responding. It’s ideal for reasoning-heavy tasks that don’t demand...

textreasoningcheap
131,072 ctx$0.30/1M in
Explore specs and pricingView details →

OpenAI: GPT-4.1 Nano

openai

For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1 million...

textvisionmultimodal
1,047,576 ctx$0.10/1M in
Explore specs and pricingView details →

OpenAI: GPT-4.1 Mini

openai

GPT-4.1 Mini is a mid-sized model delivering performance competitive with GPT-4o at substantially lower latency and cost. It retains a 1 million token context window and scores 45.1% on hard...

textvisionmultimodal
1,047,576 ctx$0.40/1M in
Explore specs and pricingView details →

OpenAI: GPT-4.1

openai

GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and...

textvisionmultimodal
1,047,576 ctx$2.00/1M in
Explore specs and pricingView details →

OpenAI: o4 Mini

openai

OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-efficient performance while retaining strong multimodal and agentic capabilities. It supports tool use and demonstrates competitive reasoning...

textvisionmultimodal
200,000 ctx$1.10/1M in
Explore specs and pricingView details →

OpenAI: o3

openai

o3 is a well-rounded and powerful model across domains. It sets a new standard for math, science, coding, and visual reasoning tasks. It also excels at technical writing and instruction-following....

textvisionmultimodal
200,000 ctx$2.00/1M in
Explore specs and pricingView details →

OpenAI: o4 Mini High

openai

OpenAI o4-mini-high is the same model as [o4-mini](/openai/o4-mini) with reasoning_effort set to high. OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-efficient performance while retaining...

textvisionmultimodal
200,000 ctx$1.10/1M in
Explore specs and pricingView details →

Qwen: Qwen3 235B A22B

qwen

Qwen3-235B-A22B is a 235B parameter mixture-of-experts (MoE) model developed by Qwen, activating 22B parameters per forward pass. It supports seamless switching between a "thinking" mode for complex reasoning, math, and...

textreasoningcheap
131,072 ctx$0.46/1M in
Explore specs and pricingView details →

Meta: Llama Guard 4 12B

meta-llama

Llama Guard 4 is a Llama 4 Scout-derived multimodal pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classify content in both LLM...

textvisionmultimodal
163,840 ctx$0.18/1M in
Explore specs and pricingView details →

Inception: Mercury Coder

inception

Mercury Coder is the first diffusion large language model (dLLM). Applying a breakthrough discrete diffusion approach, the model runs 5-10x faster than even speed optimized models like Claude 3.5 Haiku...

codecheaplong-context
128,000 ctx$0.25/1M in
Explore specs and pricingView details →

Arcee AI: Virtuoso Large

arcee-ai

Virtuoso‑Large is Arcee's top‑tier general‑purpose LLM at 72 B parameters, tuned to tackle cross‑domain reasoning, creative writing and enterprise QA. Unlike many 70 B peers, it retains the 128 k...

textreasoningcheap
131,072 ctx$0.75/1M in
Explore specs and pricingView details →

Arcee AI: Maestro Reasoning

arcee-ai

Maestro Reasoning is Arcee's flagship analysis model: a 32 B‑parameter derivative of Qwen 2.5‑32 B tuned with DPO and chain‑of‑thought RL for step‑by‑step logic. Compared to the earlier 7 B...

textreasoningcheap
131,072 ctx$0.90/1M in
Explore specs and pricingView details →

Arcee AI: Spotlight

arcee-ai

Spotlight is a 7‑billion‑parameter vision‑language model derived from Qwen 2.5‑VL and fine‑tuned by Arcee AI for tight image‑text grounding tasks. It offers a 32 k‑token context window, enabling rich multimodal...

textvisionmultimodal
131,072 ctx$0.18/1M in
Explore specs and pricingView details →

Google: Gemini 2.5 Pro Preview 05-06

google

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

textvisionmultimodal
1,048,576 ctx$1.25/1M in
Explore specs and pricingView details →

Mistral: Mistral Medium 3

mistralai

Mistral Medium 3 is a high-performance enterprise-grade language model designed to deliver frontier-level capabilities at significantly reduced operational cost. It balances state-of-the-art reasoning and multimodal performance with 8× lower cost...

textvisionmultimodal
131,072 ctx$0.40/1M in
Explore specs and pricingView details →

Anthropic: Claude Sonnet 4

anthropic

Claude Sonnet 4 significantly enhances the capabilities of its predecessor, Sonnet 3.7, excelling in both coding and reasoning tasks with improved precision and controllability. Achieving state-of-the-art performance on SWE-bench (72.7%),...

textvisionmultimodal
1,000,000 ctx$3.00/1M in
Explore specs and pricingView details →

Anthropic: Claude Opus 4

anthropic

Claude Opus 4 is benchmarked as the world’s best coding model, at time of release, bringing sustained performance on complex, long-running tasks and agent workflows. It sets new benchmarks in...

textvisionmultimodal
200,000 ctx$15.00/1M in
Explore specs and pricingView details →

DeepSeek: R1 0528

deepseek

May 28th update to the [original DeepSeek R1](/deepseek/deepseek-r1) Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active...

textreasoningcheap
163,840 ctx$0.45/1M in
Explore specs and pricingView details →

Google: Gemini 2.5 Pro Preview 06-05

google

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

textvisionmultimodal
1,048,576 ctx$1.25/1M in
Explore specs and pricingView details →

xAI: Grok 3

x-ai

Grok 3 is the latest model from xAI. It's their flagship model that excels at enterprise use cases like data extraction, coding, and text summarization. Possesses deep domain knowledge in...

textcodelong-context
131,072 ctx$3.00/1M in
Explore specs and pricingView details →

xAI: Grok 3 Mini

x-ai

A lightweight model that thinks before responding. Fast, smart, and great for logic-based tasks that do not require deep domain knowledge. The raw thinking traces are accessible.

textreasoningcheap
131,072 ctx$0.30/1M in
Explore specs and pricingView details →

OpenAI: o3 Pro

openai

The o-series of models are trained with reinforcement learning to think before they answer and perform complex reasoning. The o3-pro model uses more compute to think harder and provide consistently...

textvisionmultimodal
200,000 ctx$20.00/1M in
Explore specs and pricingView details →

Google: Gemini 2.5 Pro

google

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

textvisionmultimodal
1,048,576 ctx$1.25/1M in
Explore specs and pricingView details →

Google: Gemini 2.5 Flash

google

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...

textvisionmultimodal
1,048,576 ctx$0.30/1M in
Explore specs and pricingView details →

MiniMax: MiniMax M1

minimax

MiniMax-M1 is a large-scale, open-weight reasoning model designed for extended context and high-efficiency inference. It leverages a hybrid Mixture-of-Experts (MoE) architecture paired with a custom "lightning attention" mechanism, allowing it...

textreasoningcheap
1,000,000 ctx$0.40/1M in
Explore specs and pricingView details →

Mistral: Mistral Small 3.2 24B

mistralai

Mistral-Small-3.2-24B-Instruct-2506 is an updated 24B parameter model from Mistral optimized for instruction following, repetition reduction, and improved function calling. Compared to the 3.1 release, version 3.2 significantly improves accuracy on...

textvisionmultimodal
128,000 ctx$0.07/1M in
Explore specs and pricingView details →

Inception: Mercury

inception

Mercury is the first diffusion large language model (dLLM). Applying a breakthrough discrete diffusion approach, the model runs 5-10x faster than even speed optimized models like GPT-4.1 Nano and Claude...

cheaplong-context
128,000 ctx$0.25/1M in
Explore specs and pricingView details →

Baidu: ERNIE 4.5 300B A47B

baidu

ERNIE-4.5-300B-A47B is a 300B parameter Mixture-of-Experts (MoE) language model developed by Baidu as part of the ERNIE 4.5 series. It activates 47B parameters per token and supports text generation in...

textcheaplong-context
123,000 ctx$0.28/1M in
Explore specs and pricingView details →

Baidu: ERNIE 4.5 VL 424B A47B

baidu

ERNIE-4.5-VL-424B-A47B is a multimodal Mixture-of-Experts (MoE) model from Baidu’s ERNIE 4.5 series, featuring 424B total parameters with 47B active per token. It is trained jointly on text and image data...

textvisionmultimodal
123,000 ctx$0.42/1M in
Explore specs and pricingView details →

Morph: Morph V3 Large

morph

Morph's high-accuracy apply model for complex code edits. ~4,500 tokens/sec with 98% accuracy for precise code transformations. The model requires the prompt to be in the following format: <instruction>{instruction}</instruction> <code>{initial_code}</code>...

textcodecheap
262,144 ctx$0.90/1M in
Explore specs and pricingView details →

TNG: DeepSeek R1T2 Chimera

tngtech

DeepSeek-TNG-R1T2-Chimera is the second-generation Chimera model from TNG Tech. It is a 671 B-parameter mixture-of-experts text-generation model assembled from DeepSeek-AI’s R1-0528, R1, and V3-0324 checkpoints with an Assembly-of-Experts merge. The...

textcheaplong-context
163,840 ctx$0.30/1M in
Explore specs and pricingView details →

Tencent: Hunyuan A13B Instruct

tencent

Hunyuan-A13B is a 13B active parameter Mixture-of-Experts (MoE) language model developed by Tencent, with a total parameter count of 80B and support for reasoning via Chain-of-Thought. It offers competitive benchmark...

textreasoninginstruct
131,072 ctx$0.14/1M in
Explore specs and pricingView details →