modelstop.top

AI Model Catalogue

Browse 454 models across providers, modalities, and use cases.

Long Context

454 models · Page 7 of 13

Qwen2.5 Coder 32B Instruct

qwen

Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly known as CodeQwen). Qwen2.5-Coder brings the following improvements upon CodeQwen1.5: - Significant improvements in **code generation**, **code reasoning**...

text · code · reasoning
128,000 ctx · $0.66/1M in

Mistral: Pixtral Large 2411

mistralai

Pixtral Large is a 124B parameter, open-weight, multimodal model built on top of [Mistral Large 2](/mistralai/mistral-large-2411). The model is able to understand documents, charts and natural images. The model is...

text · vision · multimodal
131,072 ctx · $2.00/1M in

Mistral Large 2407

mistralai

This is Mistral AI's flagship model, Mistral Large 2 (version mistral-large-2407). It's a proprietary weights-available model and excels at reasoning, code, JSON, chat, and more. Read the launch announcement [here](https://mistral.ai/news/mistral-large-2407/)....

text · code · reasoning
131,072 ctx · $2.00/1M in

Mistral Large 2411

mistralai

Mistral Large 2 2411 is an update of [Mistral Large 2](/mistralai/mistral-large) released together with [Pixtral Large 2411](/mistralai/pixtral-large-2411). It provides a significant upgrade on the previous [Mistral Large 24.07](/mistralai/mistral-large-2407), with notable...

text · long-context
131,072 ctx · $2.00/1M in

OpenAI: GPT-4o (2024-11-20)

openai

The 2024-11-20 version of GPT-4o offers a leveled-up creative writing ability with more natural, engaging, and tailored writing to improve relevance & readability. It's also better at working with uploaded...

text · vision · multimodal
128,000 ctx · $2.50/1M in

Amazon: Nova Pro 1.0

amazon

Amazon Nova Pro 1.0 is a capable multimodal model from Amazon focused on providing a combination of accuracy, speed, and cost for a wide range of tasks. As of December...

text · vision · multimodal
300,000 ctx · $0.80/1M in

Amazon: Nova Micro 1.0

amazon

Amazon Nova Micro 1.0 is a text-only model that delivers the lowest latency responses in the Amazon Nova family of models at a very low cost. With a context length...

text · cheap · long-context
128,000 ctx · $0.04/1M in

Amazon: Nova Lite 1.0

amazon

Amazon Nova Lite 1.0 is a very low-cost multimodal model from Amazon focused on fast processing of image, video, and text inputs to generate text output. Amazon Nova Lite...

text · vision · image
300,000 ctx · $0.06/1M in

Meta: Llama 3.3 70B Instruct (free)

meta-llama

The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction tuned generative model in 70B (text in/text out). The Llama 3.3 instruction tuned text only model...

text · multilingual · instruct
131,072 ctx · $0.10/1M in

Cohere: Command R7B (12-2024)

cohere

Command R7B (12-2024) is a small, fast update of the Command R+ model, delivered in December 2024. It excels at RAG, tool use, agents, and similar tasks requiring complex reasoning...

text · reasoning · agents
128,000 ctx · $0.04/1M in

OpenAI: o1

openai

The latest and strongest model family from OpenAI, o1 is designed to spend more time thinking before responding. The o1 model series is trained with large-scale reinforcement learning to reason...

text · vision · multimodal
200,000 ctx · $15.00/1M in

Sao10K: Llama 3.3 Euryale 70B

sao10k

Euryale L3.3 70B is a model focused on creative roleplay from [Sao10k](https://ko-fi.com/sao10k). It is the successor of [Euryale L3 70B v2.2](/models/sao10k/l3-euryale-70b).

text · cheap · long-context
131,072 ctx · $0.65/1M in

DeepSeek: DeepSeek V3

deepseek

DeepSeek-V3 is the latest model from the DeepSeek team, building upon the instruction following and coding abilities of the previous versions. Pre-trained on nearly 15 trillion tokens, the reported evaluations...

text · code · cheap
131,072 ctx · $0.32/1M in

MiniMax: MiniMax-01

minimax

MiniMax-01 combines MiniMax-Text-01 for text generation and MiniMax-VL-01 for image understanding. It has 456 billion parameters, with 45.9 billion parameters activated per inference, and can handle a context...

text · vision · image
1,000,192 ctx · $0.20/1M in

DeepSeek: R1 Distill Llama 70B

deepseek

DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). The model combines advanced distillation techniques to achieve high performance across...

text · instruct · cheap
131,072 ctx · $0.70/1M in

Perplexity: Sonar

perplexity

Sonar is lightweight, affordable, fast, and simple to use, now featuring citations and the ability to customize sources. It is designed for companies seeking to integrate lightweight question-and-answer features...

text · vision · multimodal
127,072 ctx · $1.00/1M in

DeepSeek: R1 Distill Qwen 32B

deepseek

DeepSeek R1 Distill Qwen 32B is a distilled large language model based on [Qwen 2.5 32B](https://huggingface.co/Qwen/Qwen2.5-32B), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). It outperforms OpenAI's o1-mini across various benchmarks, achieving new...

text · cheap · long-context
128,000 ctx · $0.29/1M in

OpenAI: o3 Mini

openai

OpenAI o3-mini is a cost-efficient language model optimized for STEM reasoning tasks, particularly excelling in science, mathematics, and coding. This model supports the `reasoning_effort` parameter, which can be set to...

text · code · reasoning
200,000 ctx · $1.10/1M in

Qwen: Qwen-Plus

qwen

Qwen-Plus, based on the Qwen2.5 foundation model, is a 131K context model with a balanced performance, speed, and cost combination.

text · cheap · long-context
1,000,000 ctx · $0.26/1M in

Qwen: Qwen-Turbo

qwen

Qwen-Turbo, based on Qwen2.5, is a 1M context model that provides fast speed and low cost, suitable for simple tasks.

text · cheap · long-context
131,072 ctx · $0.03/1M in

Qwen: Qwen VL Max

qwen

Qwen VL Max is a visual understanding model with 7500 tokens context length. It excels in delivering optimal performance for a broader spectrum of complex tasks.

text · vision · multimodal
131,072 ctx · $0.52/1M in

AionLabs: Aion-1.0-Mini

aion-labs

Aion-1.0-Mini 32B parameter model is a distilled version of the DeepSeek-R1 model, designed for strong performance in reasoning domains such as mathematics, coding, and logic. It is a modified variant...

text · code · reasoning
131,072 ctx · $0.70/1M in

AionLabs: Aion-1.0

aion-labs

Aion-1.0 is a multi-model system designed for high performance across various tasks, including reasoning and coding. It is built on DeepSeek-R1, augmented with additional models and techniques such as Tree...

text · code · reasoning
131,072 ctx · $4.00/1M in

Qwen: Qwen VL Plus

qwen

Qwen's Enhanced Large Visual Language Model. Significantly upgraded for detailed recognition capabilities and text recognition abilities, supporting ultra-high pixel resolutions up to millions of pixels and extreme aspect ratios for...

text · vision · multimodal
131,072 ctx · $0.14/1M in

Google: Gemini 2.0 Flash

google

Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while maintaining quality on par with larger models like [Gemini Pro 1.5](/google/gemini-pro-1.5). It...

text · vision · multimodal
1,000,000 ctx · $0.10/1M in

OpenAI: o3 Mini High

openai

OpenAI o3-mini-high is the same model as [o3-mini](/openai/o3-mini) with reasoning_effort set to high. o3-mini is a cost-efficient language model optimized for STEM reasoning tasks, particularly excelling in science, mathematics, and...

text · reasoning · long-context
200,000 ctx · $1.10/1M in

Llama Guard 3 8B

meta-llama

Llama Guard 3 is a Llama-3.1-8B pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classify content in both LLM inputs (prompt classification)...

text · cheap · long-context
131,072 ctx · $0.02/1M in

Anthropic: Claude 3.7 Sonnet

anthropic

Claude 3.7 Sonnet is an advanced large language model with improved reasoning, coding, and problem-solving capabilities. It introduces a hybrid reasoning approach, allowing users to choose between rapid responses and...

text · vision · multimodal
200,000 ctx · $3.00/1M in

Google: Gemini 2.0 Flash Lite

google

Gemini 2.0 Flash Lite offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while maintaining quality on par with larger models like [Gemini Pro 1.5](/google/gemini-pro-1.5),...

text · vision · multimodal
1,048,576 ctx · $0.07/1M in

Qwen: QwQ 32B

qwen

QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ, which is capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks,...

text · reasoning · cheap
131,072 ctx · $0.15/1M in

Perplexity: Sonar Deep Research

perplexity

Sonar Deep Research is a research-focused model designed for multi-step retrieval, synthesis, and reasoning across complex topics. It autonomously searches, reads, and evaluates sources, refining its approach as it gathers...

text · reasoning · long-context
128,000 ctx · $2.00/1M in

Perplexity: Sonar Pro

perplexity

Note: Sonar Pro pricing includes Perplexity search pricing. See [details here](https://docs.perplexity.ai/guides/pricing#detailed-pricing-breakdown-for-sonar-reasoning-pro-and-sonar-pro) For enterprises seeking more advanced capabilities, the Sonar Pro API can handle in-depth, multi-step queries with added extensibility, like...

text · vision · multimodal
200,000 ctx · $3.00/1M in

Perplexity: Sonar Reasoning Pro

perplexity

Note: Sonar Pro pricing includes Perplexity search pricing. See [details here](https://docs.perplexity.ai/guides/pricing#detailed-pricing-breakdown-for-sonar-reasoning-pro-and-sonar-pro) Sonar Reasoning Pro is a premier reasoning model powered by DeepSeek R1 with Chain of Thought (CoT). Designed for...

text · vision · multimodal
128,000 ctx · $2.00/1M in

Google: Gemma 3 27B

google

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

text · vision · multimodal
131,072 ctx · $0.08/1M in

OpenAI: GPT-4o Search Preview

openai

GPT-4o Search Preview is a specialized model for web search in Chat Completions. It is trained to understand and execute web search queries.

text · long-context
128,000 ctx · $2.50/1M in

OpenAI: GPT-4o-mini Search Preview

openai

GPT-4o mini Search Preview is a specialized model for web search in Chat Completions. It is trained to understand and execute web search queries.

text · cheap · long-context
128,000 ctx · $0.15/1M in
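
Each entry on this page lists a context window (ctx) and a price per 1M input tokens. As a rough sketch of how those two figures might be used together, the snippet below estimates the input-side cost of a prompt and checks whether it fits the listed window. The function names and the 100,000-token prompt are illustrative assumptions, not part of the catalogue. Output-token pricing is not shown on this page, so it is left out.

```python
from decimal import Decimal


def input_cost_usd(prompt_tokens: int, usd_per_million_in: str) -> Decimal:
    """Input-side cost only, assuming a flat per-1M-token rate (hypothetical helper)."""
    return Decimal(prompt_tokens) * Decimal(usd_per_million_in) / Decimal(1_000_000)


def fits_context(prompt_tokens: int, context_tokens: int) -> bool:
    """True if the prompt alone fits in the listed context window (hypothetical helper)."""
    return prompt_tokens <= context_tokens


# (context window, USD per 1M input tokens), copied from three entries on this page.
listing = {
    "Qwen2.5 Coder 32B Instruct": (128_000, "0.66"),
    "Amazon: Nova Micro 1.0": (128_000, "0.04"),
    "OpenAI: o1": (200_000, "15.00"),
}

prompt_tokens = 100_000  # assumed prompt size, for illustration only

for name, (ctx, price) in listing.items():
    cost = input_cost_usd(prompt_tokens, price)
    print(f"{name}: fits={fits_context(prompt_tokens, ctx)}, input cost=${cost}")
```

Prices are passed as strings and handled with `Decimal` so that small per-token amounts are not distorted by binary floating-point rounding.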