modelstop.top
Home/All Models

AI Model Catalogue

Browse 408 models across providers, modalities, and use cases.

๐Ÿ“„ Long Context

408 models ยท Page 5 of 12

Qwen3-Coder-30B-A3B-Instruct

qwen

Open-source Qwen3-Coder-30B-A3B-Instruct model from qwen โ€” available for download and self-hosting on Hugging Face.

textcodeinstruct
160,000 ctx$0.00/1M in
Explore specs and pricingView details โ†’

gpt-oss-120b

openai

Open-source gpt-oss-120b model from openai โ€” available for download and self-hosting on Hugging Face.

textfreelong-context
131,072 ctx$0.00/1M in
Explore specs and pricingView details โ†’

gpt-oss-20b

openai

Open-source gpt-oss-20b model from openai โ€” available for download and self-hosting on Hugging Face.

textfreelong-context
131,072 ctx$0.00/1M in
Explore specs and pricingView details โ†’

Qwen3 235B A22B

alibaba

Qwen3 235B A22B is Alibaba's flagship mixture-of-experts model with 235B total parameters and 22B active per token. Delivers frontier-level performance on coding, reasoning, and multilingual tasks at significantly lower inference cost.

codereasoningmultilingual
128,000 ctx$0.00/1M in
Explore specs and pricingView details โ†’

AI21 Jamba 1.6 Mini

ai21

AI21 Jamba 1.6 Mini is a lightweight Mamba-Transformer hybrid optimized for cost-effective, high-throughput inference with an impressive 256K context window. An excellent choice for document-heavy workloads on a budget.

long-contextinstructcheap
256,000 ctx$0.20/1M in
Explore specs and pricingView details โ†’

AI21 Jamba 1.6 Large

ai21

AI21 Jamba 1.6 Large uses a hybrid Mamba-Transformer architecture offering low memory footprint and high throughput compared to equivalent Transformer models. Features 256K context at a fraction of the inference cost.

long-contextinstructcheap
256,000 ctx$2.00/1M in
Explore specs and pricingView details โ†’

Microsoft Phi-4 Mini

microsoft

Microsoft Phi-4 Mini is a 3.8B parameter compact model from Microsoft. Delivers impressive reasoning capabilities for edge and mobile deployment scenarios, with strong performance on math and coding tasks relative to its size.

reasoningcodeinstruct
128,000 ctx$0.00/1M in
Explore specs and pricingView details โ†’

IBM Granite 3.0 2B Instruct

IBM Research

IBM Granite 3.0 2B Instruct is an ultra-compact enterprise model excelling at summarization, extraction, and classification. The smallest model in the Granite family, suitable for edge deployments and constrained environments.

instructopen-sourcecheap
128,000 ctx$0.00/1M in
Explore specs and pricingView details โ†’

IBM Granite 3.0 8B Instruct

IBM Research

IBM Granite 3.0 8B Instruct is a lightweight enterprise-grade language model trained on a carefully curated enterprise corpus and optimized for RAG, summarization, classification, and code generation in business contexts.

codeinstructopen-source
128,000 ctx$0.00/1M in
Explore specs and pricingView details โ†’

Amazon Nova Pro

amazon

Amazon Nova Pro is a highly capable multimodal model with the best combination of accuracy, speed, and cost across a wide range of tasks. Supports text, image, and video inputs.

visionmultimodallong-context
300,000 ctx$0.80/1M in
Explore specs and pricingView details โ†’

Amazon Nova Lite

amazon

Amazon Nova Lite is a very low-cost multimodal model that can process image, video, and text inputs. Fast and accurate for a wide range of tasks requiring visual and language understanding.

visionmultimodalcheap
300,000 ctx$0.06/1M in
Explore specs and pricingView details โ†’

OpenAI: GPT-4 Turbo (older v1106)

openai

The latest GPT-4 Turbo model with vision capabilities. Vision requests can now use JSON mode and function calling. Training data: up to April 2023.

textvisionlong-context
128,000 ctx$10.00/1M in
Explore specs and pricingView details โ†’

Auto Router

openrouter

Your prompt will be processed by a meta-model and routed to one of dozens of models (see below), optimizing for the best possible output. To see which model was used,...

textvisionmultimodal
2,000,000 ctxFree in
Explore specs and pricingView details โ†’

OpenAI: GPT-4 Turbo Preview

openai

The preview GPT-4 model with improved instruction following, JSON mode, reproducible outputs, parallel function calling, and more. Training data: up to Dec 2023. **Note:** heavily rate limited by OpenAI while...

textlong-context
128,000 ctx$10.00/1M in
Explore specs and pricingView details โ†’

Mistral Large

mistralai

This is Mistral AI's flagship model, Mistral Large 2 (version `mistral-large-2407`). It's a proprietary weights-available model and excels at reasoning, code, JSON, chat, and more. Read the launch announcement [here](https://mistral.ai/news/mistral-large-2407/)....

textcodereasoning
128,000 ctx$2.00/1M in
Explore specs and pricingView details โ†’

Anthropic: Claude 3 Haiku

anthropic

Claude 3 Haiku is Anthropic's fastest and most compact model for near-instant responsiveness. Quick and accurate targeted performance. See the launch announcement and benchmark results [here](https://www.anthropic.com/news/claude-3-haiku) #multimodal

textvisionmultimodal
200,000 ctx$0.25/1M in
Explore specs and pricingView details โ†’

OpenAI: GPT-4 Turbo

openai

The latest GPT-4 Turbo model with vision capabilities. Vision requests can now use JSON mode and function calling. Training data: up to December 2023.

textvisionmultimodal
128,000 ctx$10.00/1M in
Explore specs and pricingView details โ†’

OpenAI: GPT-4o

openai

GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of [GPT-4 Turbo](/models/openai/gpt-4-turbo) while being twice as...

textvisionmultimodal
128,000 ctx$2.50/1M in
Explore specs and pricingView details โ†’

OpenAI: GPT-4o (2024-05-13)

openai

GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of [GPT-4 Turbo](/models/openai/gpt-4-turbo) while being twice as...

textvisionmultimodal
128,000 ctx$5.00/1M in
Explore specs and pricingView details โ†’

OpenAI: GPT-4o-mini

openai

GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...

textvisionmultimodal
128,000 ctx$0.15/1M in
Explore specs and pricingView details โ†’

OpenAI: GPT-4o-mini (2024-07-18)

openai

GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...

textvisionmultimodal
128,000 ctx$0.15/1M in
Explore specs and pricingView details โ†’

Mistral: Mistral Nemo

mistralai

A 12B parameter model with a 128k token context length built by Mistral in collaboration with NVIDIA. The model is multilingual, supporting English, French, German, Spanish, Italian, Portuguese, Chinese, Japanese,...

textmultilingualcheap
131,072 ctx$0.02/1M in
Explore specs and pricingView details โ†’

Meta: Llama 3.1 70B Instruct

meta-llama

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 70B instruct-tuned version is optimized for high quality dialogue usecases. It has demonstrated strong...

textinstructcheap
131,072 ctx$0.40/1M in
Explore specs and pricingView details โ†’

OpenAI: GPT-4o (2024-08-06)

openai

The 2024-08-06 version of GPT-4o offers improved performance in structured outputs, with the ability to supply a JSON schema in the respone_format. Read more [here](https://openai.com/index/introducing-structured-outputs-in-the-api/). GPT-4o ("o" for "omni") is...

textvisionmultimodal
128,000 ctx$2.50/1M in
Explore specs and pricingView details โ†’

Nous: Hermes 3 405B Instruct (free)

nousresearch

Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...

textreasoningagents
131,072 ctx$1.00/1M in
Explore specs and pricingView details โ†’

Nous: Hermes 3 70B Instruct

nousresearch

Hermes 3 is a generalist language model with many improvements over [Hermes 2](/models/nousresearch/nous-hermes-2-mistral-7b-dpo), including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...

textreasoningagents
131,072 ctx$0.30/1M in
Explore specs and pricingView details โ†’

Sao10K: Llama 3.1 Euryale 70B v2.2

sao10k

Euryale L3.1 70B v2.2 is a model focused on creative roleplay from [Sao10k](https://ko-fi.com/sao10k). It is the successor of [Euryale L3 70B v2.1](/models/sao10k/l3-euryale-70b).

textcheaplong-context
131,072 ctx$0.85/1M in
Explore specs and pricingView details โ†’

Cohere: Command R (08-2024)

cohere

command-r-08-2024 is an update of the [Command R](/models/cohere/command-r) with improved performance for multilingual retrieval-augmented generation (RAG) and tool use. More broadly, it is better at math, code and reasoning and...

textcodereasoning
128,000 ctx$0.15/1M in
Explore specs and pricingView details โ†’

Cohere: Command R+ (08-2024)

cohere

command-r-plus-08-2024 is an update of the [Command R+](/models/cohere/command-r-plus) with roughly 50% higher throughput and 25% lower latencies as compared to the previous Command R+ version, while keeping the hardware footprint...

textlong-context
128,000 ctx$2.50/1M in
Explore specs and pricingView details โ†’

Meta: Llama 3.2 11B Vision Instruct

meta-llama

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...

textvisionmultimodal
131,072 ctx$0.24/1M in
Explore specs and pricingView details โ†’

Meta: Llama 3.2 3B Instruct (free)

meta-llama

Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for advanced natural language processing tasks like dialogue generation, reasoning, and summarization. Designed with the latest transformer architecture, it...

textreasoningmultilingual
131,072 ctx$0.05/1M in
Explore specs and pricingView details โ†’

NVIDIA: Llama 3.1 Nemotron 70B Instruct

nvidia

NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinforcement Learning from Human Feedback (RLHF), it excels...

textinstructlong-context
131,072 ctx$1.20/1M in
Explore specs and pricingView details โ†’

Anthropic: Claude 3.5 Haiku

anthropic

Claude 3.5 Haiku features offers enhanced capabilities in speed, coding accuracy, and tool use. Engineered to excel in real-time applications, it delivers quick response times that are essential for dynamic...

textvisionmultimodal
200,000 ctx$0.80/1M in
Explore specs and pricingView details โ†’

Mistral: Pixtral Large 2411

mistralai

Pixtral Large is a 124B parameter, open-weight, multimodal model built on top of [Mistral Large 2](/mistralai/mistral-large-2411). The model is able to understand documents, charts and natural images. The model is...

textvisionmultimodal
131,072 ctx$2.00/1M in
Explore specs and pricingView details โ†’

Mistral Large 2407

mistralai

This is Mistral AI's flagship model, Mistral Large 2 (version mistral-large-2407). It's a proprietary weights-available model and excels at reasoning, code, JSON, chat, and more. Read the launch announcement [here](https://mistral.ai/news/mistral-large-2407/)....

textcodereasoning
131,072 ctx$2.00/1M in
Explore specs and pricingView details โ†’

Mistral Large 2411

mistralai

Mistral Large 2 2411 is an update of [Mistral Large 2](/mistralai/mistral-large) released together with [Pixtral Large 2411](/mistralai/pixtral-large-2411) It provides a significant upgrade on the previous [Mistral Large 24.07](/mistralai/mistral-large-2407), with notable...

textlong-context
131,072 ctx$2.00/1M in
Explore specs and pricingView details โ†’