modelstop.top
AI Model Catalogue

Browse 196 models across providers, modalities, and use cases.

196 models · Page 5 of 6

AI21 Jamba 1.6 Large

ai21

AI21 Jamba 1.6 Large uses a hybrid Mamba-Transformer architecture offering low memory footprint and high throughput compared to equivalent Transformer models. Features 256K context at a fraction of the inference cost.

long-context · instruct · cheap
256,000 ctx · $2.00/1M in

Microsoft Phi-4 Mini

microsoft

Microsoft Phi-4 Mini is a 3.8B parameter compact model from Microsoft. Delivers impressive reasoning capabilities for edge and mobile deployment scenarios, with strong performance on math and coding tasks relative to its size.

reasoning · code · instruct
128,000 ctx · $0.00/1M in

IBM Granite 3.0 2B Instruct

IBM Research

IBM Granite 3.0 2B Instruct is an ultra-compact enterprise model excelling at summarization, extraction, and classification. The smallest model in the Granite family, suitable for edge deployments and constrained environments.

instruct · open-source · cheap
128,000 ctx · $0.00/1M in

IBM Granite 3.0 8B Instruct

IBM Research

IBM Granite 3.0 8B Instruct is a lightweight enterprise-grade language model trained on a carefully curated enterprise corpus and optimized for RAG, summarization, classification, and code generation in business contexts.

code · instruct · open-source
128,000 ctx · $0.00/1M in

Amazon Titan Text Express

amazon

Amazon Titan Text Express is a generative LLM for summarization, text generation, classification, open-ended Q&A, and information extraction. Optimized for enterprise workloads via AWS Bedrock.

instruct
8,192 ctx · $0.20/1M in

Amazon Nova Micro

amazon

Amazon Nova Micro is the fastest and most cost-effective text-only model in the Nova family, optimized for speed and low latency. Ideal for customer service, summarization, and translation at scale.

cheap · instruct
128,000 ctx · $0.04/1M in

OpenAI: GPT-3.5 Turbo Instruct

openai

This model is a variant of GPT-3.5 Turbo tuned for instructional prompts and omitting chat-related optimizations. Training data: up to Sep 2021.

text · instruct
4,095 ctx · $1.50/1M in

Mistral: Mistral 7B Instruct v0.1

mistralai

A 7.3B parameter model that outperforms Llama 2 13B on all benchmarks, with optimizations for speed and context length.

text · instruct · cheap
2,824 ctx · $0.11/1M in

Mistral: Mixtral 8x7B Instruct

mistralai

Mixtral 8x7B Instruct is a pretrained generative Sparse Mixture of Experts, by Mistral AI, for chat and instruction use. Incorporates 8 experts (feed-forward networks) for a total of 47 billion...

text · instruct · cheap
32,768 ctx · $0.54/1M in

Mistral: Mixtral 8x22B Instruct

mistralai

Mistral's official instruct fine-tuned version of [Mixtral 8x22B](/models/mistralai/mixtral-8x22b). It uses 39B active parameters out of 141B, offering unparalleled cost efficiency for its size. Its strengths include: - strong math, coding,...

text · code · instruct
65,536 ctx · $2.00/1M in
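
The "39B active parameters out of 141B" figure in the Mixtral 8x22B card can be read as a ratio: only part of a mixture-of-experts model is used for each token. The snippet below is a minimal sketch of that arithmetic; the function name `active_fraction` is hypothetical, and the only inputs are the parameter counts quoted on this page (the 17B of 109B figure comes from the Llama 4 Scout card).

```python
def active_fraction(active_billions: float, total_billions: float) -> float:
    """Share of a mixture-of-experts model's parameters used per token."""
    if total_billions <= 0 or active_billions < 0:
        raise ValueError("parameter counts must be positive")
    if active_billions > total_billions:
        raise ValueError("active parameters cannot exceed total parameters")
    return active_billions / total_billions


# Figures quoted in the cards on this page (in billions of parameters).
mixtral_8x22b = active_fraction(39, 141)
llama_4_scout = active_fraction(17, 109)

if __name__ == "__main__":
    print(f"Mixtral 8x22B: {mixtral_8x22b:.1%} of parameters active per token")
    print(f"Llama 4 Scout: {llama_4_scout:.1%} of parameters active per token")
```

A lower ratio suggests less compute per token relative to total model size, though actual cost and speed also depend on the provider's serving setup.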

Meta: Llama 3 70B Instruct

meta-llama

Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 70B instruct-tuned version was optimized for high quality dialogue use cases. It has demonstrated strong...

text · instruct · cheap
8,192 ctx · $0.51/1M in

Meta: Llama 3 8B Instruct

meta-llama

Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 8B instruct-tuned version was optimized for high quality dialogue use cases. It has demonstrated strong...

text · instruct · cheap
8,192 ctx · $0.03/1M in

Meta: Llama 3.1 70B Instruct

meta-llama

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 70B instruct-tuned version is optimized for high quality dialogue use cases. It has demonstrated strong...

text · instruct · cheap
131,072 ctx · $0.40/1M in

Meta: Llama 3.1 8B Instruct

meta-llama

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 8B instruct-tuned version is fast and efficient. It has demonstrated strong performance compared to...

text · instruct · cheap
16,384 ctx · $0.02/1M in

Nous: Hermes 3 405B Instruct (free)

nousresearch

Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...

text · reasoning · agents
131,072 ctx · $1.00/1M in

Nous: Hermes 3 70B Instruct

nousresearch

Hermes 3 is a generalist language model with many improvements over [Hermes 2](/models/nousresearch/nous-hermes-2-mistral-7b-dpo), including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...

text · reasoning · agents
131,072 ctx · $0.30/1M in

Qwen2.5 72B Instruct

qwen

Qwen2.5 72B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and...

text · code · instruct
32,768 ctx · $0.12/1M in

Meta: Llama 3.2 11B Vision Instruct

meta-llama

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...

text · vision · multimodal
131,072 ctx · $0.24/1M in

Meta: Llama 3.2 1B Instruct

meta-llama

Llama 3.2 1B is a 1-billion-parameter language model focused on efficiently performing natural language tasks, such as summarization, dialogue, and multilingual text analysis. Its smaller size allows it to operate...

text · multilingual · instruct
60,000 ctx · $0.03/1M in

Meta: Llama 3.2 3B Instruct (free)

meta-llama

Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for advanced natural language processing tasks like dialogue generation, reasoning, and summarization. Designed with the latest transformer architecture, it...

text · reasoning · multilingual
131,072 ctx · $0.05/1M in

NVIDIA: Llama 3.1 Nemotron 70B Instruct

nvidia

NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinforcement Learning from Human Feedback (RLHF), it excels...

text · instruct · long-context
131,072 ctx · $1.20/1M in

Qwen: Qwen2.5 7B Instruct

qwen

Qwen2.5 7B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and...

text · code · instruct
32,768 ctx · $0.04/1M in

Magnum v4 72B

anthracite-org

This is a series of models designed to replicate the prose quality of the Claude 3 models, specifically [Sonnet](https://openrouter.ai/anthropic/claude-3.5-sonnet) and [Opus](https://openrouter.ai/anthropic/claude-3-opus). The model is fine-tuned on top of [Qwen2.5 72B](https://openrouter.ai/qwen/qwen-2.5-72b-instruct).

text · instruct
16,384 ctx · $3.00/1M in

Qwen2.5 Coder 32B Instruct

qwen

Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly known as CodeQwen). Qwen2.5-Coder brings the following improvements upon CodeQwen1.5: - Significant improvements in **code generation**, **code reasoning**...

text · code · reasoning
32,768 ctx · $0.66/1M in

Meta: Llama 3.3 70B Instruct (free)

meta-llama

The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction tuned generative model in 70B (text in/text out). The Llama 3.3 instruction tuned text only model...

text · multilingual · instruct
65,536 ctx · $0.10/1M in

DeepSeek: R1 Distill Llama 70B

deepseek

DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). The model combines advanced distillation techniques to achieve high performance across...

text · instruct · cheap
131,072 ctx · $0.70/1M in

Qwen: Qwen2.5 VL 72B Instruct

qwen

Qwen2.5-VL is proficient in recognizing common objects such as flowers, birds, fish, and insects. It is also highly capable of analyzing texts, charts, icons, graphics, and layouts within images.

text · vision · multimodal
32,000 ctx · $0.80/1M in

AllenAI: Olmo 2 32B Instruct

allenai

OLMo-2 32B Instruct is a supervised instruction-finetuned variant of the OLMo-2 32B March 2025 base model. It excels in complex reasoning and instruction-following tasks across diverse benchmarks such as GSM8K,...

text · reasoning · instruct
128,000 ctx · $0.05/1M in

Mistral: Mistral Small 3.1 24B

mistralai

Mistral Small 3.1 24B Instruct is an upgraded variant of Mistral Small 3 (2501), featuring 24 billion parameters with advanced multimodal capabilities. It provides state-of-the-art performance in text-based reasoning and...

text · vision · multimodal
128,000 ctx · $0.03/1M in

Qwen: Qwen2.5 VL 32B Instruct

qwen

Qwen2.5-VL-32B is a multimodal vision-language model fine-tuned through reinforcement learning for enhanced mathematical reasoning, structured outputs, and visual problem-solving capabilities. It excels at visual analysis tasks, including object recognition, textual...

text · vision · multimodal
128,000 ctx · $0.20/1M in

Meta: Llama 4 Scout

meta-llama

Llama 4 Scout 17B Instruct (16E) is a mixture-of-experts (MoE) language model developed by Meta, activating 17 billion parameters out of a total of 109B. It supports native multimodal input...

text · vision · multimodal
327,680 ctx · $0.08/1M in

Meta: Llama 4 Maverick

meta-llama

Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward...

text · vision · multimodal
1,048,576 ctx · $0.15/1M in

NVIDIA: Llama 3.1 Nemotron Ultra 253B v1

nvidia

Llama-3.1-Nemotron-Ultra-253B-v1 is a large language model (LLM) optimized for advanced reasoning, human-interactive chat, retrieval-augmented generation (RAG), and tool-calling tasks. Derived from Meta’s Llama-3.1-405B-Instruct, it has been significantly customized using Neural...

reasoning · instruct · cheap
131,072 ctx · $0.60/1M in

AlfredPros: CodeLLaMa 7B Instruct Solidity

alfredpros

A fine-tuned 7-billion-parameter Code LLaMA - Instruct model that generates Solidity smart contracts, trained with 4-bit QLoRA fine-tuning via the PEFT library.

text · code · instruct
4,096 ctx · $0.80/1M in

Qwen: Qwen2.5 Coder 7B Instruct

qwen

Qwen2.5-Coder-7B-Instruct is a 7B parameter instruction-tuned language model optimized for code-related tasks such as code generation, reasoning, and bug fixing. Based on the Qwen2.5 architecture, it incorporates enhancements like RoPE,...

code · reasoning · instruct
32,768 ctx · $0.03/1M in

Arcee AI: Coder Large

arcee-ai

Coder‑Large is a 32 B‑parameter offspring of Qwen 2.5‑Instruct that has been further trained on permissively‑licensed GitHub, CodeSearchNet and synthetic bug‑fix corpora. It supports a 32k context window, enabling multi‑file...

text · code · instruct
32,768 ctx · $0.50/1M in
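
The "$/1M in" figure on each card is a price per one million input tokens, so a per-request estimate is a simple proportion. The sketch below assumes that reading and covers input tokens only (output pricing is not shown on this page). The function name `input_cost_usd` and the 50,000-token prompt are illustrative assumptions; the prices are copied from the cards above.

```python
def input_cost_usd(prompt_tokens: int, price_per_million_usd: float) -> float:
    """Input-side cost in USD for a prompt, given a price per 1M input tokens."""
    if prompt_tokens < 0 or price_per_million_usd < 0:
        raise ValueError("tokens and price must be non-negative")
    return prompt_tokens * price_per_million_usd / 1_000_000


# Listed input prices (USD per 1M tokens) for three cards on this page.
LISTED_PRICES = {
    "AI21 Jamba 1.6 Large": 2.00,
    "Amazon Nova Micro": 0.04,
    "Magnum v4 72B": 3.00,
}

if __name__ == "__main__":
    prompt_tokens = 50_000  # hypothetical prompt size
    for model, price in LISTED_PRICES.items():
        cost = input_cost_usd(prompt_tokens, price)
        print(f"{model}: ${cost:.4f} for {prompt_tokens:,} input tokens")
```

The prompt also has to fit within the model's listed context window (the "ctx" figure), which this estimate does not check.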