modelstop.top
AI Model Catalogue

Browse 341 models across providers, modalities, and use cases.

🌐 All Models

341 models · Page 9 of 10

AI21 Jamba 1.6 Mini

ai21

AI21 Jamba 1.6 Mini is a lightweight Mamba-Transformer hybrid optimized for cost-effective, high-throughput inference with an impressive 256K context window. An excellent choice for document-heavy workloads on a budget.

long-context · instruct · cheap
256,000 ctx · $0.20/1M in

AI21 Jamba 1.6 Large

ai21

AI21 Jamba 1.6 Large uses a hybrid Mamba-Transformer architecture offering low memory footprint and high throughput compared to equivalent Transformer models. Features 256K context at a fraction of the inference cost.

long-context · instruct · cheap
256,000 ctx · $2.00/1M in
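
The per-1M-token input prices in these listings can be turned into a rough per-request cost. The sketch below is illustrative only: the helper name `input_cost_usd` is hypothetical, it assumes billing scales linearly with input tokens, and it ignores output-token pricing, which this page does not list.

```python
def input_cost_usd(tokens: int, price_per_million_usd: float) -> float:
    """Estimate input cost, assuming linear per-token billing (an assumption)."""
    return tokens / 1_000_000 * price_per_million_usd


# Figures taken from the two Jamba 1.6 listings: 256,000 ctx,
# $0.20/1M in (Mini) and $2.00/1M in (Large).
CONTEXT_TOKENS = 256_000
mini = input_cost_usd(CONTEXT_TOKENS, 0.20)
large = input_cost_usd(CONTEXT_TOKENS, 2.00)

print(f"Jamba 1.6 Mini:  ${mini:.3f} per full-context prompt")
print(f"Jamba 1.6 Large: ${large:.3f} per full-context prompt")
```

Under these assumptions, filling the whole 256K window costs roughly $0.05 on Mini and $0.51 on Large for input alone.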

Microsoft Phi-4 Mini

microsoft

Microsoft Phi-4 Mini is a 3.8B parameter compact model from Microsoft. Delivers impressive reasoning capabilities for edge and mobile deployment scenarios, with strong performance on math and coding tasks relative to its size.

reasoning · code · instruct
Run locally
128,000 ctx · $0.00/1M in

IBM Granite 3.0 2B Instruct

IBM Research

IBM Granite 3.0 2B Instruct is an ultra-compact enterprise model excelling at summarization, extraction, and classification. The smallest model in the Granite family, suitable for edge deployments and constrained environments.

instruct · open-source · cheap
Run locally
128,000 ctx · $0.00/1M in

IBM Granite 3.0 8B Instruct

IBM Research

IBM Granite 3.0 8B Instruct is a lightweight enterprise-grade language model trained on a carefully curated enterprise corpus and optimized for RAG, summarization, classification, and code generation in business contexts.

code · instruct · open-source
Run locally
128,000 ctx · $0.00/1M in

Amazon Titan Text Express

amazon

Amazon Titan Text Express is a generative LLM for summarization, text generation, classification, open-ended Q&A, and information extraction. Optimized for enterprise workloads via AWS Bedrock.

instruct
8,192 ctx · $0.20/1M in

Amazon Nova Micro

amazon

Amazon Nova Micro is the fastest and most cost-effective text-only model in the Nova family, optimized for speed and low latency. Ideal for customer service, summarization, and translation at scale.

cheap · instruct
128,000 ctx · $0.04/1M in

OpenAI: GPT-3.5 Turbo Instruct

openai

This model is a variant of GPT-3.5 Turbo tuned for instructional prompts and omitting chat-related optimizations. Training data: up to Sep 2021.

text · instruct
4,095 ctx · $1.50/1M in

Mistral: Mistral 7B Instruct v0.1

mistralai

A 7.3B parameter model that outperforms Llama 2 13B on all benchmarks, with optimizations for speed and context length.

text · instruct · cheap
2,824 ctx · $0.11/1M in

Mistral: Mixtral 8x7B Instruct

mistralai

Mixtral 8x7B Instruct is a pretrained generative Sparse Mixture of Experts, by Mistral AI, for chat and instruction use. Incorporates 8 experts (feed-forward networks) for a total of 47 billion...

text · instruct · cheap
32,768 ctx · $0.54/1M in

Mistral: Mixtral 8x22B Instruct

mistralai

Mistral's official instruct fine-tuned version of [Mixtral 8x22B](/models/mistralai/mixtral-8x22b). It uses 39B active parameters out of 141B, offering unparalleled cost efficiency for its size. Its strengths include: - strong math, coding,...

text · code · instruct
65,536 ctx · $2.00/1M in

Meta: Llama 3 70B Instruct

meta-llama

Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 70B instruct-tuned version was optimized for high-quality dialogue use cases. It has demonstrated strong...

text · instruct · cheap
Run locally
8,192 ctx · $0.51/1M in

Meta: Llama 3 8B Instruct

meta-llama

Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 8B instruct-tuned version was optimized for high-quality dialogue use cases. It has demonstrated strong...

text · instruct · cheap
8,192 ctx · $0.03/1M in

Meta: Llama 3.1 70B Instruct

meta-llama

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 70B instruct-tuned version is optimized for high-quality dialogue use cases. It has demonstrated strong...

text · instruct · cheap
131,072 ctx · $0.40/1M in

Meta: Llama 3.1 8B Instruct

meta-llama

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 8B instruct-tuned version is fast and efficient. It has demonstrated strong performance compared to...

text · instruct · cheap
16,384 ctx · $0.02/1M in

Nous: Hermes 3 405B Instruct (free)

nousresearch

Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...

text · reasoning · agents
131,072 ctx · $1.00/1M in

Nous: Hermes 3 70B Instruct

nousresearch

Hermes 3 is a generalist language model with many improvements over [Hermes 2](/models/nousresearch/nous-hermes-2-mistral-7b-dpo), including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...

text · reasoning · agents
Run locally
131,072 ctx · $0.30/1M in

Qwen2.5 72B Instruct

qwen

Qwen2.5 72B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and...

text · code · instruct
Run locally
131,072 ctx · $0.12/1M in

Meta: Llama 3.2 11B Vision Instruct

meta-llama

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...

text · vision · multimodal
131,072 ctx · $0.24/1M in

Meta: Llama 3.2 1B Instruct

meta-llama

Llama 3.2 1B is a 1-billion-parameter language model focused on efficiently performing natural language tasks, such as summarization, dialogue, and multilingual text analysis. Its smaller size allows it to operate...

text · multilingual · instruct
Run locally
131,072 ctx · $0.03/1M in

Meta: Llama 3.2 3B Instruct (free)

meta-llama

Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for advanced natural language processing tasks like dialogue generation, reasoning, and summarization. Designed with the latest transformer architecture, it...

text · reasoning · multilingual
Run locally
131,072 ctx · $0.05/1M in

NVIDIA: Llama 3.1 Nemotron 70B Instruct

nvidia

NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinforcement Learning from Human Feedback (RLHF), it excels...

text · instruct · long-context
131,072 ctx · $1.20/1M in

Qwen: Qwen2.5 7B Instruct

qwen

Qwen2.5 7B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and...

text · code · instruct
Run locally
131,072 ctx · $0.04/1M in

Magnum v4 72B

anthracite-org

This is a series of models designed to replicate the prose quality of the Claude 3 models, specifically [Sonnet](https://openrouter.ai/anthropic/claude-3.5-sonnet) and [Opus](https://openrouter.ai/anthropic/claude-3-opus). The model is fine-tuned on top of [Qwen2.5 72B](https://openrouter.ai/qwen/qwen-2.5-72b-instruct).

text · instruct
Run locally
32,768 ctx · $3.00/1M in

Qwen2.5 Coder 32B Instruct

qwen

Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly known as CodeQwen). Qwen2.5-Coder brings the following improvements upon CodeQwen1.5: - Significant improvements in **code generation**, **code reasoning**...

text · code · reasoning
Run locally
128,000 ctx · $0.66/1M in

Meta: Llama 3.3 70B Instruct (free)

meta-llama

The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction tuned generative model in 70B (text in/text out). The Llama 3.3 instruction tuned text only model...

text · multilingual · instruct
Run locally
131,072 ctx · $0.10/1M in

DeepSeek: R1 Distill Llama 70B

deepseek

DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). The model combines advanced distillation techniques to achieve high performance across...

text · instruct · cheap
Run locally
131,072 ctx · $0.70/1M in

Qwen: Qwen2.5 VL 72B Instruct

qwen

Qwen2.5-VL is proficient in recognizing common objects such as flowers, birds, fish, and insects. It is also highly capable of analyzing texts, charts, icons, graphics, and layouts within images.

text · vision · multimodal
32,000 ctx · $0.80/1M in

AllenAI: Olmo 2 32B Instruct

allenai

OLMo-2 32B Instruct is a supervised instruction-finetuned variant of the OLMo-2 32B March 2025 base model. It excels in complex reasoning and instruction-following tasks across diverse benchmarks such as GSM8K,...

text · reasoning · instruct
128,000 ctx · $0.05/1M in

Mistral: Mistral Small 3.1 24B

mistralai

Mistral Small 3.1 24B Instruct is an upgraded variant of Mistral Small 3 (2501), featuring 24 billion parameters with advanced multimodal capabilities. It provides state-of-the-art performance in text-based reasoning and...

text · vision · multimodal
128,000 ctx · $0.03/1M in

Qwen: Qwen2.5 VL 32B Instruct

qwen

Qwen2.5-VL-32B is a multimodal vision-language model fine-tuned through reinforcement learning for enhanced mathematical reasoning, structured outputs, and visual problem-solving capabilities. It excels at visual analysis tasks, including object recognition, textual...

text · vision · multimodal
128,000 ctx · $0.20/1M in

Meta: Llama 4 Scout

meta-llama

Llama 4 Scout 17B Instruct (16E) is a mixture-of-experts (MoE) language model developed by Meta, activating 17 billion parameters out of a total of 109B. It supports native multimodal input...

text · vision · multimodal
327,680 ctx · $0.08/1M in

Meta: Llama 4 Maverick

meta-llama

Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward...

text · vision · multimodal
Run locally
1,048,576 ctx · $0.15/1M in

NVIDIA: Llama 3.1 Nemotron Ultra 253B v1

nvidia

Llama-3.1-Nemotron-Ultra-253B-v1 is a large language model (LLM) optimized for advanced reasoning, human-interactive chat, retrieval-augmented generation (RAG), and tool-calling tasks. Derived from Meta's Llama-3.1-405B-Instruct, it has been significantly customized using Neural...

reasoning · instruct · cheap
131,072 ctx · $0.60/1M in

AlfredPros: CodeLLaMa 7B Instruct Solidity

alfredpros

A fine-tuned 7-billion-parameter Code LLaMA Instruct model for generating Solidity smart contracts, trained with 4-bit QLoRA fine-tuning via the PEFT library.

text · code · instruct
4,096 ctx · $0.80/1M in

Qwen: Qwen2.5 Coder 7B Instruct

qwen

Qwen2.5-Coder-7B-Instruct is a 7B parameter instruction-tuned language model optimized for code-related tasks such as code generation, reasoning, and bug fixing. Based on the Qwen2.5 architecture, it incorporates enhancements like RoPE,...

code · reasoning · instruct
32,768 ctx · $0.03/1M in