modelstop.top
Home/All Models

AI Model Catalogue

Browse 287 models across providers, modalities, and use cases.

🌐 All Models

287 models Β· Page 7 of 8

Baidu: ERNIE 4.5 21B A3B Thinking

baidu

ERNIE-4.5-21B-A3B-Thinking is Baidu's upgraded lightweight MoE model, refined to boost reasoning depth and quality for top-tier performance in logical puzzles, math, science, coding, text generation, and expert-level academic benchmarks.

textcodereasoning
131,072 ctx$0.07/1M in
Explore specs and pricingView details β†’

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

nvidia

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...

textcodereasoning
131,072 ctx$0.10/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 VL 8B Instruct

qwen

Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interleaved-MRoPE for long-horizon...

textvisionmultimodal
131,072 ctx$0.08/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 VL 8B Thinking

qwen

Qwen3-VL-8B-Thinking is the reasoning-optimized variant of the Qwen3-VL-8B multimodal model, designed for advanced visual and textual reasoning across complex scenes, documents, and temporal sequences. It integrates enhanced multimodal alignment and...

textvisionmultimodal
131,072 ctx$0.12/1M in
Explore specs and pricingView details β†’

Anthropic: Claude Haiku 4.5

anthropic

Claude Haiku 4.5 is Anthropic’s fastest and most efficient model, delivering near-frontier intelligence at a fraction of the cost and latency of larger Claude models. Matching Claude Sonnet 4’s performance...

textvisionmultimodal
200,000 ctx$1.00/1M in
Explore specs and pricingView details β†’

IBM: Granite 4.0 Micro

ibm-granite

Granite-4.0-H-Micro is a 3B parameter from the Granite 4 family of models. These models are the latest in a series of models released by IBM. They are fine-tuned for long...

textcheaplong-context
131,000 ctx$0.02/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 VL 32B Instruct

qwen

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...

textvisionmultimodal
131,072 ctx$0.10/1M in
Explore specs and pricingView details β†’

MiniMax: MiniMax M2

minimax

MiniMax-M2 is a compact, high-efficiency large language model optimized for end-to-end coding and agentic workflows. With 10 billion activated parameters (230 billion total), it delivers near-frontier intelligence across general reasoning,...

textcodereasoning
196,608 ctx$0.26/1M in
Explore specs and pricingView details β†’

OpenAI: gpt-oss-safeguard-20b

openai

gpt-oss-safeguard-20b is a safety reasoning model from OpenAI built upon gpt-oss-20b. This open-weight, 21B-parameter Mixture-of-Experts (MoE) model offers lower latency for safety tasks like content classification, LLM filtering, and trust...

textreasoningcheap
131,072 ctx$0.07/1M in
Explore specs and pricingView details β†’

Mistral: Voxtral Small 24B 2507

mistralai

Voxtral Small is an enhancement of Mistral Small 3, incorporating state-of-the-art audio input capabilities while retaining best-in-class text performance. It excels at speech transcription, translation and audio understanding. Input audio...

textaudiocheap
32,000 ctx$0.10/1M in
Explore specs and pricingView details β†’

MoonshotAI: Kimi K2 Thinking

moonshotai

Kimi K2 Thinking is Moonshot AI’s most advanced open reasoning model to date, extending the K2 series into agentic, long-horizon reasoning. Built on the trillion-parameter Mixture-of-Experts (MoE) architecture introduced in...

textreasoningagents
262,144 ctx$0.60/1M in
Explore specs and pricingView details β†’

OpenAI: GPT-5.1-Codex-Mini

openai

GPT-5.1-Codex-Mini is a smaller and faster version of GPT-5.1-Codex

textvisionmultimodal
400,000 ctx$0.25/1M in
Explore specs and pricingView details β†’

xAI: Grok 4.1 Fast

x-ai

Grok 4.1 Fast is xAI's best agentic tool calling model that shines in real-world use cases like customer support and deep research. 2M context window. Reasoning can be enabled/disabled using...

textvisionmultimodal
2,000,000 ctx$0.20/1M in
Explore specs and pricingView details β†’

AllenAI: Olmo 3 32B Think

allenai

Olmo 3 32B Think is a large-scale, 32-billion-parameter model purpose-built for deep reasoning, complex logic chains and advanced instruction-following scenarios. Its capacity enables strong performance on demanding evaluation tasks and...

textreasoningcheap
65,536 ctx$0.15/1M in
Explore specs and pricingView details β†’

Prime Intellect: INTELLECT-3

prime-intellect

INTELLECT-3 is a 106B-parameter Mixture-of-Experts model (12B active) post-trained from GLM-4.5-Air-Base using supervised fine-tuning (SFT) followed by large-scale reinforcement learning (RL). It offers state-of-the-art performance for its size across math,...

textcheaplong-context
131,072 ctx$0.20/1M in
Explore specs and pricingView details β†’

DeepSeek: DeepSeek V3.2

deepseek

DeepSeek-V3.2 is a large language model designed to harmonize high computational efficiency with strong reasoning and agentic tool-use performance. It introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism...

textreasoningagents
163,840 ctx$0.26/1M in
Explore specs and pricingView details β†’

DeepSeek: DeepSeek V3.2 Speciale

deepseek

DeepSeek-V3.2-Speciale is a high-compute variant of DeepSeek-V3.2 optimized for maximum reasoning and agentic performance. It builds on DeepSeek Sparse Attention (DSA) for efficient long-context processing, then scales post-training reinforcement learning...

textreasoningagents
163,840 ctx$0.40/1M in
Explore specs and pricingView details β†’

Arcee AI: Trinity Mini

arcee-ai

Trinity Mini is a 26B-parameter (3B active) sparse mixture-of-experts language model featuring 128 experts with 8 active per token. Engineered for efficient reasoning over long contexts (131k) with robust function...

textreasoningcheap
131,072 ctx$0.04/1M in
Explore specs and pricingView details β†’

Mistral: Mistral Large 3 2512

mistralai

Mistral Large 3 2512 is Mistral’s most capable model to date, featuring a sparse mixture-of-experts architecture with 41B active parameters (675B total), and released under the Apache 2.0 license.

textvisionmultimodal
262,144 ctx$0.50/1M in
Explore specs and pricingView details β†’

Mistral: Ministral 3 3B 2512

mistralai

The smallest model in the Ministral 3 family, Ministral 3 3B is a powerful, efficient tiny language model with vision capabilities.

textvisionmultimodal
131,072 ctx$0.10/1M in
Explore specs and pricingView details β†’

Mistral: Ministral 3 8B 2512

mistralai

A balanced model in the Ministral 3 family, Ministral 3 8B is a powerful, efficient tiny language model with vision capabilities.

textvisionmultimodal
262,144 ctx$0.15/1M in
Explore specs and pricingView details β†’

Mistral: Ministral 3 14B 2512

mistralai

The largest model in the Ministral 3 family, Ministral 3 14B offers frontier capabilities and performance comparable to its larger Mistral Small 3.2 24B counterpart. A powerful and efficient language...

textvisionmultimodal
262,144 ctx$0.20/1M in
Explore specs and pricingView details β†’

Amazon: Nova 2 Lite

amazon

Nova 2 Lite is a fast, cost-effective reasoning model for everyday workloads that can process text, images, and videos to generate text. Nova 2 Lite demonstrates standout capabilities in processing...

textvisionimage
1,000,000 ctx$0.30/1M in
Explore specs and pricingView details β†’

Body Builder (beta)

openrouter

Transform your natural language requests into structured OpenRouter API request objects. Describe what you want to accomplish with AI models, and Body Builder will construct the appropriate API calls. Example:...

textcheaplong-context
128,000 ctxFree in
Explore specs and pricingView details β†’

EssentialAI: Rnj 1 Instruct

essentialai

Rnj-1 is an 8B-parameter, dense, open-weight model family developed by Essential AI and trained from scratch with a focus on programming, math, and scientific reasoning. The model demonstrates strong performance...

textreasoninginstruct
32,768 ctx$0.15/1M in
Explore specs and pricingView details β†’

Nex AGI: DeepSeek V3.1 Nex N1

nex-agi

DeepSeek V3.1 Nex-N1 is the flagship release of the Nex-N1 series β€” a post-trained model designed to highlight agent autonomy, tool use, and real-world productivity. Nex-N1 demonstrates competitive performance across...

textagentscheap
131,072 ctx$0.14/1M in
Explore specs and pricingView details β†’

Z.ai: GLM 4.6V

z-ai

GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and long-context reasoning across images, documents, and mixed media. It supports up to 128K tokens, processes complex page layouts...

textvisionmultimodal
131,072 ctx$0.30/1M in
Explore specs and pricingView details β†’

Relace: Relace Search

relace

The relace-search model uses 4-12 `view_file` and `grep` tools in parallel to explore a codebase and return relevant files to the user request. In contrast to RAG, relace-search performs agentic...

textagentscheap
256,000 ctx$1.00/1M in
Explore specs and pricingView details β†’

Mistral: Devstral 2 2512

mistralai

Devstral 2 is a state-of-the-art open-source model by Mistral AI specializing in agentic coding. It is a 123B-parameter dense transformer model supporting a 256K context window. Devstral 2 supports exploring...

textcodeagents
262,144 ctx$0.40/1M in
Explore specs and pricingView details β†’

Xiaomi: MiMo-V2-Flash

xiaomi

MiMo-V2-Flash is an open-source foundation language model developed by Xiaomi. It is a Mixture-of-Experts model with 309B total parameters and 15B active parameters, adopting hybrid attention architecture. MiMo-V2-Flash supports a...

textcheaplong-context
262,144 ctx$0.09/1M in
Explore specs and pricingView details β†’

Mistral: Mistral Small Creative

mistralai

Mistral Small Creative is an experimental small model designed for creative writing, narrative generation, roleplay and character-driven dialogue, general-purpose instruction following, and conversational agents.

textagentscheap
32,768 ctx$0.10/1M in
Explore specs and pricingView details β†’

Google: Gemini 3 Flash Preview

google

Gemini 3 Flash Preview is a high speed, high value thinking model designed for agentic workflows, multi turn chat, and coding assistance. It delivers near Pro level reasoning and tool...

textvisionmultimodal
1,048,576 ctx$0.50/1M in
Explore specs and pricingView details β†’

Z.ai: GLM 4.7

z-ai

GLM-4.7 is Z.ai’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution. It demonstrates significant improvements in executing complex agent tasks while...

textreasoningagents
202,752 ctx$0.39/1M in
Explore specs and pricingView details β†’

MiniMax: MiniMax M2.1

minimax

MiniMax-M2.1 is a lightweight, state-of-the-art large language model optimized for coding, agentic workflows, and modern application development. With only 10 billion activated parameters, it delivers a major jump in real-world...

textcodeagents
196,608 ctx$0.29/1M in
Explore specs and pricingView details β†’

ByteDance Seed: Seed 1.6

bytedance-seed

Seed 1.6 is a general-purpose model released by the ByteDance Seed team. It incorporates multimodal capabilities and adaptive deep thinking with a 256K context window.

textvisionmultimodal
262,144 ctx$0.25/1M in
Explore specs and pricingView details β†’

ByteDance Seed: Seed 1.6 Flash

bytedance-seed

Seed 1.6 Flash is an ultra-fast multimodal deep thinking model by ByteDance Seed, supporting both text and visual understanding. It features a 256k context window and can generate outputs of...

textvisionimage
262,144 ctx$0.07/1M in
Explore specs and pricingView details β†’