modelstop.top
Home/All Models

AI Model Catalogue

Browse 454 models across providers, modalities, and use cases.

🌐 All Models

454 models Β· Page 10 of 13

OpenAI: GPT-4o Audio

openai

The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs...

textaudiolong-context
128,000 ctx$2.50/1M in
Explore specs and pricingView details β†’

Nous: Hermes 4 405B

nousresearch

Hermes 4 is a large-scale reasoning model built on Meta-Llama-3.1-405B and released by Nous Research. It introduces a hybrid reasoning mode, where the model can choose to deliberate internally with...

textreasoningcheap
131,072 ctx$1.00/1M in
Explore specs and pricingView details β†’

Nous: Hermes 4 70B

nousresearch

Hermes 4 70B is a hybrid reasoning model from Nous Research, built on Meta-Llama-3.1-70B. It introduces the same hybrid mode as the larger 405B release, allowing the model to either...

textreasoningcheap
Run locally
131,072 ctx$0.13/1M in
Explore specs and pricingView details β†’

xAI: Grok Code Fast 1

x-ai

Grok Code Fast 1 is a speedy and economical reasoning model that excels at agentic coding. With reasoning traces visible in the response, developers can steer Grok Code for high-quality...

textcodereasoning
256,000 ctx$0.20/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 30B A3B Thinking 2507

qwen

Qwen3-30B-A3B-Thinking-2507 is a 30B parameter Mixture-of-Experts reasoning model optimized for complex tasks requiring extended multi-step thinking. The model is designed specifically for β€œthinking mode,” where internal reasoning traces are separated...

textreasoningcheap
131,072 ctx$0.08/1M in
Explore specs and pricingView details β†’

MoonshotAI: Kimi K2 0905

moonshotai

Kimi K2 0905 is the September update of [Kimi K2 0711](moonshotai/kimi-k2). It is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, featuring 1 trillion total parameters with 32...

textcheaplong-context
262,144 ctx$0.40/1M in
Explore specs and pricingView details β†’

NVIDIA: Nemotron Nano 9B V2 (free)

nvidia

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and...

textreasoningfree
128,000 ctx$0.04/1M in
Explore specs and pricingView details β†’

Qwen: Qwen Plus 0728 (thinking)

qwen

Qwen Plus 0728, based on the Qwen3 foundation model, is a 1 million context hybrid reasoning model with a balanced performance, speed, and cost combination.

textreasoningcheap
1,000,000 ctx$0.26/1M in
Explore specs and pricingView details β†’

Meituan: LongCat Flash Chat

meituan

LongCat-Flash-Chat is a large-scale Mixture-of-Experts (MoE) model with 560B total parameters, of which 18.6B–31.3B (β‰ˆ27B on average) are dynamically activated per input. It introduces a shortcut-connected MoE design to reduce...

cheaplong-context
131,072 ctx$0.20/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 Next 80B A3B Instruct (free)

qwen

Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without β€œthinking” traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual...

textcodereasoning
262,144 ctx$0.09/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 Next 80B A3B Thinking

qwen

Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured β€œthinking” traces by default. It’s designed for hard multi-step problems; math proofs, code synthesis/debugging, logic, and agentic...

textcodereasoning
131,072 ctx$0.10/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 Coder Flash

qwen

Qwen3 Coder Flash is Alibaba's fast and cost efficient version of their proprietary Qwen3 Coder Plus. It is a powerful coding agent model specializing in autonomous programming via tool calling...

textcodeagents
1,000,000 ctx$0.20/1M in
Explore specs and pricingView details β†’

Tongyi DeepResearch 30B A3B

alibaba

Tongyi DeepResearch is an agentic large language model developed by Tongyi Lab, with 30 billion total parameters activating only 3 billion per token. It's optimized for long-horizon, deep information-seeking tasks...

textagentscheap
131,072 ctx$0.09/1M in
Explore specs and pricingView details β†’

xAI: Grok 4 Fast

x-ai

Grok 4 Fast is xAI's latest multimodal model with SOTA cost-efficiency and a 2M token context window. It comes in two flavors: non-reasoning and reasoning. Read more about the model...

textvisionmultimodal
2,000,000 ctx$0.20/1M in
Explore specs and pricingView details β†’

DeepSeek: DeepSeek V3.1 Terminus

deepseek

DeepSeek-V3.1 Terminus is an update to [DeepSeek V3.1](/deepseek/deepseek-chat-v3.1) that maintains the model's original capabilities while addressing issues reported by users, including language consistency and agent capabilities, further optimizing the model's...

textagentscheap
163,840 ctx$0.21/1M in
Explore specs and pricingView details β†’

OpenAI: GPT-5 Codex

openai

GPT-5-Codex is a specialized version of GPT-5 optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering tasks....

textvisionmultimodal
400,000 ctx$1.25/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 Coder Plus

qwen

Qwen3 Coder Plus is Alibaba's proprietary version of the Open Source Qwen3 Coder 480B A35B. It is a powerful coding agent model specializing in autonomous programming via tool calling and...

textcodeagents
1,000,000 ctx$0.65/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 Max

qwen

Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage compared to the January 2025 version. It...

textreasoningmultilingual
262,144 ctx$0.78/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 VL 235B A22B Instruct

qwen

Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text generation with visual understanding across images and video. The Instruct model targets general vision-language use (VQA, document parsing, chart/table...

textvisionimage
Run locally
262,144 ctx$0.20/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 VL 235B A22B Thinking

qwen

Qwen3-VL-235B-A22B Thinking is a multimodal model that unifies strong text generation with visual understanding across images and video. The Thinking model is optimized for multimodal reasoning in STEM and math....

textvisionimage
131,072 ctx$0.26/1M in
Explore specs and pricingView details β†’

Google: Gemini 2.5 Flash Lite Preview 09-2025

google

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

textvisionimage
1,048,576 ctx$0.10/1M in
Explore specs and pricingView details β†’

Relace: Relace Apply 3

relace

Relace Apply 3 is a specialized code-patching LLM that merges AI-suggested edits straight into your source files. It can apply updates from GPT-4o, Claude, and others into your files at...

textcodecheap
256,000 ctx$0.85/1M in
Explore specs and pricingView details β†’

TheDrummer: Cydonia 24B V4.1

thedrummer

Uncensored and creative writing model based on Mistral Small 3.2 24B with good recall, prompt adherence, and intelligence.

textcheaplong-context
131,072 ctx$0.30/1M in
Explore specs and pricingView details β†’

DeepSeek: DeepSeek V3.2 Exp

deepseek

DeepSeek-V3.2-Exp is an experimental large language model released by DeepSeek as an intermediate step between V3.1 and future architectures. It introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism...

textcheaplong-context
163,840 ctx$0.27/1M in
Explore specs and pricingView details β†’

Anthropic: Claude Sonnet 4.5

anthropic

Claude Sonnet 4.5 is Anthropic’s most advanced Sonnet model to date, optimized for real-world agents and coding workflows. It delivers state-of-the-art performance on coding benchmarks such as SWE-bench Verified, with...

textvisionmultimodal
Run locally
1,000,000 ctx$3.00/1M in
Explore specs and pricingView details β†’

Z.ai: GLM 4.6

z-ai

Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex...

textcheaplong-context
Run locally
202,752 ctx$0.39/1M in
Explore specs and pricingView details β†’

OpenAI: GPT-5 Pro

openai

GPT-5 Pro is OpenAI’s most advanced model, offering major improvements in reasoning, code quality, and user experience. It is optimized for complex tasks that require step-by-step reasoning, instruction following, and...

textvisionmultimodal
Run locally
400,000 ctx$15.00/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 VL 30B A3B Instruct

qwen

Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It excels in perception...

textvisionimage
Run locally
262,144 ctx$0.13/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 VL 30B A3B Thinking

qwen

Qwen3-VL-30B-A3B-Thinking is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Thinking variant enhances reasoning in STEM, math, and complex tasks. It excels...

textvisionimage
131,072 ctx$0.13/1M in
Explore specs and pricingView details β†’

Baidu: ERNIE 4.5 21B A3B Thinking

baidu

ERNIE-4.5-21B-A3B-Thinking is Baidu's upgraded lightweight MoE model, refined to boost reasoning depth and quality for top-tier performance in logical puzzles, math, science, coding, text generation, and expert-level academic benchmarks.

textcodereasoning
131,072 ctx$0.07/1M in
Explore specs and pricingView details β†’

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

nvidia

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...

textcodereasoning
131,072 ctx$0.10/1M in
Explore specs and pricingView details β†’

OpenAI: o4 Mini Deep Research

openai

o4-mini-deep-research is OpenAI's faster, more affordable deep research modelβ€”ideal for tackling complex, multi-step research tasks. Note: This model always uses the 'web_search' tool which adds additional cost.

textvisionmultimodal
200,000 ctx$2.00/1M in
Explore specs and pricingView details β†’

OpenAI: o3 Deep Research

openai

o3-deep-research is OpenAI's advanced model for deep research, designed to tackle complex, multi-step research tasks. Note: This model always uses the 'web_search' tool which adds additional cost.

textvisionmultimodal
200,000 ctx$10.00/1M in
Explore specs and pricingView details β†’

OpenAI: GPT-5 Image

openai

[GPT-5](https://openrouter.ai/openai/gpt-5) Image combines OpenAI's GPT-5 model with state-of-the-art image generation capabilities. It offers major improvements in reasoning, code quality, and user experience while incorporating GPT Image 1's superior instruction following,...

textvisionimage
Run locally
400,000 ctx$10.00/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 VL 8B Instruct

qwen

Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interleaved-MRoPE for long-horizon...

textvisionmultimodal
131,072 ctx$0.08/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 VL 8B Thinking

qwen

Qwen3-VL-8B-Thinking is the reasoning-optimized variant of the Qwen3-VL-8B multimodal model, designed for advanced visual and textual reasoning across complex scenes, documents, and temporal sequences. It integrates enhanced multimodal alignment and...

textvisionmultimodal
131,072 ctx$0.12/1M in
Explore specs and pricingView details β†’