modelstop.top
Home/All Models

AI Model Catalogue

Browse 91 models across providers, modalities, and use cases.

🌐 All Models

91 models Β· Page 1 of 3

qwen3-30b-a3b-fp8

qwen

Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groundbreaking advancements in reasoning, instruction-following, agent capabilities, and multilingual support.

textreasoningagents
32,768 ctx$0.05/1M in
Explore specs and pricingView details β†’

gpt-oss-20b

openai

OpenAI’s open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases – gpt-oss-20b is for lower latency, and local or specialized use-cases.

textreasoningagents
128,000 ctx$0.20/1M in
Explore specs and pricingView details β†’

llama-3.2-3b-instruct

meta

The Llama 3.2 instruction-tuned text only models are optimized for multilingual dialogue use cases, including agentic retrieval and summarization tasks.

textagentsmultilingual
80,000 ctx$0.05/1M in
Explore specs and pricingView details β†’

flux

deepgram

Flux is the first conversational speech recognition model built specifically for voice agents.

audioagentsfree
ctx$0.00/1M in
Explore specs and pricingView details β†’

granite-4.0-h-micro

ibm-granite

Granite 4.0 instruct models deliver strong performance across benchmarks, achieving industry-leading results in key agentic tasks like instruction following and function calling. These efficiencies make the models well-suited for a wide range of use cases like retrieval-augmented generation (RAG), multi-agent workflows, and edge deployments.

textagentsinstruct
131,000 ctx$0.02/1M in
Explore specs and pricingView details β†’

gpt-oss-120b

openai

OpenAI’s open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases – gpt-oss-120b is for production, general purpose, high reasoning use-cases.

textreasoningagents
128,000 ctx$0.35/1M in
Explore specs and pricingView details β†’

kimi-k2.6

moonshotai

Kimi K2.6 is a frontier-scale open-source 1T parameter model with a 262.1k context window, multi-turn tool calling, vision inputs, and structured outputs for agentic workloads.

textvisionagents
262,144 ctx$0.95/1M in
Explore specs and pricingView details β†’

llama-3.2-1b-instruct

meta

The Llama 3.2 instruction-tuned text only models are optimized for multilingual dialogue use cases, including agentic retrieval and summarization tasks.

textagentsmultilingual
60,000 ctx$0.03/1M in
Explore specs and pricingView details β†’

kimi-k2.5

moonshotai

Kimi K2.5 is a frontier-scale open-source model with a 256k context window, multi-turn tool calling, vision inputs, and structured outputs for agentic workloads.

textvisionagents
256,000 ctx$0.60/1M in
Explore specs and pricingView details β†’

nemotron-3-120b-a12b

nvidia

NVIDIA Nemotron 3 Super is a hybrid MoE model with leading accuracy for multi-agent applications and specialized agentic AI systems.

textagentscheap
256,000 ctx$0.50/1M in
Explore specs and pricingView details β†’

Qwen3 30B A3b

qwen

Qwen3, the latest generation in the Qwen large language model series, features both dense and mixture-of-experts (MoE) architectures to excel in reasoning, multilingual support, and advanced agent tasks. Its unique...

textreasoningagents
40,960 ctxFree in
Explore specs and pricingView details β†’

Kimi K2 Thinking

moonshotai

Kimi K2 Thinking is Moonshot AI’s most advanced open reasoning model to date, extending the K2 series into agentic, long-horizon reasoning. Built on the trillion-parameter Mixture-of-Experts (MoE) architecture introduced in...

textreasoningagents
262,144 ctx$1.20/1M in
Explore specs and pricingView details β†’

Qwen3 Next 80B A3b Thinking

qwen

Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured β€œthinking” traces by default. It’s designed for hard multi-step problems; math proofs, code synthesis/debugging, logic, and agentic...

textcodereasoning
262,144 ctx$0.15/1M in
Explore specs and pricingView details β†’

Qwen3 Coder 30B A3b Instruct

qwen

Qwen3-Coder-30B-A3B-Instruct is a 30.5B parameter Mixture-of-Experts (MoE) model with 128 experts (8 active per forward pass), designed for advanced code generation, repository-scale understanding, and agentic tool use. Built on the...

textcodeagents
262,144 ctxFree in
Explore specs and pricingView details β†’

Qwen/Qwen3-30B-A3B

deepinfra

Qwen3, the latest generation in the Qwen large language model series, features both dense and mixture-of-experts (MoE) architectures to excel in reasoning, multilingual support, and advanced agent tasks. Its unique...

textreasoningagents
40,960 ctxFree in
Explore specs and pricingView details β†’

openai/gpt-oss-120b

deepinfra

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized...

textreasoningagents
131,072 ctxFree in
Explore specs and pricingView details β†’

nvidia/Nemotron-3-Nano-30B-A3B

deepinfra

NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...

textagentsfree
Run locally
256,000 ctxFree in
Explore specs and pricingView details β†’

devstral-medium-2507

mistralai

Our medium code-agentic model.

textcodeagents
131,072 ctxFree in
Explore specs and pricingView details β†’

devstral-small-2507

mistralai

Our small open-source code-agentic model.

textcodeagents
131,072 ctxFree in
Explore specs and pricingView details β†’

openai/gpt-oss-120b

groq

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized...

textreasoningagents
131,072 ctxFree in
Explore specs and pricingView details β†’

fibo

bria

SOTA Open source model trained on licensed data, transforming intent into structured control for precise, high-quality AI image generation in enterprise and agentic workflows.

visionimageagents
ctxFree in
Explore specs and pricingView details β†’

riverflow-2.0-pro

sourceful

Agentic image model optimized for robust, high-precision generations supporting font control

visionimageagents
ctxFree in
Explore specs and pricingView details β†’

claude-opus-4.6

anthropic

Anthropic's most intelligent model with state-of-the-art coding, reasoning, and agentic capabilities

textcodereasoning
ctxFree in
Explore specs and pricingView details β†’

video-agent

heygen

Turn a text prompt into a complete, polished video with AI-generated script, avatar presenter, voiceover, visuals, and editing.

agentsfree
ctxFree in
Explore specs and pricingView details β†’

Amazon Nova Pro

amazon

Amazon Nova Pro is a highly capable multimodal model with the best combination of accuracy, speed, and cost across a wide range of tasks. Supports text, image, and video inputs.

visionmultimodallong-context
300,000 ctx$0.80/1M in
Explore specs and pricingView details β†’

Nous: Hermes 3 405B Instruct (free)

nousresearch

Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...

textreasoningagents
131,072 ctx$1.00/1M in
Explore specs and pricingView details β†’

Nous: Hermes 3 70B Instruct

nousresearch

Hermes 3 is a generalist language model with many improvements over [Hermes 2](/models/nousresearch/nous-hermes-2-mistral-7b-dpo), including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...

textreasoningagents
Run locally
131,072 ctx$0.30/1M in
Explore specs and pricingView details β†’

Cohere: Command R (08-2024)

cohere

command-r-08-2024 is an update of the [Command R](/models/cohere/command-r) with improved performance for multilingual retrieval-augmented generation (RAG) and tool use. More broadly, it is better at math, code and reasoning and...

textcodereasoning
Run locally
128,000 ctx$0.15/1M in
Explore specs and pricingView details β†’

Anthropic: Claude 3.5 Haiku

anthropic

Claude 3.5 Haiku features offers enhanced capabilities in speed, coding accuracy, and tool use. Engineered to excel in real-time applications, it delivers quick response times that are essential for dynamic...

textvisionmultimodal
Run locally
200,000 ctx$0.80/1M in
Explore specs and pricingView details β†’

Cohere: Command R7B (12-2024)

cohere

Command R7B (12-2024) is a small, fast update of the Command R+ model, delivered in December 2024. It excels at RAG, tool use, agents, and similar tasks requiring complex reasoning...

textreasoningagents
Run locally
128,000 ctx$0.04/1M in
Explore specs and pricingView details β†’

Cohere: Command A

cohere

Command A is an open-weights 111B parameter model with a 256k context window focused on delivering great performance across agentic, multilingual, and coding use cases. Compared to other leading proprietary...

textcodeagents
Run locally
256,000 ctx$2.50/1M in
Explore specs and pricingView details β†’

OpenAI: o4 Mini

openai

OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-efficient performance while retaining strong multimodal and agentic capabilities. It supports tool use and demonstrates competitive reasoning...

textvisionmultimodal
200,000 ctx$1.10/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 30B A3B

qwen

Qwen3, the latest generation in the Qwen large language model series, features both dense and mixture-of-experts (MoE) architectures to excel in reasoning, multilingual support, and advanced agent tasks. Its unique...

textreasoningagents
Run locally
131,072 ctx$0.08/1M in
Explore specs and pricingView details β†’

Anthropic: Claude Opus 4

anthropic

Claude Opus 4 is benchmarked as the world’s best coding model, at time of release, bringing sustained performance on complex, long-running tasks and agent workflows. It sets new benchmarks in...

textvisionmultimodal
Run locally
200,000 ctx$15.00/1M in
Explore specs and pricingView details β†’

Mistral: Devstral Small 1.1

mistralai

Devstral Small 1.1 is a 24B parameter open-weight language model for software engineering agents, developed by Mistral AI in collaboration with All Hands AI. Finetuned from Mistral Small 3.1 and...

textagentscheap
131,072 ctx$0.10/1M in
Explore specs and pricingView details β†’

Mistral: Devstral Medium

mistralai

Devstral Medium is a high-performance code generation and agentic reasoning model developed jointly by Mistral AI and All Hands AI. Positioned as a step up from Devstral Small, it achieves...

textcodereasoning
131,072 ctx$0.40/1M in
Explore specs and pricingView details β†’