modelstop.top
Home/All Models

AI Model Catalogue

Browse 454 models across providers, modalities, and use cases.

🌐 All Models

454 models Β· Page 3 of 13

Qwen3 Next 80B A3b Instruct

qwen

Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without β€œthinking” traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual...

textcodereasoning
262,144 ctx$0.15/1M in
Explore specs and pricingView details β†’

Qwen2.5 32B

qwen

textfreelong-context
131,072 ctxFree in
Explore specs and pricingView details β†’

Facebook CWM

facebook

textfreelong-context
Run locally
131,072 ctxFree in
Explore specs and pricingView details β†’

Qwen3 Coder 30B A3b Instruct

qwen

Qwen3-Coder-30B-A3B-Instruct is a 30.5B parameter Mixture-of-Experts (MoE) model with 128 experts (8 active per forward pass), designed for advanced code generation, repository-scale understanding, and agentic tool use. Built on the...

textcodeagents
262,144 ctxFree in
Explore specs and pricingView details β†’

Holo3 35B A3b

hcompany

textfreelong-context
262,144 ctxFree in
Explore specs and pricingView details β†’

Llama Guard 4 12B

meta-llama

Llama Guard 4 is a Llama 4 Scout-derived multimodal pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classify content in both LLM...

textvisioncheap
1,048,576 ctx$0.20/1M in
Explore specs and pricingView details β†’

Qwen3 Next 80B A3b Thinking

qwen

Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured β€œthinking” traces by default. It’s designed for hard multi-step problems; math proofs, code synthesis/debugging, logic, and agentic...

textcodereasoning
262,144 ctx$0.15/1M in
Explore specs and pricingView details β†’

DeepSeek R1 Distill Qwen 1.5B

deepseek-ai

textcheaplong-context
131,072 ctx$0.18/1M in
Explore specs and pricingView details β†’

DeepSeek R1 Distill Qwen 14B

deepseek-ai

textlong-context
Run locally
131,072 ctx$1.60/1M in
Explore specs and pricingView details β†’

Deepseek V3.1 Base

deepseek-ai

textfreelong-context
163,840 ctxFree in
Explore specs and pricingView details β†’

Qwen3 235B A22B Thinking 2507 FP8

qwen

Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tasks. It activates 22B of its 235B parameters per forward pass and natively supports up to 262,144...

textreasoningcheap
262,144 ctx$0.65/1M in
Explore specs and pricingView details β†’

Qwen3-VL-32B-Instruct

qwen

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...

textvisionreasoning
262,144 ctx$0.50/1M in
Explore specs and pricingView details β†’

GLM 4.7 Fp8

zai-org

textcheaplong-context
202,752 ctx$0.45/1M in
Explore specs and pricingView details β†’

Cogito V1 Preview Qwen 14B

deepcogito

textfreelong-context
131,072 ctxFree in
Explore specs and pricingView details β†’

Cogito V1 Preview Llama 8B

deepcogito

textfreelong-context
Run locally
131,072 ctxFree in
Explore specs and pricingView details β†’

Cogito V1 Preview Qwen 32B

deepcogito

textfreelong-context
Run locally
131,072 ctxFree in
Explore specs and pricingView details β†’

Cogito V1 Preview Llama 70B Turbo

deepcogito

textfreelong-context
Run locally
131,072 ctxFree in
Explore specs and pricingView details β†’

DeepSeek R1 Distill Qwen 7B

deepseek-ai

textfreelong-context
131,072 ctxFree in
Explore specs and pricingView details β†’

Cogito v2.1 671B

deepcogito

textlong-context
163,840 ctx$1.25/1M in
Explore specs and pricingView details β†’

Deepseek V3

deepseek-ai

textfreelong-context
163,840 ctxFree in
Explore specs and pricingView details β†’

Meta Llama 3.1 70B Instruct Turbo

meta-llama

textinstructcheap
131,072 ctx$0.88/1M in
Explore specs and pricingView details β†’

GLM 5 Fp4

zai-org

textfreelong-context
Run locally
202,752 ctxFree in
Explore specs and pricingView details β†’

Qwen3-VL-8B-Instruct

qwen

Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interleaved-MRoPE for long-horizon...

textvisionreasoning
262,144 ctx$0.18/1M in
Explore specs and pricingView details β†’

DeepSeek R1 Distill Llama 70B

deepseek-ai

textlong-context
131,072 ctx$2.00/1M in
Explore specs and pricingView details β†’

Cogito V1 Preview Llama 70B

deepcogito

textfreelong-context
131,072 ctxFree in
Explore specs and pricingView details β†’

google/gemma-4-26B-A4B-it

deepinfra

Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B activate per token during inference β€” delivering near-31B quality at...

textfreelong-context
Run locally
262,144 ctxFree in
Explore specs and pricingView details β†’

meta-llama/Llama-Guard-4-12B

deepinfra

Llama Guard 4 is a Llama 4 Scout-derived multimodal pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classify content in both LLM...

textvisioncheap
163,840 ctxFree in
Explore specs and pricingView details β†’

Qwen/Qwen3-Max

deepinfra

Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage compared to the January 2025 version. It...

textreasoningmultilingual
262,144 ctxFree in
Explore specs and pricingView details β†’

google/gemma-3-4b-it

deepinfra

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

textvisionreasoning
131,072 ctxFree in
Explore specs and pricingView details β†’

Qwen/Qwen3-Next-80B-A3B-Instruct

deepinfra

Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without β€œthinking” traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual...

textcodereasoning
Run locally
262,144 ctxFree in
Explore specs and pricingView details β†’

nvidia/Nemotron-3-Nano-30B-A3B

deepinfra

NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...

textagentsfree
Run locally
256,000 ctxFree in
Explore specs and pricingView details β†’

Qwen/Qwen3-VL-235B-A22B-Instruct

deepinfra

Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text generation with visual understanding across images and video. The Instruct model targets general vision-language use (VQA, document parsing, chart/table...

textvisioninstruct
262,144 ctxFree in
Explore specs and pricingView details β†’

google/gemma-4-31B-it

deepinfra

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...

textvisionreasoning
262,144 ctxFree in
Explore specs and pricingView details β†’

openai/gpt-oss-20b

deepinfra

gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 license. It uses a Mixture-of-Experts (MoE) architecture with 3.6B active parameters per forward pass, optimized for...

textfreelong-context
131,072 ctxFree in
Explore specs and pricingView details β†’

Qwen/Qwen3-235B-A22B-Thinking-2507

deepinfra

Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tasks. It activates 22B of its 235B parameters per forward pass and natively supports up to 262,144...

textreasoningcheap
262,144 ctxFree in
Explore specs and pricingView details β†’

openai/gpt-oss-120b

deepinfra

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized...

textreasoningagents
131,072 ctxFree in
Explore specs and pricingView details β†’