modelstop.top
AI Model Catalogue

Browse 352 models across providers, modalities, and use cases.

🌐 All Models

352 models · Page 3 of 10

GLM 4.5 Air FP8

zai-org

text · cheap · long-context
Run locally
131,072 ctx · $0.20/1M in

Qwen 2.5 14B Instruct

qwen

text · instruct · cheap
Run locally
32,768 ctx · $0.80/1M in

Llama 4 Maverick Instruct (17Bx128E) FP8

meta-llama

text · instruct · cheap
1,048,576 ctx · $0.27/1M in

Ministral 3 14B Instruct 2512

mistralai

text · instruct · cheap
Run locally
262,144 ctx · $0.20/1M in

GLM 4.6 FP8

zai-org

text · cheap · long-context
Run locally
202,752 ctx · $0.60/1M in

Meta Llama 3 70B Instruct Turbo

meta-llama

text · instruct · cheap
Run locally
8,192 ctx · $0.88/1M in

DeepSeek Coder 33B Instruct

deepseek-ai

text · code · instruct
Run locally
16,384 ctx · $0.80/1M in

Meta Llama 3.1 8B

meta-llama

text · cheap
Run locally
16,384 ctx · $0.20/1M in

Mistral (7B) Instruct v0.3

mistralai

text · instruct · cheap
32,768 ctx · $0.20/1M in

Arize AI Qwen 2 1.5B Instruct

arize-ai

text · instruct · cheap
32,768 ctx · $0.10/1M in

Mixtral-8x7B Instruct v0.1

mistralai

text · instruct · cheap
Run locally
32,768 ctx · $0.60/1M in

Meta Llama 3.1 70B Instruct Turbo

meta-llama

text · instruct · cheap
131,072 ctx · $0.88/1M in

Qwen3 Next 80B A3B Thinking

qwen

Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured “thinking” traces by default. It's designed for hard multi-step problems: math proofs, code synthesis/debugging, logic, and agentic...

text · code · reasoning
262,144 ctx · $0.15/1M in

Qwen3-VL-8B-Instruct

qwen

Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interleaved-MRoPE for long-horizon...

text · vision · reasoning
262,144 ctx · $0.18/1M in

Qwen3-VL-32B-Instruct

qwen

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...

text · vision · reasoning
262,144 ctx · $0.50/1M in

GLM 4.7 FP8

zai-org

text · cheap · long-context
202,752 ctx · $0.45/1M in

EssentialAI Rnj-1 Instruct

essentialai

Rnj-1 is an 8B-parameter, dense, open-weight model family developed by Essential AI and trained from scratch with a focus on programming, math, and scientific reasoning. The model demonstrates strong performance...

text · reasoning · instruct
32,768 ctx · $0.15/1M in

Llama Guard 4 12B

meta-llama

Llama Guard 4 is a Llama 4 Scout-derived multimodal pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classify content in both LLM...

text · vision · cheap
1,048,576 ctx · $0.20/1M in

DeepSeek R1 Distill Qwen 1.5B

deepseek-ai

text · cheap · long-context
131,072 ctx · $0.18/1M in

Nous Hermes 2 Mixtral 8x7B DPO

nousresearch

text · cheap
Run locally
32,768 ctx · $0.60/1M in

Qwen3 235B A22B Thinking 2507 FP8

qwen

Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tasks. It activates 22B of its 235B parameters per forward pass and natively supports up to 262,144...

text · reasoning · cheap
262,144 ctx · $0.65/1M in

LFM2-24B-A2B

liquidai

text · cheap
Run locally
32,768 ctx · $0.03/1M in

Multilingual E5 Large Instruct

intfloat

text · multilingual · instruct
Run locally
514 ctx · $0.02/1M in

Qwen3 Coder 30B A3B Instruct

qwen

Qwen3-Coder-30B-A3B-Instruct is a 30.5B parameter Mixture-of-Experts (MoE) model with 128 experts (8 active per forward pass), designed for advanced code generation, repository-scale understanding, and agentic tool use. Built on the...

text · code · agents
262,144 ctx · Free in

Qwen/Qwen3-30B-A3B

deepinfra

Qwen3, the latest generation in the Qwen large language model series, features both dense and mixture-of-experts (MoE) architectures to excel in reasoning, multilingual support, and advanced agent tasks. Its unique...

text · reasoning · agents
40,960 ctx · Free in

Qwen/Qwen3-Max

deepinfra

Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage compared to the January 2025 version. It...

text · reasoning · multilingual
262,144 ctx · Free in

Qwen/Qwen3-32B

deepinfra

Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for...

text · reasoning · cheap
40,960 ctx · Free in

google/gemma-3-4b-it

deepinfra

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

text · vision · reasoning
131,072 ctx · Free in

meta-llama/Llama-Guard-4-12B

deepinfra

Llama Guard 4 is a Llama 4 Scout-derived multimodal pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classify content in both LLM...

text · vision · cheap
163,840 ctx · Free in

Qwen/Qwen3-14B

deepinfra

Qwen3-14B is a dense 14.8B parameter causal language model from the Qwen3 series, designed for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for...

text · reasoning · cheap
40,960 ctx · Free in

microsoft/phi-4

deepinfra

Microsoft Phi-4 14B: a small language model achieving state-of-the-art results on reasoning tasks.

text · reasoning · cheap
16,384 ctx · Free in

mistralai/Mistral-Small-24B-Instruct-2501

deepinfra

Mistral Small 3 is a 24B-parameter language model optimized for low-latency performance across common AI tasks. Released under the Apache 2.0 license, it features both pre-trained and instruction-tuned versions designed...

text · instruct · cheap
32,768 ctx · Free in

Gryphe/MythoMax-L2-13b

deepinfra

One of the highest performing and most popular fine-tunes of Llama 2 13B, with rich descriptions and roleplay. #merge

text · cheap
4,096 ctx · Free in

Qwen/Qwen3-235B-A22B-Thinking-2507

deepinfra

Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tasks. It activates 22B of its 235B parameters per forward pass and natively supports up to 262,144...

text · reasoning · cheap
262,144 ctx · Free in

Qwen/Qwen3-Max-Thinking

deepinfra

Qwen3-Max-Thinking is the flagship reasoning model in the Qwen3 series, designed for high-stakes cognitive tasks that require deep, multi-step reasoning. By significantly scaling model capacity and reinforcement learning compute, it...

text · reasoning · cheap
262,144 ctx · Free in

Qwen/Qwen3-VL-30B-A3B-Instruct

deepinfra

Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It excels in perception...

text · vision · instruct
131,072 ctx · Free in