modelstop.top
Home/All Models

AI Model Catalogue

Browse 287 models across providers, modalities, and use cases.

๐ŸŒ All Models

287 models ยท Page 2 of 8

DeepSeek R1 Distill Qwen 1.5B

deepseek-ai

textcheaplong-context
131,072 ctx$0.18/1M in
Explore specs and pricingView details โ†’

Qwen3-VL-32B-Instruct

qwen

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...

textvisionreasoning
262,144 ctx$0.50/1M in
Explore specs and pricingView details โ†’

Arize AI Qwen 2 1.5B Instruct

arize-ai

textinstructcheap
32,768 ctx$0.10/1M in
Explore specs and pricingView details โ†’

Qwen/Qwen3-Max

deepinfra

Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage compared to the January 2025 version. It...

textreasoningmultilingual
262,144 ctxFree in
Explore specs and pricingView details โ†’

meta-llama/Llama-Guard-4-12B

deepinfra

Llama Guard 4 is a Llama 4 Scout-derived multimodal pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classify content in both LLM...

textvisioncheap
163,840 ctxFree in
Explore specs and pricingView details โ†’

Qwen/Qwen3-30B-A3B

deepinfra

Qwen3, the latest generation in the Qwen large language model series, features both dense and mixture-of-experts (MoE) architectures to excel in reasoning, multilingual support, and advanced agent tasks. Its unique...

textreasoningagents
40,960 ctxFree in
Explore specs and pricingView details โ†’

Qwen/Qwen3-32B

deepinfra

Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for...

textreasoningcheap
40,960 ctxFree in
Explore specs and pricingView details โ†’

Qwen/Qwen3-235B-A22B-Thinking-2507

deepinfra

Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tasks. It activates 22B of its 235B parameters per forward pass and natively supports up to 262,144...

textreasoningcheap
262,144 ctxFree in
Explore specs and pricingView details โ†’

mistralai/Mistral-Small-24B-Instruct-2501

deepinfra

Mistral Small 3 is a 24B-parameter language model optimized for low-latency performance across common AI tasks. Released under the Apache 2.0 license, it features both pre-trained and instruction-tuned versions designed...

textinstructcheap
32,768 ctxFree in
Explore specs and pricingView details โ†’

Qwen/Qwen3-VL-30B-A3B-Instruct

deepinfra

Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It excels in perception...

textvisioninstruct
131,072 ctxFree in
Explore specs and pricingView details โ†’

microsoft/phi-4

deepinfra

Microsoft Phi-4 14B โ€” small language model achieving state-of-the-art results on reasoning tasks.

textreasoningcheap
16,384 ctxFree in
Explore specs and pricingView details โ†’

Qwen/Qwen3-VL-235B-A22B-Instruct

deepinfra

Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text generation with visual understanding across images and video. The Instruct model targets general vision-language use (VQA, document parsing, chart/table...

textvisioninstruct
262,144 ctxFree in
Explore specs and pricingView details โ†’

Gryphe/MythoMax-L2-13b

deepinfra

One of the highest performing and most popular fine-tunes of Llama 2 13B, with rich descriptions and roleplay. #merge

textcheap
4,096 ctxFree in
Explore specs and pricingView details โ†’

Qwen/Qwen3-Max-Thinking

deepinfra

Qwen3-Max-Thinking is the flagship reasoning model in the Qwen3 series, designed for high-stakes cognitive tasks that require deep, multi-step reasoning. By significantly scaling model capacity and reinforcement learning compute, it...

textreasoningcheap
262,144 ctxFree in
Explore specs and pricingView details โ†’

Qwen/Qwen3-14B

deepinfra

Qwen3-14B is a dense 14.8B parameter causal language model from the Qwen3 series, designed for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for...

textreasoningcheap
40,960 ctxFree in
Explore specs and pricingView details โ†’

codestral-2508

mistralai

Our cutting-edge language model for coding released August 2025.

textcodecheap
256,000 ctxFree in
Explore specs and pricingView details โ†’

openai/gpt-oss-safeguard-20b

groq

gpt-oss-safeguard-20b is a safety reasoning model from OpenAI built upon gpt-oss-20b. This open-weight, 21B-parameter Mixture-of-Experts (MoE) model offers lower latency for safety tasks like content classification, LLM filtering, and trust...

textreasoningcheap
131,072 ctxFree in
Explore specs and pricingView details โ†’

mistral-embed-2312

mistralai

Official mistral-embed-2312 Mistral AI model

textcheap
8,192 ctxFree in
Explore specs and pricingView details โ†’

mistral-small-2603

mistralai

Mistral Small 4.

textcheaplong-context
262,144 ctxFree in
Explore specs and pricingView details โ†’

kimi-k2-thinking

moonshotai

Kimi K2 Thinking is the latest, most capable version of an open-source thinking model.

textreasoningcheap
262,144 ctxFree in
Explore specs and pricingView details โ†’

qwen3-coder-next

ollama

qwen3-coder-next โ€” available to run locally via Ollama on CPU and GPU hardware.

textcodecheap
262,144 ctxFree in
Explore specs and pricingView details โ†’

kimi-k2-thinking

ollama

kimi-k2-thinking โ€” available to run locally via Ollama on CPU and GPU hardware.

textreasoningcheap
262,144 ctxFree in
Explore specs and pricingView details โ†’

minimax-m2

ollama

minimax-m2 โ€” available to run locally via Ollama on CPU and GPU hardware.

textcheaplong-context
196,608 ctxFree in
Explore specs and pricingView details โ†’

Llama-3.1-70B-Instruct

meta-llama

Open-source Llama-3.1-70B-Instruct model from meta-llama โ€” available for download and self-hosting on Hugging Face.

textinstructcheap
131,072 ctxFree in
Explore specs and pricingView details โ†’

gemini-3-flash-preview

ollama

gemini-3-flash-preview โ€” available to run locally via Ollama on CPU and GPU hardware.

textcheaplong-context
1,048,576 ctxFree in
Explore specs and pricingView details โ†’

Llama-3.2-1B-Instruct

meta-llama

Open-source Llama-3.2-1B-Instruct model from meta-llama โ€” available for download and self-hosting on Hugging Face.

textinstructcheap
60,000 ctxFree in
Explore specs and pricingView details โ†’

Llama-3.1-8B-Instruct

meta-llama

Open-source Llama-3.1-8B-Instruct model from meta-llama โ€” available for download and self-hosting on Hugging Face.

textinstructcheap
16,384 ctxFree in
Explore specs and pricingView details โ†’

Qwen3-30B-A3B-Instruct-2507

qwen

Open-source Qwen3-30B-A3B-Instruct-2507 model from qwen โ€” available for download and self-hosting on Hugging Face.

textinstructcheap
262,144 ctx$0.00/1M in
Explore specs and pricingView details โ†’

Qwen3-Coder-30B-A3B-Instruct

qwen

Open-source Qwen3-Coder-30B-A3B-Instruct model from qwen โ€” available for download and self-hosting on Hugging Face.

textcodeinstruct
160,000 ctx$0.00/1M in
Explore specs and pricingView details โ†’

Qwen3-30B-A3B

qwen

Open-source Qwen3-30B-A3B model from qwen โ€” available for download and self-hosting on Hugging Face.

textcheap
40,960 ctx$0.00/1M in
Explore specs and pricingView details โ†’

Qwen3-14B

qwen

Open-source Qwen3-14B model from qwen โ€” available for download and self-hosting on Hugging Face.

textcheap
40,960 ctx$0.00/1M in
Explore specs and pricingView details โ†’

Qwen3-32B

qwen

Open-source Qwen3-32B model from qwen โ€” available for download and self-hosting on Hugging Face.

textcheap
40,960 ctx$0.00/1M in
Explore specs and pricingView details โ†’

Qwen3-8B

qwen

Open-source Qwen3-8B model from qwen โ€” available for download and self-hosting on Hugging Face.

textcheap
40,960 ctx$0.00/1M in
Explore specs and pricingView details โ†’

AI21 Jamba 1.6 Mini

ai21

AI21 Jamba 1.6 Mini is a lightweight Mamba-Transformer hybrid optimized for cost-effective, high-throughput inference with an impressive 256K context window. An excellent choice for document-heavy workloads on a budget.

long-contextinstructcheap
256,000 ctx$0.20/1M in
Explore specs and pricingView details โ†’

AI21 Jamba 1.6 Large

ai21

AI21 Jamba 1.6 Large uses a hybrid Mamba-Transformer architecture offering low memory footprint and high throughput compared to equivalent Transformer models. Features 256K context at a fraction of the inference cost.

long-contextinstructcheap
256,000 ctx$2.00/1M in
Explore specs and pricingView details โ†’

Microsoft Phi-4 Mini

microsoft

Microsoft Phi-4 Mini is a 3.8B parameter compact model from Microsoft. Delivers impressive reasoning capabilities for edge and mobile deployment scenarios, with strong performance on math and coding tasks relative to its size.

reasoningcodeinstruct
128,000 ctx$0.00/1M in
Explore specs and pricingView details โ†’