modelstop.top
Home/All Models

AI Model Catalogue

Browse 352 models across providers, modalities, and use cases.

๐ŸŒ All Models

352 models ยท Page 4 of 10

Qwen/Qwen3-235B-A22B-Thinking-2507

deepinfra

Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tasks. It activates 22B of its 235B parameters per forward pass and natively supports up to 262,144...

textreasoningcheap
262,144 ctxFree in
Explore specs and pricingView details โ†’

qwen/qwen3-32b

groq

Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for...

textreasoningcheap
131,072 ctxFree in
Explore specs and pricingView details โ†’

openai/gpt-oss-safeguard-20b

groq

gpt-oss-safeguard-20b is a safety reasoning model from OpenAI built upon gpt-oss-20b. This open-weight, 21B-parameter Mixture-of-Experts (MoE) model offers lower latency for safety tasks like content classification, LLM filtering, and trust...

textreasoningcheap
131,072 ctxFree in
Explore specs and pricingView details โ†’

mistral-small-2603

mistralai

Mistral Small 4.

textcheaplong-context
262,144 ctxFree in
Explore specs and pricingView details โ†’

mistral-embed-2312

mistralai

Official mistral-embed-2312 Mistral AI model

textcheap
8,192 ctxFree in
Explore specs and pricingView details โ†’

codestral-2508

mistralai

Our cutting-edge language model for coding released August 2025.

textcodecheap
256,000 ctxFree in
Explore specs and pricingView details โ†’

kimi-k2-thinking

moonshotai

Kimi K2 Thinking is the latest, most capable version of an open-source thinking model.

textreasoningcheap
Run locally
262,144 ctxFree in
Explore specs and pricingView details โ†’

gemini-3-flash-preview

ollama

gemini-3-flash-preview โ€” available to run locally via Ollama on CPU and GPU hardware.

textcheaplong-context
1,048,576 ctxFree in
Explore specs and pricingView details โ†’

minimax-m2

ollama

minimax-m2 โ€” available to run locally via Ollama on CPU and GPU hardware.

textcheaplong-context
Run locally
204,800 ctxFree in
Explore specs and pricingView details โ†’

qwen3-coder-next

ollama

qwen3-coder-next โ€” available to run locally via Ollama on CPU and GPU hardware.

textcodecheap
262,144 ctxFree in
Explore specs and pricingView details โ†’

Llama-3.1-70B-Instruct

meta-llama

Open-source Llama-3.1-70B-Instruct model from meta-llama โ€” available for download and self-hosting on Hugging Face.

textinstructcheap
131,072 ctxFree in
Explore specs and pricingView details โ†’

kimi-k2-thinking

ollama

kimi-k2-thinking โ€” available to run locally via Ollama on CPU and GPU hardware.

textreasoningcheap
262,144 ctxFree in
Explore specs and pricingView details โ†’

Llama-3.2-1B-Instruct

meta-llama

Open-source Llama-3.2-1B-Instruct model from meta-llama โ€” available for download and self-hosting on Hugging Face.

textinstructcheap
60,000 ctxFree in
Explore specs and pricingView details โ†’

Llama-3.1-8B-Instruct

meta-llama

Open-source Llama-3.1-8B-Instruct model from meta-llama โ€” available for download and self-hosting on Hugging Face.

textinstructcheap
Run locally
131,072 ctxFree in
Explore specs and pricingView details โ†’

Qwen3-30B-A3B-Instruct-2507

qwen

Open-source Qwen3-30B-A3B-Instruct-2507 model from qwen โ€” available for download and self-hosting on Hugging Face.

textinstructcheap
Run locally
262,144 ctx$0.00/1M in
Explore specs and pricingView details โ†’

Qwen3-Coder-30B-A3B-Instruct

qwen

Open-source Qwen3-Coder-30B-A3B-Instruct model from qwen โ€” available for download and self-hosting on Hugging Face.

textcodeinstruct
Run locally
160,000 ctx$0.00/1M in
Explore specs and pricingView details โ†’

Qwen3-30B-A3B

qwen

Open-source Qwen3-30B-A3B model from qwen โ€” available for download and self-hosting on Hugging Face.

textcheaplong-context
Run locally
131,072 ctx$0.00/1M in
Explore specs and pricingView details โ†’

Qwen3-14B

qwen

Open-source Qwen3-14B model from qwen โ€” available for download and self-hosting on Hugging Face.

textcheap
Run locally
40,960 ctx$0.00/1M in
Explore specs and pricingView details โ†’

Qwen3-32B

qwen

Open-source Qwen3-32B model from qwen โ€” available for download and self-hosting on Hugging Face.

textcheaplong-context
Run locally
131,072 ctx$0.00/1M in
Explore specs and pricingView details โ†’

Qwen3-8B

qwen

Open-source Qwen3-8B model from qwen โ€” available for download and self-hosting on Hugging Face.

textcheap
Run locally
40,960 ctx$0.00/1M in
Explore specs and pricingView details โ†’

AI21 Jamba 1.6 Mini

ai21

AI21 Jamba 1.6 Mini is a lightweight Mamba-Transformer hybrid optimized for cost-effective, high-throughput inference with an impressive 256K context window. An excellent choice for document-heavy workloads on a budget.

long-contextinstructcheap
256,000 ctx$0.20/1M in
Explore specs and pricingView details โ†’

AI21 Jamba 1.6 Large

ai21

AI21 Jamba 1.6 Large uses a hybrid Mamba-Transformer architecture offering low memory footprint and high throughput compared to equivalent Transformer models. Features 256K context at a fraction of the inference cost.

long-contextinstructcheap
256,000 ctx$2.00/1M in
Explore specs and pricingView details โ†’

Microsoft Phi-4 Mini

microsoft

Microsoft Phi-4 Mini is a 3.8B parameter compact model from Microsoft. Delivers impressive reasoning capabilities for edge and mobile deployment scenarios, with strong performance on math and coding tasks relative to its size.

reasoningcodeinstruct
Run locally
128,000 ctx$0.00/1M in
Explore specs and pricingView details โ†’

IBM Granite 3.0 2B Instruct

IBM Research

IBM Granite 3.0 2B Instruct is an ultra-compact enterprise model excelling at summarization, extraction, and classification. The smallest model in the Granite family, suitable for edge deployments and constrained environments.

instructopen-sourcecheap
Run locally
128,000 ctx$0.00/1M in
Explore specs and pricingView details โ†’

Amazon Nova Lite

amazon

Amazon Nova Lite is a very low-cost multimodal model that can process image, video, and text inputs. Fast and accurate for a wide range of tasks requiring visual and language understanding.

visionmultimodalcheap
300,000 ctx$0.06/1M in
Explore specs and pricingView details โ†’

Amazon Nova Micro

amazon

Amazon Nova Micro is the fastest and most cost-effective text-only model in the Nova family, optimized for speed and low latency. Ideal for customer service, summarization, and translation at scale.

cheapinstruct
128,000 ctx$0.04/1M in
Explore specs and pricingView details โ†’

OpenAI: GPT-3.5 Turbo

openai

GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.

textcodecheap
Run locally
16,385 ctx$0.50/1M in
Explore specs and pricingView details โ†’

MythoMax 13B

gryphe

One of the highest performing and most popular fine-tunes of Llama 2 13B, with rich descriptions and roleplay. #merge

textcheap
Run locally
4,096 ctx$0.06/1M in
Explore specs and pricingView details โ†’

ReMM SLERP 13B

undi95

A recreation trial of the original MythoMax-L2-B13 but with updated models. #merge

textcheap
6,144 ctx$0.45/1M in
Explore specs and pricingView details โ†’

Mancer: Weaver (alpha)

mancer

An attempt to recreate Claude-style verbosity, but don't expect the same level of coherence or memory. Meant for use in roleplay/narrative situations.

textcheap
Run locally
8,000 ctx$0.75/1M in
Explore specs and pricingView details โ†’

Mistral: Mistral 7B Instruct v0.1

mistralai

A 7.3B parameter model that outperforms Llama 2 13B on all benchmarks, with optimizations for speed and context length.

textinstructcheap
2,824 ctx$0.11/1M in
Explore specs and pricingView details โ†’

Auto Router

openrouter

Your prompt will be processed by a meta-model and routed to one of dozens of models (see below), optimizing for the best possible output. To see which model was used,...

textvisionmultimodal
2,000,000 ctxFree in
Explore specs and pricingView details โ†’

Mistral: Mixtral 8x7B Instruct

mistralai

Mixtral 8x7B Instruct is a pretrained generative Sparse Mixture of Experts, by Mistral AI, for chat and instruction use. Incorporates 8 experts (feed-forward networks) for a total of 47 billion...

textinstructcheap
32,768 ctx$0.54/1M in
Explore specs and pricingView details โ†’

OpenAI: GPT-3.5 Turbo (older v0613)

openai

GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.

textcodecheap
4,095 ctx$1.00/1M in
Explore specs and pricingView details โ†’

Anthropic: Claude 3 Haiku

anthropic

Claude 3 Haiku is Anthropic's fastest and most compact model for near-instant responsiveness. Quick and accurate targeted performance. See the launch announcement and benchmark results [here](https://www.anthropic.com/news/claude-3-haiku) #multimodal

textvisionmultimodal
Run locally
200,000 ctx$0.25/1M in
Explore specs and pricingView details โ†’

WizardLM-2 8x22B

microsoft

WizardLM-2 8x22B is Microsoft AI's most advanced Wizard model. It demonstrates highly competitive performance compared to leading proprietary models, and it consistently outperforms all existing state-of-the-art opensource models. It is...

textcheap
Run locally
65,536 ctx$0.62/1M in
Explore specs and pricingView details โ†’