modelstop.top

AI Model Catalogue

Browse 837 models across providers, modalities, and use cases.

🌐 All Models

837 models · Page 23 of 24

phi-2

microsoft

Open-source phi-2 model from microsoft, available for download and self-hosting on Hugging Face.

text · free
ctx · $0.00/1M in

Meta-Llama-3-8B-Instruct

meta-llama

Open-source Meta-Llama-3-8B-Instruct model from meta-llama, available for download and self-hosting on Hugging Face.

text · instruct · free
ctx · $0.00/1M in

SmolLM2-135M

huggingfacetb

SmolLM2-135M: open-source model from huggingfacetb, available for self-hosting on Hugging Face.

text · free
ctx · $0.00/1M in

Phi-3-mini-4k-instruct-gptq-4bit

kaitchup

Open-source Phi-3-mini-4k-instruct-gptq-4bit model from kaitchup, available for download and self-hosting on Hugging Face.

text · instruct · free
ctx · $0.00/1M in

NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

nvidia

Open-source NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 model from nvidia, available for download and self-hosting on Hugging Face.

text · free
ctx · $0.00/1M in

tiny-random-LlamaForCausalLM

hmellor

tiny-random-LlamaForCausalLM: open-source model from hmellor, available for self-hosting on Hugging Face.

text · free
ctx · $0.00/1M in

Llama-2-7b-hf

meta-llama

Llama-2-7b-hf: Meta's open-source Llama language model, one of the most widely deployed open models.

text · free
ctx · $0.00/1M in

GLM-5-FP8

zai-org

GLM-5-FP8: open-source model from zai-org, available for self-hosting on Hugging Face.

text · free
ctx · $0.00/1M in

NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4

nvidia

Open-source NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4 model from nvidia, available for download and self-hosting on Hugging Face.

text · free
ctx · $0.00/1M in

Qwen3-4B-Thinking-2507

qwen

Open-source Qwen3-4B-Thinking-2507 model from qwen, available for download and self-hosting on Hugging Face.

text · reasoning · free
ctx · $0.00/1M in

DeepSeek-R1-Distill-Llama-8B

deepseek-ai

Open-source DeepSeek-R1-Distill-Llama-8B model from deepseek-ai, available for download and self-hosting on Hugging Face.

text · free
ctx · $0.00/1M in

pythia-160m

eleutherai

pythia-160m: open-source model from eleutherai, available for self-hosting on Hugging Face.

text · free
ctx · $0.00/1M in

distilgpt2

distilbert

Open-source distilgpt2 model from distilbert, available for download and self-hosting on Hugging Face.

text · free
ctx · $0.00/1M in

gpt2-large

openai-community

gpt2-large: open-source model from openai-community, available for self-hosting on Hugging Face.

text · free
ctx · $0.00/1M in

Meta-Llama-3-8B

meta-llama

Open-source Meta-Llama-3-8B model from meta-llama, available for download and self-hosting on Hugging Face.

text · free
ctx · $0.00/1M in

gpt-oss-120b

openai

Open-source gpt-oss-120b model from openai, available for download and self-hosting on Hugging Face.

text · free · long-context
131,072 ctx · $0.00/1M in

DeepSeek-R1

deepseek-ai

Open-source DeepSeek-R1 model from deepseek-ai, available for download and self-hosting on Hugging Face.

text · free
ctx · $0.00/1M in

gpt-oss-20b

openai

Open-source gpt-oss-20b model from openai, available for download and self-hosting on Hugging Face.

text · free · long-context
131,072 ctx · $0.00/1M in

opt-125m

facebook

Open-source opt-125m model from facebook, available for download and self-hosting on Hugging Face.

text · free
ctx · $0.00/1M in

Qwen3-4B-Instruct-2507

qwen

Open-source Qwen3-4B-Instruct-2507 model from qwen, available for download and self-hosting on Hugging Face.

text · instruct · free
ctx · $0.00/1M in

Qwen3-4B

qwen

Open-source Qwen3-4B model from qwen, available for download and self-hosting on Hugging Face.

text · free
ctx · $0.00/1M in

gpt2

openai-community

Open-source gpt2 model from openai-community, available for download and self-hosting on Hugging Face.

text · free
ctx · $0.00/1M in

Nous: Hermes 3 405B Instruct (free)

nousresearch

Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...

text · reasoning · agents
131,072 ctx · $1.00/1M in

Meta: Llama 3.2 3B Instruct (free)

meta-llama

Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for advanced natural language processing tasks like dialogue generation, reasoning, and summarization. Designed with the latest transformer architecture, it...

text · reasoning · multilingual
131,072 ctx · $0.05/1M in

Meta: Llama 3.3 70B Instruct (free)

meta-llama

The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction tuned generative model in 70B (text in/text out). The Llama 3.3 instruction tuned text only model...

text · multilingual · instruct
65,536 ctx · $0.10/1M in

Google: Gemma 3 27B (free)

google

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

text · vision · multimodal
131,072 ctx · $0.08/1M in

Google: Gemma 3 12B (free)

google

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

text · vision · multimodal
32,768 ctx · $0.04/1M in

Google: Gemma 3 4B (free)

google

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

text · vision · multimodal
32,768 ctx · $0.04/1M in

Google: Gemma 3n 4B (free)

google

Gemma 3n E4B-it is optimized for efficient execution on mobile and low-resource devices, such as phones, laptops, and tablets. It supports multimodal inputs, including text, visual data, and audio, enabling diverse tasks...

text · vision · audio
8,192 ctx · $0.02/1M in

Qwen: Qwen3 Coder 480B A35B (free)

qwen

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model developed by the Qwen team. It is optimized for agentic coding tasks such as function calling, tool use, and long-context reasoning over...

text · code · reasoning
262,000 ctx · $0.22/1M in

Z.ai: GLM 4.5 Air (free)

z-ai

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter...

text · agents · free
131,072 ctx · $0.13/1M in

OpenAI: gpt-oss-20b (free)

openai

gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 license. It uses a Mixture-of-Experts (MoE) architecture with 3.6B active parameters per forward pass, optimized for...

text · free · long-context
131,072 ctx · $0.03/1M in

OpenAI: gpt-oss-120b (free)

openai

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized...

text · reasoning · agents
131,072 ctx · $0.04/1M in

NVIDIA: Nemotron Nano 9B V2 (free)

nvidia

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and...

text · reasoning · free
128,000 ctx · $0.04/1M in

Qwen: Qwen3 Next 80B A3B Instruct (free)

qwen

Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without “thinking” traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual...

text · code · reasoning
262,144 ctx · $0.09/1M in

NVIDIA: Nemotron Nano 12B 2 VL (free)

nvidia

NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba's...

text · vision · multimodal
128,000 ctx · $0.20/1M in