modelstop.top

AI Model Catalogue

Browse 454 models across providers, modalities, and use cases.

📄 Long Context

454 models · Page 1 of 13

llama-4-scout-17b-16e-instruct

meta

Meta's Llama 4 Scout is a 17 billion parameter model with 16 experts that is natively multimodal. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text and image understanding.

text · vision · instruct
131,000 ctx · $0.27/1M in

gemma-4-26b-a4b-it

google

Gemma 4 is Google's most intelligent family of open models, built from Gemini 3 research to maximize intelligence-per-parameter.

text · cheap · long-context
256,000 ctx · $0.10/1M in

llama-3.2-11b-vision-instruct

meta

The Llama 3.2-Vision instruction-tuned models are optimized for visual recognition, image reasoning, captioning, and answering general questions about an image.

text · vision · reasoning
128,000 ctx · $0.05/1M in

bge-base-en-v1.5

baai

BAAI general embedding (Base) model that transforms any given text into a 768-dimensional vector.

text · cheap · long-context
153,600 ctx · $0.07/1M in

mistral-small-3.1-24b-instruct

mistralai

Building upon Mistral Small 3 (2501), Mistral Small 3.1 (2503) adds state-of-the-art vision understanding and enhances long context capabilities up to 128k tokens without compromising text performance. With 24 billion parameters, this model achieves top-tier capabilities in both text and vision tasks.

text · vision · instruct
128,000 ctx · $0.35/1M in

gemma-sea-lion-v4-27b-it

aisingapore

SEA-LION (Southeast Asian Languages In One Network) is a collection of large language models (LLMs) pretrained and instruct-tuned for the Southeast Asia (SEA) region.

text · instruct · cheap
128,000 ctx · $0.35/1M in

gpt-oss-20b

openai

OpenAI’s open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases – gpt-oss-20b is for lower latency, and local or specialized use-cases.

text · reasoning · agents
128,000 ctx · $0.20/1M in

kimi-k2.6

moonshotai

Kimi K2.6 is a frontier-scale open-source 1T parameter model with a 262.1k context window, multi-turn tool calling, vision inputs, and structured outputs for agentic workloads.

text · vision · agents
262,144 ctx · $0.95/1M in

llama-guard-3-8b

meta

Llama Guard 3 is a Llama-3.1-8B pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classify content in both LLM inputs (prompt classification) and in LLM responses (response classification). It acts as an LLM – it generates text in its output that indicates whether a given prompt or response is safe or unsafe, and if unsafe, it also lists the content categories violated.

text · cheap · long-context
131,072 ctx · $0.48/1M in

nemotron-3-120b-a12b

nvidia

NVIDIA Nemotron 3 Super is a hybrid MoE model with leading accuracy for multi-agent applications and specialized agentic AI systems.

text · agents · cheap
256,000 ctx · $0.50/1M in

granite-4.0-h-micro

ibm-granite

Granite 4.0 instruct models deliver strong performance across benchmarks, achieving industry-leading results in key agentic tasks like instruction following and function calling. These efficiencies make the models well-suited for a wide range of use cases like retrieval-augmented generation (RAG), multi-agent workflows, and edge deployments.

text · agents · instruct
131,000 ctx · $0.02/1M in

gpt-oss-120b

openai

OpenAI’s open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases – gpt-oss-120b is for production, general purpose, high reasoning use-cases.

text · reasoning · agents
128,000 ctx · $0.35/1M in

kimi-k2.5

moonshotai

Kimi K2.5 is a frontier-scale open-source model with a 256k context window, multi-turn tool calling, vision inputs, and structured outputs for agentic workloads.

text · vision · agents
256,000 ctx · $0.60/1M in

glm-4.7-flash

zai-org

GLM-4.7-Flash is a fast and efficient multilingual text generation model with a 131,072 token context window. Optimized for dialogue, instruction-following, and multi-turn tool calling across 100+ languages.

text · multilingual · cheap
131,072 ctx · $0.06/1M in

Gemma 3 27B PT

google

Gemma 3 27B PT, available via AWS Bedrock (us-east-1).

text · vision · multimodal
131,072 ctx · Free in

GPT OSS Safeguard 20B

openai

GPT OSS Safeguard 20B, available via AWS Bedrock (us-east-1).

text · cheap · long-context
131,072 ctx · Free in

Gemma 3 4B IT

google

Gemma 3 4B IT, available via AWS Bedrock (us-east-1).

text · vision · multimodal
131,072 ctx · Free in

glm-5p1

fireworks

text · free · long-context
202,752 ctx · Free in

glm-5

fireworks

text · free · long-context
202,752 ctx · Free in

Qwen3-VL-30B-A3B-Instruct

qwen

Open-source Qwen3-VL-30B-A3B-Instruct model from qwen, available for download and self-hosting on Hugging Face.

text · instruct · cheap
131,072 ctx · $0.13/1M in

Qwen3.5-35B-A3B

qwen

Open-source Qwen3.5-35B-A3B model from qwen, available for download and self-hosting on Hugging Face.

text · cheap · long-context
Run locally
262,144 ctx · $0.16/1M in

Qwen3-VL-32B-Instruct

qwen

Open-source Qwen3-VL-32B-Instruct model from qwen, available for download and self-hosting on Hugging Face.

text · instruct · cheap
131,072 ctx · $0.10/1M in

Qwen3.5-9B

qwen

Open-source Qwen3.5-9B model from qwen, available for download and self-hosting on Hugging Face.

text · cheap · long-context
Run locally
262,144 ctx · $0.10/1M in

Qwen3-VL-8B-Instruct

qwen

Open-source Qwen3-VL-8B-Instruct model from qwen, available for download and self-hosting on Hugging Face.

text · instruct · cheap
Run locally
256,000 ctx · $0.08/1M in

Qwen3.5-27B

qwen

Open-source Qwen3.5-27B model from qwen, available for download and self-hosting on Hugging Face.

text · cheap · long-context
262,144 ctx · $0.20/1M in

Llama-Guard-4-12B

meta-llama

Open-source Llama-Guard-4-12B model from meta-llama, available for download and self-hosting on Hugging Face.

text · cheap · long-context
163,840 ctx · $0.18/1M in

Llama-3.2-11B-Vision-Instruct

meta-llama

Open-source Llama-3.2-11B-Vision-Instruct model from meta-llama, available for download and self-hosting on Hugging Face.

text · vision · instruct
131,072 ctx · $0.24/1M in

Llama-Guard-3-8B

meta-llama

Open-source Llama-Guard-3-8B model from meta-llama, available for download and self-hosting on Hugging Face.

text · cheap · long-context
131,072 ctx · $0.48/1M in

Qwen3-235B-A22B

qwen

Open-source Qwen3-235B-A22B model from qwen, available for download and self-hosting on Hugging Face.

text · cheap · long-context
Run locally
131,072 ctx · $0.46/1M in

Llama-3.3-70B-Instruct

meta-llama

Open-source Llama-3.3-70B-Instruct model from meta-llama, available for download and self-hosting on Hugging Face.

text · instruct · free
Run locally
131,072 ctx · Free in

Qwen3-Coder-Next

qwen

Open-source Qwen3-Coder-Next model from qwen, available for download and self-hosting on Hugging Face.

text · code · cheap
262,144 ctx · $0.15/1M in

Meta Llama 3.2 1B Instruct

meta-llama

text · instruct · cheap
Run locally
131,072 ctx · $0.06/1M in

Meta Llama 3.1 8B Instruct Turbo

meta-llama

text · instruct · cheap
Run locally
131,072 ctx · $0.18/1M in

Qwen2.5 72B Instruct Turbo

qwen

text · instruct · long-context
Run locally
131,072 ctx · $1.20/1M in

DeepSeek R1 (Original)

deepseek-ai

text · free · long-context
163,840 ctx · Free in

Nvidia Nemotron Nano 9B V2

nvidia

text · cheap · long-context
Run locally
131,072 ctx · $0.06/1M in