modelstop.top
Home/All Models

AI Model Catalogue

Browse 174 models across providers, modalities, and use cases.

🌐 All Models

174 models · Page 5 of 5

OpenAI: GPT-5.1

openai

GPT-5.1 is the latest frontier-grade model in the GPT-5 series, offering stronger general-purpose reasoning, improved instruction adherence, and a more natural conversational style compared to GPT-5. It uses adaptive reasoning...

textvisionmultimodal
400,000 ctx$1.25/1M in
Explore specs and pricingView details →

xAI: Grok 4.1 Fast

x-ai

Grok 4.1 Fast is xAI's best agentic tool calling model that shines in real-world use cases like customer support and deep research. 2M context window. Reasoning can be enabled/disabled using...

textvisionmultimodal
2,000,000 ctx$0.20/1M in
Explore specs and pricingView details →

Google: Nano Banana Pro (Gemini 3 Pro Image Preview)

google

Nano Banana Pro is Google’s most advanced image-generation and editing model, built on Gemini 3 Pro. It extends the original Nano Banana with significantly improved multimodal reasoning, real-world grounding, and...

textvisionimage
65,536 ctx$2.00/1M in
Explore specs and pricingView details →

AllenAI: Olmo 3 32B Think

allenai

Olmo 3 32B Think is a large-scale, 32-billion-parameter model purpose-built for deep reasoning, complex logic chains and advanced instruction-following scenarios. Its capacity enables strong performance on demanding evaluation tasks and...

textreasoningcheap
65,536 ctx$0.15/1M in
Explore specs and pricingView details →

Anthropic: Claude Opus 4.5

anthropic

Claude Opus 4.5 is Anthropic’s frontier reasoning model optimized for complex software engineering, agentic workflows, and long-horizon computer use. It offers strong multimodal capabilities, competitive performance across real-world coding and...

textvisionmultimodal
200,000 ctx$5.00/1M in
Explore specs and pricingView details →

DeepSeek: DeepSeek V3.2

deepseek

DeepSeek-V3.2 is a large language model designed to harmonize high computational efficiency with strong reasoning and agentic tool-use performance. It introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism...

textreasoningagents
163,840 ctx$0.26/1M in
Explore specs and pricingView details →

DeepSeek: DeepSeek V3.2 Speciale

deepseek

DeepSeek-V3.2-Speciale is a high-compute variant of DeepSeek-V3.2 optimized for maximum reasoning and agentic performance. It builds on DeepSeek Sparse Attention (DSA) for efficient long-context processing, then scales post-training reinforcement learning...

textreasoningagents
163,840 ctx$0.40/1M in
Explore specs and pricingView details →

Arcee AI: Trinity Mini

arcee-ai

Trinity Mini is a 26B-parameter (3B active) sparse mixture-of-experts language model featuring 128 experts with 8 active per token. Engineered for efficient reasoning over long contexts (131k) with robust function...

textreasoningcheap
131,072 ctx$0.04/1M in
Explore specs and pricingView details →

Amazon: Nova 2 Lite

amazon

Nova 2 Lite is a fast, cost-effective reasoning model for everyday workloads that can process text, images, and videos to generate text. Nova 2 Lite demonstrates standout capabilities in processing...

textvisionimage
1,000,000 ctx$0.30/1M in
Explore specs and pricingView details →

OpenAI: GPT-5.1-Codex-Max

openai

GPT-5.1-Codex-Max is OpenAI’s latest agentic coding model, designed for long-running, high-context software development tasks. It is based on an updated version of the 5.1 reasoning stack and trained on agentic...

textvisionmultimodal
400,000 ctx$1.25/1M in
Explore specs and pricingView details →

EssentialAI: Rnj 1 Instruct

essentialai

Rnj-1 is an 8B-parameter, dense, open-weight model family developed by Essential AI and trained from scratch with a focus on programming, math, and scientific reasoning. The model demonstrates strong performance...

textreasoninginstruct
32,768 ctx$0.15/1M in
Explore specs and pricingView details →

Z.ai: GLM 4.6V

z-ai

GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and long-context reasoning across images, documents, and mixed media. It supports up to 128K tokens, processes complex page layouts...

textvisionmultimodal
131,072 ctx$0.30/1M in
Explore specs and pricingView details →

OpenAI: GPT-5.2

openai

GPT-5.2 is the latest frontier-grade model in the GPT-5 series, offering stronger agentic and long context perfomance compared to GPT-5.1. It uses adaptive reasoning to allocate computation dynamically, responding quickly...

textvisionmultimodal
400,000 ctx$1.75/1M in
Explore specs and pricingView details →

OpenAI: GPT-5.2 Pro

openai

GPT-5.2 Pro is OpenAI’s most advanced model, offering major improvements in agentic coding and long context performance over GPT-5 Pro. It is optimized for complex tasks that require step-by-step reasoning,...

textvisionmultimodal
400,000 ctx$21.00/1M in
Explore specs and pricingView details →

OpenAI: GPT-5.2 Chat

openai

GPT-5.2 Chat (AKA Instant) is the fast, lightweight member of the 5.2 family, optimized for low-latency chat while retaining strong general intelligence. It uses adaptive reasoning to selectively “think” on...

textvisionmultimodal
128,000 ctx$1.75/1M in
Explore specs and pricingView details →

Google: Gemini 3 Flash Preview

google

Gemini 3 Flash Preview is a high speed, high value thinking model designed for agentic workflows, multi turn chat, and coding assistance. It delivers near Pro level reasoning and tool...

textvisionmultimodal
1,048,576 ctx$0.50/1M in
Explore specs and pricingView details →

Z.ai: GLM 4.7

z-ai

GLM-4.7 is Z.ai’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution. It demonstrates significant improvements in executing complex agent tasks while...

textreasoningagents
202,752 ctx$0.39/1M in
Explore specs and pricingView details →

ByteDance Seed: Seed 1.6

bytedance-seed

Seed 1.6 is a general-purpose model released by the ByteDance Seed team. It incorporates multimodal capabilities and adaptive deep thinking with a 256K context window.

textvisionmultimodal
262,144 ctx$0.25/1M in
Explore specs and pricingView details →

ByteDance Seed: Seed 1.6 Flash

bytedance-seed

Seed 1.6 Flash is an ultra-fast multimodal deep thinking model by ByteDance Seed, supporting both text and visual understanding. It features a 256k context window and can generate outputs of...

textvisionimage
262,144 ctx$0.07/1M in
Explore specs and pricingView details →

Qwen: Qwen3 Max Thinking

qwen

Qwen3-Max-Thinking is the flagship reasoning model in the Qwen3 series, designed for high-stakes cognitive tasks that require deep, multi-step reasoning. By significantly scaling model capacity and reinforcement learning compute, it...

textreasoningcheap
262,144 ctx$0.78/1M in
Explore specs and pricingView details →

Google: Gemini 3.1 Pro Preview

google

Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows. Building on the multimodal foundation...

textvisionmultimodal
1,048,576 ctx$2.00/1M in
Explore specs and pricingView details →

OpenAI: GPT-5.3-Codex

openai

GPT-5.3-Codex is OpenAI’s most advanced agentic coding model, combining the frontier software engineering performance of GPT-5.2-Codex with the broader reasoning and professional knowledge capabilities of GPT-5.2. It achieves state-of-the-art results...

textvisionmultimodal
400,000 ctx$1.75/1M in
Explore specs and pricingView details →

ByteDance Seed: Seed-2.0-Mini

bytedance-seed

Seed-2.0-mini targets latency-sensitive, high-concurrency, and cost-sensitive scenarios, emphasizing fast response and flexible inference deployment. It delivers performance comparable to ByteDance-Seed-1.6, supports 256k context, four reasoning effort modes (minimal/low/medium/high), multimodal understanding,...

textvisionmultimodal
262,144 ctx$0.10/1M in
Explore specs and pricingView details →

Inception: Mercury 2

inception

Mercury 2 is an extremely fast reasoning LLM, and the first reasoning diffusion LLM (dLLM). Instead of generating tokens sequentially, Mercury 2 produces and refines multiple tokens in parallel, achieving...

textimagereasoning
128,000 ctx$0.25/1M in
Explore specs and pricingView details →

OpenAI: GPT-5.4 Pro

openai

GPT-5.4 Pro is OpenAI's most advanced model, building on GPT-5.4's unified architecture with enhanced reasoning capabilities for complex, high-stakes tasks. It features a 1M+ token context window (922K input, 128K...

textvisionmultimodal
1,050,000 ctx$30.00/1M in
Explore specs and pricingView details →

Qwen: Qwen3.5-9B

qwen

Qwen3.5-9B is a multimodal foundation model from the Qwen3.5 family, designed to deliver strong reasoning, coding, and visual understanding in an efficient 9B-parameter architecture. It uses a unified vision-language design...

textvisionmultimodal
262,144 ctx$0.05/1M in
Explore specs and pricingView details →

Mistral: Mistral Small 4

mistralai

Mistral Small 4 is the next major release in the Mistral Small family, unifying the capabilities of several flagship Mistral models into a single system. It combines strong reasoning from...

textvisionmultimodal
262,144 ctx$0.15/1M in
Explore specs and pricingView details →

OpenAI: GPT-5.4 Mini

openai

GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads. It supports text and image inputs with strong performance across reasoning, coding,...

textvisionmultimodal
400,000 ctx$0.75/1M in
Explore specs and pricingView details →

Arcee AI: Trinity Large Thinking

arcee-ai

Trinity Large Thinking is a powerful open source reasoning model from the team at Arcee AI. It shows strong performance in PinchBench, agentic workloads, and reasoning tasks. Launch video: https://youtu.be/Gc82AXLa0Rg?si=4RLn6WBz33qT--B7

textreasoningagents
262,144 ctx$0.22/1M in
Explore specs and pricingView details →

Google: Gemma 4 31B (free)

google

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...

textvisionmultimodal
262,144 ctx$0.14/1M in
Explore specs and pricingView details →