modelstop.top
Home/All Models

AI Model Catalogue

Browse 287 models across providers, modalities, and use cases.

🌐 All Models

287 models Β· Page 8 of 8

AllenAI: Olmo 3.1 32B Instruct

allenai

Olmo 3.1 32B Instruct is a large-scale, 32-billion-parameter instruction-tuned language model engineered for high-performance conversational AI, multi-turn dialogue, and practical instruction following. As part of the Olmo 3.1 family, this...

textinstructcheap
65,536 ctx$0.20/1M in
Explore specs and pricingView details β†’

Z.ai: GLM 4.7 Flash

z-ai

As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning,...

textcodeagents
202,752 ctx$0.06/1M in
Explore specs and pricingView details β†’

OpenAI: GPT Audio Mini

openai

A cost-efficient version of GPT Audio. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Input is priced at $0.60 per million...

textaudiocheap
128,000 ctx$0.60/1M in
Explore specs and pricingView details β†’

Writer: Palmyra X5

writer

Palmyra X5 is Writer's most advanced model, purpose-built for building and scaling AI agents across the enterprise. It delivers industry-leading speed and efficiency on context windows up to 1 million...

textagentscheap
1,040,000 ctx$0.60/1M in
Explore specs and pricingView details β†’

MiniMax: MiniMax M2-her

minimax

MiniMax M2-her is a dialogue-first large language model built for immersive roleplay, character-driven chat, and expressive multi-turn conversations. Designed to stay consistent in tone and personality, it supports rich message...

textcheap
65,536 ctx$0.30/1M in
Explore specs and pricingView details β†’

Upstage: Solar Pro 3

upstage

Solar Pro 3 is Upstage's powerful Mixture-of-Experts (MoE) language model. With 102B total parameters and 12B active parameters per forward pass, it delivers exceptional performance while maintaining computational efficiency. Optimized...

textcheaplong-context
128,000 ctx$0.15/1M in
Explore specs and pricingView details β†’

StepFun: Step 3.5 Flash

stepfun

Step 3.5 Flash is StepFun's most capable open-source foundation model. Built on a sparse Mixture of Experts (MoE) architecture, it selectively activates only 11B of its 196B parameters per token....

textcheaplong-context
262,144 ctx$0.10/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 Coder Next

qwen

Qwen3-Coder-Next is an open-weight causal language model optimized for coding agents and local development workflows. It uses a sparse MoE design with 80B total parameters and only 3B activated per...

textcodeagents
262,144 ctx$0.12/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3 Max Thinking

qwen

Qwen3-Max-Thinking is the flagship reasoning model in the Qwen3 series, designed for high-stakes cognitive tasks that require deep, multi-step reasoning. By significantly scaling model capacity and reinforcement learning compute, it...

textreasoningcheap
262,144 ctx$0.78/1M in
Explore specs and pricingView details β†’

Z.ai: GLM 5

z-ai

GLM-5 is Z.ai’s flagship open-source foundation model engineered for complex systems design and long-horizon agent workflows. Built for expert developers, it delivers production-grade performance on large-scale programming tasks, rivaling leading...

textagentscheap
80,000 ctx$0.72/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3.5 397B A17B

qwen

The Qwen3.5 series 397B-A17B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. It delivers...

textvisionmultimodal
262,144 ctx$0.39/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3.5 Plus 2026-02-15

qwen

The Qwen3.5 native vision-language series Plus models are built on a hybrid architecture that integrates linear attention mechanisms with sparse mixture-of-experts models, achieving higher inference efficiency. In a variety of...

textvisionmultimodal
1,000,000 ctx$0.26/1M in
Explore specs and pricingView details β†’

AionLabs: Aion-2.0

aion-labs

Aion-2.0 is a variant of DeepSeek V3.2 optimized for immersive roleplaying and storytelling. It is particularly strong at introducing tension, crises, and conflict into stories, making narratives feel more engaging....

textcheaplong-context
131,072 ctx$0.80/1M in
Explore specs and pricingView details β†’

LiquidAI: LFM2-24B-A2B

liquid

LFM2-24B-A2B is the largest model in the LFM2 family of hybrid architectures designed for efficient on-device deployment. Built as a 24B parameter Mixture-of-Experts model with only 2B active parameters per...

textcheap
32,768 ctx$0.03/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3.5-Flash

qwen

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the...

textvisionmultimodal
1,000,000 ctx$0.07/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3.5-122B-A10B

qwen

The Qwen3.5 122B-A10B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. In terms of...

textvisionmultimodal
262,144 ctx$0.26/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3.5-27B

qwen

The Qwen3.5 27B native vision-language Dense model incorporates a linear attention mechanism, delivering fast response times while balancing inference speed and performance. Its overall capabilities are comparable to those of...

textvisionmultimodal
262,144 ctx$0.20/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3.5-35B-A3B

qwen

The Qwen3.5 Series 35B-A3B is a native vision-language model designed with a hybrid architecture that integrates linear attention mechanisms and a sparse mixture-of-experts model, achieving higher inference efficiency. Its overall...

textvisionmultimodal
262,144 ctx$0.16/1M in
Explore specs and pricingView details β†’

Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)

google

Gemini 3.1 Flash Image Preview, a.k.a. "Nano Banana 2," is Google’s latest state of the art image generation and editing model, delivering Pro-level visual quality at Flash speed. It combines...

textvisionimage
65,536 ctx$0.50/1M in
Explore specs and pricingView details β†’

ByteDance Seed: Seed-2.0-Mini

bytedance-seed

Seed-2.0-mini targets latency-sensitive, high-concurrency, and cost-sensitive scenarios, emphasizing fast response and flexible inference deployment. It delivers performance comparable to ByteDance-Seed-1.6, supports 256k context, four reasoning effort modes (minimal/low/medium/high), multimodal understanding,...

textvisionmultimodal
262,144 ctx$0.10/1M in
Explore specs and pricingView details β†’

Google: Gemini 3.1 Flash Lite Preview

google

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across...

textvisionmultimodal
1,048,576 ctx$0.25/1M in
Explore specs and pricingView details β†’

Inception: Mercury 2

inception

Mercury 2 is an extremely fast reasoning LLM, and the first reasoning diffusion LLM (dLLM). Instead of generating tokens sequentially, Mercury 2 produces and refines multiple tokens in parallel, achieving...

textimagereasoning
128,000 ctx$0.25/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3.5-9B

qwen

Qwen3.5-9B is a multimodal foundation model from the Qwen3.5 family, designed to deliver strong reasoning, coding, and visual understanding in an efficient 9B-parameter architecture. It uses a unified vision-language design...

textvisionmultimodal
262,144 ctx$0.05/1M in
Explore specs and pricingView details β†’

ByteDance Seed: Seed-2.0-Lite

bytedance-seed

Seed-2.0-Lite is a versatile, cost‑efficient enterprise workhorse that delivers strong multimodal and agent capabilities while offering noticeably lower latency, making it a practical default choice for most production workloads across...

textvisionmultimodal
262,144 ctx$0.25/1M in
Explore specs and pricingView details β†’

Mistral: Mistral Small 4

mistralai

Mistral Small 4 is the next major release in the Mistral Small family, unifying the capabilities of several flagship Mistral models into a single system. It combines strong reasoning from...

textvisionmultimodal
262,144 ctx$0.15/1M in
Explore specs and pricingView details β†’

OpenAI: GPT-5.4 Mini

openai

GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads. It supports text and image inputs with strong performance across reasoning, coding,...

textvisionmultimodal
400,000 ctx$0.75/1M in
Explore specs and pricingView details β†’

OpenAI: GPT-5.4 Nano

openai

GPT-5.4 nano is the most lightweight and cost-efficient variant of the GPT-5.4 family, optimized for speed-critical and high-volume tasks. It supports text and image inputs and is designed for low-latency...

textvisionmultimodal
400,000 ctx$0.20/1M in
Explore specs and pricingView details β†’

MiniMax: MiniMax M2.7

minimax

MiniMax-M2.7 is a next-generation large language model designed for autonomous, real-world productivity and continuous improvement. Built to actively participate in its own evolution, M2.7 integrates advanced agentic capabilities through multi-agent...

textagentscheap
196,608 ctx$0.30/1M in
Explore specs and pricingView details β†’

Xiaomi: MiMo-V2-Pro

xiaomi

MiMo-V2-Pro is Xiaomi's flagship foundation model, featuring over 1T total parameters and a 1M context length, deeply optimized for agentic scenarios. It is highly adaptable to general agent frameworks like...

textagentscheap
1,048,576 ctx$1.00/1M in
Explore specs and pricingView details β†’

Xiaomi: MiMo-V2-Omni

xiaomi

MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. It combines strong multimodal perception with agentic capability - visual grounding, multi-step...

textvisionmultimodal
262,144 ctx$0.40/1M in
Explore specs and pricingView details β†’

Reka Edge

rekaai

Reka Edge is an extremely efficient 7B multimodal vision-language model that accepts image/video+text inputs and generates text outputs. This model is optimized specifically to deliver industry-leading performance in image understanding,...

textvisionimage
16,384 ctx$0.10/1M in
Explore specs and pricingView details β†’

Kwaipilot: KAT-Coder-Pro V2

kwaipilot

KAT-Coder-Pro V2 is the latest high-performance model in KwaiKAT’s KAT-Coder series, designed for complex enterprise-grade software engineering and SaaS integration. It builds on the agentic coding strengths of earlier versions,...

textcodeagents
256,000 ctx$0.30/1M in
Explore specs and pricingView details β†’

Arcee AI: Trinity Large Thinking

arcee-ai

Trinity Large Thinking is a powerful open source reasoning model from the team at Arcee AI. It shows strong performance in PinchBench, agentic workloads, and reasoning tasks. Launch video: https://youtu.be/Gc82AXLa0Rg?si=4RLn6WBz33qT--B7

textreasoningagents
262,144 ctx$0.22/1M in
Explore specs and pricingView details β†’

Qwen: Qwen3.6 Plus

qwen

Qwen 3.6 Plus builds on a hybrid architecture that combines efficient linear attention with sparse mixture-of-experts routing, enabling strong scalability and high-performance inference. Compared to the 3.5 series, it delivers...

textvisionmultimodal
1,000,000 ctx$0.33/1M in
Explore specs and pricingView details β†’

Z.ai: GLM 5.1

z-ai

GLM-5.1 delivers a major leap in coding capability, with particularly significant gains in handling long-horizon tasks. Unlike previous models built around minute-level interactions, GLM-5.1 can work independently and continuously on...

textcodecheap
202,752 ctx$0.95/1M in
Explore specs and pricingView details β†’