modelstop.top
Home/All Models

AI Model Catalogue

Browse 287 models across providers, modalities, and use cases.

🌐 All Models

287 models Β· Page 4 of 8

Amazon: Nova Micro 1.0

amazon

Amazon Nova Micro 1.0 is a text-only model that delivers the lowest latency responses in the Amazon Nova family of models at a very low cost. With a context length...

textcheaplong-context
128,000 ctx$0.04/1M in
Explore specs and pricingView details β†’

Amazon: Nova Lite 1.0

amazon

Amazon Nova Lite 1.0 is a very low-cost multimodal model from Amazon that focused on fast processing of image, video, and text inputs to generate text output. Amazon Nova Lite...

textvisionimage
300,000 ctx$0.06/1M in
Explore specs and pricingView details β†’

Cohere: Command R7B (12-2024)

cohere

Command R7B (12-2024) is a small, fast update of the Command R+ model, delivered in December 2024. It excels at RAG, tool use, agents, and similar tasks requiring complex reasoning...

textreasoningagents
128,000 ctx$0.04/1M in
Explore specs and pricingView details β†’

Sao10K: Llama 3.3 Euryale 70B

sao10k

Euryale L3.3 70B is a model focused on creative roleplay from [Sao10k](https://ko-fi.com/sao10k). It is the successor of [Euryale L3 70B v2.2](/models/sao10k/l3-euryale-70b).

textcheaplong-context
131,072 ctx$0.65/1M in
Explore specs and pricingView details β†’

DeepSeek: DeepSeek V3

deepseek

DeepSeek-V3 is the latest model from the DeepSeek team, building upon the instruction following and coding abilities of the previous versions. Pre-trained on nearly 15 trillion tokens, the reported evaluations...

textcodecheap
163,840 ctx$0.32/1M in
Explore specs and pricingView details β†’

Microsoft: Phi 4

microsoft

[Microsoft Research](/microsoft) Phi-4 is designed to perform well in complex reasoning tasks and can operate efficiently in situations with limited memory or where quick responses are needed. At 14 billion...

textreasoningcheap
16,384 ctx$0.07/1M in
Explore specs and pricingView details β†’

MiniMax: MiniMax-01

minimax

MiniMax-01 is a combines MiniMax-Text-01 for text generation and MiniMax-VL-01 for image understanding. It has 456 billion parameters, with 45.9 billion parameters activated per inference, and can handle a context...

textvisionimage
1,000,192 ctx$0.20/1M in
Explore specs and pricingView details β†’

DeepSeek: R1

deepseek

DeepSeek R1 is here: Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active in an inference pass....

textreasoningcheap
64,000 ctx$0.70/1M in
Explore specs and pricingView details β†’

DeepSeek: R1 Distill Llama 70B

deepseek

DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). The model combines advanced distillation techniques to achieve high performance across...

textinstructcheap
131,072 ctx$0.70/1M in
Explore specs and pricingView details β†’

Perplexity: Sonar

perplexity

Sonar is lightweight, affordable, fast, and simple to use β€” now featuring citations and the ability to customize sources. It is designed for companies seeking to integrate lightweight question-and-answer features...

textvisionmultimodal
127,072 ctx$1.00/1M in
Explore specs and pricingView details β†’

DeepSeek: R1 Distill Qwen 32B

deepseek

DeepSeek R1 Distill Qwen 32B is a distilled large language model based on [Qwen 2.5 32B](https://huggingface.co/Qwen/Qwen2.5-32B), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). It outperforms OpenAI's o1-mini across various benchmarks, achieving new...

textcheap
32,768 ctx$0.29/1M in
Explore specs and pricingView details β†’

Mistral: Mistral Small 3

mistralai

Mistral Small 3 is a 24B-parameter language model optimized for low-latency performance across common AI tasks. Released under the Apache 2.0 license, it features both pre-trained and instruction-tuned versions designed...

textcheap
32,768 ctx$0.05/1M in
Explore specs and pricingView details β†’

Qwen: Qwen-Plus

qwen

Qwen-Plus, based on the Qwen2.5 foundation model, is a 131K context model with a balanced performance, speed, and cost combination.

textcheaplong-context
1,000,000 ctx$0.26/1M in
Explore specs and pricingView details β†’

Qwen: Qwen2.5 VL 72B Instruct

qwen

Qwen2.5-VL is proficient in recognizing common objects such as flowers, birds, fish, and insects. It is also highly capable of analyzing texts, charts, icons, graphics, and layouts within images.

textvisionmultimodal
32,000 ctx$0.80/1M in
Explore specs and pricingView details β†’

Qwen: Qwen-Turbo

qwen

Qwen-Turbo, based on Qwen2.5, is a 1M context model that provides fast speed and low cost, suitable for simple tasks.

textcheaplong-context
131,072 ctx$0.03/1M in
Explore specs and pricingView details β†’

Qwen: Qwen VL Max

qwen

Qwen VL Max is a visual understanding model with 7500 tokens context length. It excels in delivering optimal performance for a broader spectrum of complex tasks.

textvisionmultimodal
131,072 ctx$0.52/1M in
Explore specs and pricingView details β†’

AionLabs: Aion-RP 1.0 (8B)

aion-labs

Aion-RP-Llama-3.1-8B ranks the highest in the character evaluation portion of the RPBench-Auto benchmark, a roleplaying-specific variant of Arena-Hard-Auto, where LLMs evaluate each other’s responses. It is a fine-tuned base model...

textcheap
32,768 ctx$0.80/1M in
Explore specs and pricingView details β†’

AionLabs: Aion-1.0-Mini

aion-labs

Aion-1.0-Mini 32B parameter model is a distilled version of the DeepSeek-R1 model, designed for strong performance in reasoning domains such as mathematics, coding, and logic. It is a modified variant...

textcodereasoning
131,072 ctx$0.70/1M in
Explore specs and pricingView details β†’

Qwen: Qwen VL Plus

qwen

Qwen's Enhanced Large Visual Language Model. Significantly upgraded for detailed recognition capabilities and text recognition abilities, supporting ultra-high pixel resolutions up to millions of pixels and extreme aspect ratios for...

textvisionmultimodal
131,072 ctx$0.14/1M in
Explore specs and pricingView details β†’

Google: Gemini 2.0 Flash

google

Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while maintaining quality on par with larger models like [Gemini Pro 1.5](/google/gemini-pro-1.5). It...

textvisionmultimodal
1,000,000 ctx$0.10/1M in
Explore specs and pricingView details β†’

Llama Guard 3 8B

meta-llama

Llama Guard 3 is a Llama-3.1-8B pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classify content in both LLM inputs (prompt classification)...

textcheaplong-context
131,072 ctx$0.02/1M in
Explore specs and pricingView details β†’

Mistral: Saba

mistralai

Mistral Saba is a 24B-parameter language model specifically designed for the Middle East and South Asia, delivering accurate and contextually relevant responses while maintaining efficient performance. Trained on curated regional...

textcheap
32,768 ctx$0.20/1M in
Explore specs and pricingView details β†’

Google: Gemini 2.0 Flash Lite

google

Gemini 2.0 Flash Lite offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while maintaining quality on par with larger models like [Gemini Pro 1.5](/google/gemini-pro-1.5),...

textvisionmultimodal
1,048,576 ctx$0.07/1M in
Explore specs and pricingView details β†’

Qwen: QwQ 32B

qwen

QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ, which is capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks,...

textreasoningcheap
131,072 ctx$0.15/1M in
Explore specs and pricingView details β†’

TheDrummer: Skyfall 36B V2

thedrummer

Skyfall 36B v2 is an enhanced iteration of Mistral Small 2501, specifically fine-tuned for improved creativity, nuanced writing, role-playing, and coherent storytelling.

textcheap
32,768 ctx$0.55/1M in
Explore specs and pricingView details β†’

Reka Flash 3

rekaai

Reka Flash 3 is a general-purpose, instruction-tuned large language model with 21 billion parameters, developed by Reka. It excels at general chat, coding tasks, instruction-following, and function calling. Featuring a...

textcodecheap
65,536 ctx$0.10/1M in
Explore specs and pricingView details β†’

OpenAI: GPT-4o-mini Search Preview

openai

GPT-4o mini Search Preview is a specialized model for web search in Chat Completions. It is trained to understand and execute web search queries.

textcheaplong-context
128,000 ctx$0.15/1M in
Explore specs and pricingView details β†’

AllenAI: Olmo 2 32B Instruct

allenai

OLMo-2 32B Instruct is a supervised instruction-finetuned variant of the OLMo-2 32B March 2025 base model. It excels in complex reasoning and instruction-following tasks across diverse benchmarks such as GSM8K,...

textreasoninginstruct
128,000 ctx$0.05/1M in
Explore specs and pricingView details β†’

Mistral: Mistral Small 3.1 24B

mistralai

Mistral Small 3.1 24B Instruct is an upgraded variant of Mistral Small 3 (2501), featuring 24 billion parameters with advanced multimodal capabilities. It provides state-of-the-art performance in text-based reasoning and...

textvisionmultimodal
128,000 ctx$0.03/1M in
Explore specs and pricingView details β†’

DeepSeek: DeepSeek V3 0324

deepseek

DeepSeek V3, a 685B-parameter, mixture-of-experts model, is the latest iteration of the flagship chat model family from the DeepSeek team. It succeeds the [DeepSeek V3](/deepseek/deepseek-chat-v3) model and performs really well...

textcheaplong-context
163,840 ctx$0.20/1M in
Explore specs and pricingView details β†’

Qwen: Qwen2.5 VL 32B Instruct

qwen

Qwen2.5-VL-32B is a multimodal vision-language model fine-tuned through reinforcement learning for enhanced mathematical reasoning, structured outputs, and visual problem-solving capabilities. It excels at visual analysis tasks, including object recognition, textual...

textvisionmultimodal
128,000 ctx$0.20/1M in
Explore specs and pricingView details β†’

Meta: Llama 4 Scout

meta-llama

Llama 4 Scout 17B Instruct (16E) is a mixture-of-experts (MoE) language model developed by Meta, activating 17 billion parameters out of a total of 109B. It supports native multimodal input...

textvisionmultimodal
327,680 ctx$0.08/1M in
Explore specs and pricingView details β†’

Meta: Llama 4 Maverick

meta-llama

Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward...

textvisionmultimodal
1,048,576 ctx$0.15/1M in
Explore specs and pricingView details β†’

NVIDIA: Llama 3.1 Nemotron Ultra 253B v1

nvidia

Llama-3.1-Nemotron-Ultra-253B-v1 is a large language model (LLM) optimized for advanced reasoning, human-interactive chat, retrieval-augmented generation (RAG), and tool-calling tasks. Derived from Meta’s Llama-3.1-405B-Instruct, it has been significantly customized using Neural...

reasoninginstructcheap
131,072 ctx$0.60/1M in
Explore specs and pricingView details β†’

xAI: Grok 3 Mini Beta

x-ai

Grok 3 Mini is a lightweight, smaller thinking model. Unlike traditional models that generate answers immediately, Grok 3 Mini thinks before responding. It’s ideal for reasoning-heavy tasks that don’t demand...

textreasoningcheap
131,072 ctx$0.30/1M in
Explore specs and pricingView details β†’

AlfredPros: CodeLLaMa 7B Instruct Solidity

alfredpros

A finetuned 7 billion parameters Code LLaMA - Instruct model to generate Solidity smart contract using 4-bit QLoRA finetuning provided by PEFT library.

textcodeinstruct
4,096 ctx$0.80/1M in
Explore specs and pricingView details β†’