modelstop.top

AI Model Catalogue

Browse 352 models across providers, modalities, and use cases.

All Models

352 models · Page 5 of 10

Meta: Llama 3 70B Instruct

meta-llama

Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 70B instruct-tuned version was optimized for high-quality dialogue use cases. It has demonstrated strong...

text, instruct, cheap
8,192 ctx · $0.51/1M in

Meta: Llama 3 8B Instruct

meta-llama

Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 8B instruct-tuned version was optimized for high-quality dialogue use cases. It has demonstrated strong...

text, instruct, cheap
8,192 ctx · $0.03/1M in

NousResearch: Hermes 2 Pro - Llama-3 8B

nousresearch

Hermes 2 Pro is an upgraded, retrained version of Nous Hermes 2, consisting of an updated and cleaned version of the OpenHermes 2.5 Dataset, as well as a newly introduced...

text, cheap
8,192 ctx · $0.14/1M in

Google: Gemma 2 9B

google

Gemma 2 9B by Google is an advanced, open-source language model that sets a new standard for efficiency and performance in its size class. Designed for a wide variety of...

cheap
8,192 ctx · $0.03/1M in

Google: Gemma 2 27B

google

Gemma 2 27B by Google is an open model built from the same research and technology used to create the [Gemini models](/models?q=gemini). Gemma models are well-suited for a variety of...

text, cheap
8,192 ctx · $0.65/1M in

OpenAI: GPT-4o-mini

openai

GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...

text, vision, multimodal
128,000 ctx · $0.15/1M in

OpenAI: GPT-4o-mini (2024-07-18)

openai

GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...

text, vision, multimodal
128,000 ctx · $0.15/1M in

Mistral: Mistral Nemo

mistralai

A 12B parameter model with a 128k token context length built by Mistral in collaboration with NVIDIA. The model is multilingual, supporting English, French, German, Spanish, Italian, Portuguese, Chinese, Japanese,...

text, multilingual, cheap
131,072 ctx · $0.02/1M in

Meta: Llama 3.1 70B Instruct

meta-llama

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 70B instruct-tuned version is optimized for high-quality dialogue use cases. It has demonstrated strong...

text, instruct, cheap
131,072 ctx · $0.40/1M in

Meta: Llama 3.1 8B Instruct

meta-llama

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 8B instruct-tuned version is fast and efficient. It has demonstrated strong performance compared to...

text, instruct, cheap
16,384 ctx · $0.02/1M in

Sao10K: Llama 3 8B Lunaris

sao10k

Lunaris 8B is a versatile generalist and roleplaying model based on Llama 3. It's a strategic merge of multiple models, designed to balance creativity with improved logic and general knowledge....

text, cheap
8,192 ctx · $0.04/1M in

Nous: Hermes 3 70B Instruct

nousresearch

Hermes 3 is a generalist language model with many improvements over [Hermes 2](/models/nousresearch/nous-hermes-2-mistral-7b-dpo), including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...

text, reasoning, agents
131,072 ctx · $0.30/1M in

Sao10K: Llama 3.1 Euryale 70B v2.2

sao10k

Euryale L3.1 70B v2.2 is a model focused on creative roleplay from [Sao10k](https://ko-fi.com/sao10k). It is the successor of [Euryale L3 70B v2.1](/models/sao10k/l3-euryale-70b).

text, cheap, long-context
131,072 ctx · $0.85/1M in

Cohere: Command R (08-2024)

cohere

command-r-08-2024 is an update of the [Command R](/models/cohere/command-r) with improved performance for multilingual retrieval-augmented generation (RAG) and tool use. More broadly, it is better at math, code and reasoning and...

text, code, reasoning
128,000 ctx · $0.15/1M in

Qwen: Qwen2.5 72B Instruct

qwen

Qwen2.5 72B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and...

text, code, instruct
131,072 ctx · $0.12/1M in

Meta: Llama 3.2 11B Vision Instruct

meta-llama

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...

text, vision, multimodal
131,072 ctx · $0.24/1M in

Meta: Llama 3.2 1B Instruct

meta-llama

Llama 3.2 1B is a 1-billion-parameter language model focused on efficiently performing natural language tasks, such as summarization, dialogue, and multilingual text analysis. Its smaller size allows it to operate...

text, multilingual, instruct
131,072 ctx · $0.03/1M in

TheDrummer: Rocinante 12B

thedrummer

Rocinante 12B is designed for engaging storytelling and rich prose. Early testers have reported: - Expanded vocabulary with unique and expressive word choices - Enhanced creativity for vivid narratives -...

text, cheap
32,768 ctx · $0.17/1M in

Qwen: Qwen2.5 7B Instruct

qwen

Qwen2.5 7B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and...

text, code, instruct
131,072 ctx · $0.04/1M in

Anthropic: Claude 3.5 Haiku

anthropic

Claude 3.5 Haiku offers enhanced capabilities in speed, coding accuracy, and tool use. Engineered to excel in real-time applications, it delivers quick response times that are essential for dynamic...

text, vision, multimodal
200,000 ctx · $0.80/1M in

TheDrummer: UnslopNemo 12B

thedrummer

UnslopNemo v4.1 is the latest addition from the creator of Rocinante, designed for adventure writing and role-play scenarios.

text, cheap
32,768 ctx · $0.40/1M in

Qwen: Qwen2.5 Coder 32B Instruct

qwen

Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly known as CodeQwen). Qwen2.5-Coder brings the following improvements upon CodeQwen1.5: - Significant improvements in **code generation**, **code reasoning**...

text, code, reasoning
128,000 ctx · $0.66/1M in

Amazon: Nova Pro 1.0

amazon

Amazon Nova Pro 1.0 is a capable multimodal model from Amazon focused on providing a combination of accuracy, speed, and cost for a wide range of tasks. As of December...

text, vision, multimodal
300,000 ctx · $0.80/1M in

Amazon: Nova Micro 1.0

amazon

Amazon Nova Micro 1.0 is a text-only model that delivers the lowest latency responses in the Amazon Nova family of models at a very low cost. With a context length...

text, cheap, long-context
128,000 ctx · $0.04/1M in

Amazon: Nova Lite 1.0

amazon

Amazon Nova Lite 1.0 is a very low-cost multimodal model from Amazon focused on fast processing of image, video, and text inputs to generate text output. Amazon Nova Lite...

text, vision, image
300,000 ctx · $0.06/1M in

Cohere: Command R7B (12-2024)

cohere

Command R7B (12-2024) is a small, fast update of the Command R+ model, delivered in December 2024. It excels at RAG, tool use, agents, and similar tasks requiring complex reasoning...

text, reasoning, agents
128,000 ctx · $0.04/1M in

Sao10K: Llama 3.3 Euryale 70B

sao10k

Euryale L3.3 70B is a model focused on creative roleplay from [Sao10k](https://ko-fi.com/sao10k). It is the successor of [Euryale L3 70B v2.2](/models/sao10k/l3-euryale-70b).

text, cheap, long-context
131,072 ctx · $0.65/1M in

DeepSeek: DeepSeek V3

deepseek

DeepSeek-V3 is the latest model from the DeepSeek team, building upon the instruction following and coding abilities of the previous versions. Pre-trained on nearly 15 trillion tokens, the reported evaluations...

text, code, cheap
131,072 ctx · $0.32/1M in

Microsoft: Phi 4

microsoft

[Microsoft Research](/microsoft) Phi-4 is designed to perform well in complex reasoning tasks and can operate efficiently in situations with limited memory or where quick responses are needed. At 14 billion...

text, reasoning, cheap
16,384 ctx · $0.07/1M in

MiniMax: MiniMax-01

minimax

MiniMax-01 combines MiniMax-Text-01 for text generation and MiniMax-VL-01 for image understanding. It has 456 billion parameters, with 45.9 billion parameters activated per inference, and can handle a context...

text, vision, image
1,000,192 ctx · $0.20/1M in

DeepSeek: R1

deepseek

DeepSeek R1 is here: Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active in an inference pass....

text, reasoning, cheap
64,000 ctx · $0.70/1M in

DeepSeek: R1 Distill Llama 70B

deepseek

DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). The model combines advanced distillation techniques to achieve high performance across...

text, instruct, cheap
131,072 ctx · $0.70/1M in

Perplexity: Sonar

perplexity

Sonar is lightweight, affordable, fast, and simple to use, now featuring citations and the ability to customize sources. It is designed for companies seeking to integrate lightweight question-and-answer features...

text, vision, multimodal
127,072 ctx · $1.00/1M in

DeepSeek: R1 Distill Qwen 32B

deepseek

DeepSeek R1 Distill Qwen 32B is a distilled large language model based on [Qwen 2.5 32B](https://huggingface.co/Qwen/Qwen2.5-32B), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). It outperforms OpenAI's o1-mini across various benchmarks, achieving new...

text, cheap, long-context
128,000 ctx · $0.29/1M in

Mistral: Mistral Small 3

mistralai

Mistral Small 3 is a 24B-parameter language model optimized for low-latency performance across common AI tasks. Released under the Apache 2.0 license, it features both pre-trained and instruction-tuned versions designed...

text, cheap
32,768 ctx · $0.05/1M in

Qwen: Qwen-Plus

qwen

Qwen-Plus, based on the Qwen2.5 foundation model, is a 131K context model with a balanced performance, speed, and cost combination.

text, cheap, long-context
1,000,000 ctx · $0.26/1M in