modelstop.top

AI Model Catalogue

Browse 2,392 models across providers, modalities, and use cases.

🌐 All Models

2,392 models · Page 5 of 67

mistral-7b-instruct-v0.2-lora

mistral

The Mistral-7B-Instruct-v0.2 Large Language Model (LLM) is an instruct fine-tuned version of the Mistral-7B-v0.2.

text · instruct · free
15,000 ctx · $0.00/1M in

whisper

openai

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification.

audio · multilingual · free
$0.00/1M in

flux

deepgram

Flux is the first conversational speech recognition model built specifically for voice agents.

audio · agents · free
$0.00/1M in

llama-2-7b-chat-fp16

meta

Half-precision (fp16) generative text model with 7 billion parameters from Meta.

text · cheap
4,096 ctx · $0.56/1M in

mistral-7b-instruct-v0.1

mistral

Instruct fine-tuned version of the Mistral-7B generative text model with 7 billion parameters.

text · instruct · cheap
2,824 ctx · $0.11/1M in

melotts

myshell-ai

MeloTTS is a high-quality multi-lingual text-to-speech library by MyShell.ai.

audio · free
$0.00/1M in

plamo-embedding-1b

pfnet

PLaMo-Embedding-1B is a Japanese text embedding model developed by Preferred Networks, Inc. It can convert Japanese text input into numerical vectors and can be used for a wide range of applications, including information retrieval, text classification, and clustering.

text · cheap
$0.02/1M in

flux-1-schnell

black-forest-labs

FLUX.1 [schnell] is a 12 billion parameter rectified flow transformer capable of generating images from text descriptions.

vision · image · free
$0.00/1M in

phoenix-1.0

leonardo

Phoenix 1.0 is a model by Leonardo.Ai that generates images with exceptional prompt adherence and coherent text.

vision · image · free
$0.00/1M in

stable-diffusion-v1-5-inpainting

runwayml

Stable Diffusion Inpainting is a latent text-to-image diffusion model capable of generating photo-realistic images given any text input, with the extra capability of inpainting the pictures by using a mask.

vision · image · free
$0.00/1M in

qwen1.5-7b-chat-awq

qwen

Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. AWQ is an efficient, accurate and blazing-fast low-bit weight quantization method, currently supporting 4-bit quantization.

text · free
20,000 ctx · $0.00/1M in

llama-3.2-3b-instruct

meta

The Llama 3.2 instruction-tuned text-only models are optimized for multilingual dialogue use cases, including agentic retrieval and summarization tasks.

text · agents · multilingual
80,000 ctx · $0.05/1M in

nova-3

deepgram

Transcribe audio using Deepgram's speech-to-text model.

audio · free
$0.00/1M in

llama-3-8b-instruct

meta

Generation over generation, Meta Llama 3 demonstrates state-of-the-art performance on a wide range of industry benchmarks and offers new capabilities, including improved reasoning.

text · reasoning · instruct
7,968 ctx · $0.28/1M in

flux-2-klein-9b

black-forest-labs

FLUX.2 [klein] 9B is a 9 billion parameter model that can generate images from text descriptions and supports multi-reference editing capabilities.

vision · image · free
$0.00/1M in

kimi-k2.5

moonshotai

Kimi K2.5 is a frontier-scale open-source model with a 256k context window, multi-turn tool calling, vision inputs, and structured outputs for agentic workloads.

text · vision · agents
256,000 ctx · $0.60/1M in

llama-guard-3-8b

meta

Llama Guard 3 is a Llama-3.1-8B pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classify content in both LLM inputs (prompt classification) and in LLM responses (response classification). It acts as an LLM: it generates text in its output that indicates whether a given prompt or response is safe or unsafe, and if unsafe, it also lists the content categories violated.

text · cheap · long-context
131,072 ctx · $0.48/1M in

qwen1.5-0.5b-chat

qwen

Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud.

text · free
32,000 ctx · $0.00/1M in

bge-m3

baai

Multi-Functionality, Multi-Linguality, and Multi-Granularity embeddings model.

text · cheap
60,000 ctx · $0.01/1M in

gpt-oss-120b

openai

OpenAI's open-weight models are designed for powerful reasoning, agentic tasks, and versatile developer use cases; gpt-oss-120b is for production, general-purpose, high-reasoning use cases.

text · reasoning · agents
128,000 ctx · $0.35/1M in

gemma-2b-it-lora

google

This is a Gemma-2B base model that Cloudflare dedicates for inference with LoRA adapters. Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.

text · free
8,192 ctx · $0.00/1M in

tinyllama-1.1b-chat-v1.0

tinyllama

The TinyLlama project aims to pretrain a 1.1B Llama model on 3 trillion tokens. This is the chat model finetuned on top of TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T.

text · free
2,048 ctx · $0.00/1M in

deepseek-r1-distill-qwen-32b

deepseek-ai

DeepSeek-R1-Distill-Qwen-32B is a model distilled from DeepSeek-R1 based on Qwen2.5. It outperforms OpenAI-o1-mini across various benchmarks, achieving new state-of-the-art results for dense models.

text · cheap
80,000 ctx · $0.50/1M in

stable-diffusion-xl-base-1.0

stabilityai

Diffusion-based text-to-image generative model by Stability AI. Generates and modifies images based on text prompts.

vision · image · free
$0.00/1M in

m2m100-1.2b

meta

Multilingual encoder-decoder (seq-to-seq) model trained for Many-to-Many multilingual translation

text · multilingual · cheap
$0.34/1M in

distilbert-sst-2-int8

huggingface

Distilled BERT model that was finetuned on SST-2 for sentiment classification

text · cheap
$0.03/1M in

nemotron-3-120b-a12b

nvidia

NVIDIA Nemotron 3 Super is a hybrid MoE model with leading accuracy for multi-agent applications and specialized agentic AI systems.

text · agents · cheap
256,000 ctx · $0.50/1M in

qwen2.5-coder-32b-instruct

qwen

Qwen2.5-Coder is the latest series of code-specific Qwen large language models (formerly known as CodeQwen). As of now, Qwen2.5-Coder covers six mainstream model sizes (0.5, 1.5, 3, 7, 14, and 32 billion parameters) to meet the needs of different developers. Qwen2.5-Coder brings the following improvements upon CodeQwen1.5:

text · code · instruct
32,768 ctx · $0.66/1M in

smart-turn-v2

pipecat-ai

An open-source, community-driven, native audio turn detection model (version 2).

text · audio · free
$0.00/1M in

deepseek-math-7b-instruct

deepseek-ai

DeepSeekMath-Instruct 7B is an instruction-tuned mathematics model derived from DeepSeekMath-Base 7B. DeepSeekMath is initialized with DeepSeek-Coder-v1.5 7B and continues pre-training on math-related tokens sourced from Common Crawl, together with natural language and code data, for 500B tokens.

text · code · instruct
4,096 ctx · $0.00/1M in

indictrans2-en-indic-1B

ai4bharat

IndicTrans2 is the first open-source transformer-based multilingual NMT model that supports high-quality translations across all 22 scheduled Indic languages.

text · multilingual · cheap
$0.34/1M in

flux-2-klein-4b

black-forest-labs

FLUX.2 [klein] is an ultra-fast, distilled image model. It unifies image generation and editing in a single model, delivering state-of-the-art quality enabling interactive workflows, real-time previews, and latency-critical applications.

vision · image · free
$0.00/1M in

qwen3-embedding-0.6b

qwen

The Qwen3 Embedding model series is the latest proprietary model of the Qwen family, specifically designed for text embedding and ranking tasks.

text · cheap
8,192 ctx · $0.01/1M in

bge-small-en-v1.5

baai

BAAI general embedding (Small) model that transforms any given text into a 384-dimensional vector

text · cheap
$0.02/1M in

falcon-7b-instruct

tiiuae

Falcon-7B-Instruct is a 7B-parameter causal decoder-only model built by TII, based on Falcon-7B and fine-tuned on a mixture of chat/instruct datasets.

text · instruct · free
4,096 ctx · $0.00/1M in

llama-3.2-1b-instruct

meta

The Llama 3.2 instruction-tuned text-only models are optimized for multilingual dialogue use cases, including agentic retrieval and summarization tasks.

text · agents · multilingual
60,000 ctx · $0.03/1M in