modelstop.top

AI Model Catalogue

Browse 1,750 models across providers, modalities, and use cases.

🌐 All Models

1,750 models · Page 5 of 49

llama-3-8b-instruct

meta

Generation over generation, Meta Llama 3 demonstrates state-of-the-art performance on a wide range of industry benchmarks and offers new capabilities, including improved reasoning.

text · reasoning · instruct
7,968 ctx · $0.28/1M in

gemma-2b-it-lora

google

This is a Gemma-2B base model that Cloudflare dedicates to inference with LoRA adapters. Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.

text · free
8,192 ctx · $0.00/1M in

kimi-k2.5

moonshotai

Kimi K2.5 is a frontier-scale open-source model with a 256k context window, multi-turn tool calling, vision inputs, and structured outputs for agentic workloads.

text · vision · agents
256,000 ctx · $0.60/1M in

qwen1.5-0.5b-chat

qwen

Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud.

text · free
32,000 ctx · $0.00/1M in

bge-m3

baai

Multi-Functionality, Multi-Linguality, and Multi-Granularity embeddings model.

text · cheap
60,000 ctx · $0.01/1M in

plamo-embedding-1b

pfnet

PLaMo-Embedding-1B is a Japanese text embedding model developed by Preferred Networks, Inc. It can convert Japanese text input into numerical vectors and can be used for a wide range of applications, including information retrieval, text classification, and clustering.

text · cheap
ctx · $0.02/1M in

qwen1.5-7b-chat-awq

qwen

Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. AWQ is an efficient, accurate and blazing-fast low-bit weight quantization method, currently supporting 4-bit quantization.

text · free
20,000 ctx · $0.00/1M in

distilbert-sst-2-int8

huggingface

Distilled BERT model fine-tuned on SST-2 for sentiment classification.

text · cheap
ctx · $0.03/1M in

llama-guard-3-8b

meta

Llama Guard 3 is a Llama-3.1-8B pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classify content in both LLM inputs (prompt classification) and in LLM responses (response classification). It acts as an LLM: it generates text in its output that indicates whether a given prompt or response is safe or unsafe, and if unsafe, it also lists the content categories violated.

text · cheap · long-context
131,072 ctx · $0.48/1M in

llama-3.2-1b-instruct

meta

The Llama 3.2 instruction-tuned text only models are optimized for multilingual dialogue use cases, including agentic retrieval and summarization tasks.

text · agents · multilingual
60,000 ctx · $0.03/1M in

kimi-k2.6

moonshotai

Kimi K2.6 is a frontier-scale open-source 1T parameter model with a 262.1k context window, multi-turn tool calling, vision inputs, and structured outputs for agentic workloads.

text · vision · agents
262,144 ctx · $0.95/1M in

deepseek-r1-distill-qwen-32b

deepseek-ai

DeepSeek-R1-Distill-Qwen-32B is a model distilled from DeepSeek-R1 based on Qwen2.5. It outperforms OpenAI-o1-mini across various benchmarks, achieving new state-of-the-art results for dense models.

text · cheap
80,000 ctx · $0.50/1M in

smart-turn-v2

pipecat-ai

An open-source, community-driven, native audio turn detection model, now in its second version.

text · audio · free
ctx · $0.00/1M in

qwen3-embedding-0.6b

qwen

The Qwen3 Embedding model series is the latest proprietary model of the Qwen family, specifically designed for text embedding and ranking tasks.

text · cheap
8,192 ctx · $0.01/1M in

gpt-oss-120b

openai

OpenAI's open-weight models are designed for powerful reasoning, agentic tasks, and versatile developer use cases; gpt-oss-120b is for production, general-purpose, high-reasoning use cases.

text · reasoning · agents
128,000 ctx · $0.35/1M in

tinyllama-1.1b-chat-v1.0

tinyllama

The TinyLlama project aims to pretrain a 1.1B Llama model on 3 trillion tokens. This is the chat model finetuned on top of TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T.

text · free
2,048 ctx · $0.00/1M in

Nova Micro

amazon

Nova Micro is available via AWS Bedrock (us-east-1).

text · free
ctx · Free in

Mistral Small (24.02)

mistral

Mistral Small (24.02) is available via AWS Bedrock (us-east-1).

text · free
ctx · Free in

Mistral 7B Instruct

mistral

Mistral 7B Instruct is available via AWS Bedrock (us-east-1).

text · instruct · free
ctx · Free in

Claude Sonnet 4.5

anthropic

Claude Sonnet 4.5 is available via AWS Bedrock (us-east-1).

text · vision · multimodal
ctx · Free in

Claude 3.5 Haiku

anthropic

Claude 3.5 Haiku is available via AWS Bedrock (us-east-1).

text · free
ctx · Free in

Claude 3 Haiku

anthropic

Claude 3 Haiku is available via AWS Bedrock (us-east-1).

text · vision · multimodal
ctx · Free in

Command R+

cohere

Command R+ is available via AWS Bedrock (us-east-1).

text · free
ctx · Free in

Titan Embeddings G1 - Text

amazon

Titan Embeddings G1 - Text is available via AWS Bedrock (us-east-1).

text · free
ctx · Free in

Nova Premier

amazon

Nova Premier is available via AWS Bedrock (us-east-1).

text · vision · multimodal
ctx · Free in

Llama 3 70B Instruct

meta

Llama 3 70B Instruct is available via AWS Bedrock (us-east-1).

text · instruct · free
ctx · Free in

Llama 4 Maverick 17B Instruct

meta

Llama 4 Maverick 17B Instruct is available via AWS Bedrock (us-east-1).

text · vision · multimodal
ctx · Free in

Llama 3.1 8B Instruct

meta

Llama 3.1 8B Instruct is available via AWS Bedrock (us-east-1).

text · instruct · free
ctx · Free in

Command R

cohere

Command R is available via AWS Bedrock (us-east-1).

text · free
ctx · Free in

Mixtral 8x7B Instruct

mistral

Mixtral 8x7B Instruct is available via AWS Bedrock (us-east-1).

text · instruct · free
ctx · Free in

Titan Text Embeddings V2

amazon

Titan Text Embeddings V2 is available via AWS Bedrock (us-east-1).

text · free
ctx · Free in

Titan Multimodal Embeddings G1

amazon

Titan Multimodal Embeddings G1 is available via AWS Bedrock (us-east-1).

text · vision · multimodal
ctx · Free in

Claude 3.7 Sonnet

anthropic

Claude 3.7 Sonnet is available via AWS Bedrock (us-east-1).

text · vision · multimodal
ctx · Free in

Gemma 3 4B IT

google

Gemma 3 4B IT is available via AWS Bedrock (us-east-1).

text · vision · multimodal
131,072 ctx · Free in

Llama 3.2 1B Instruct

meta

Llama 3.2 1B Instruct is available via AWS Bedrock (us-east-1).

text · instruct · free
ctx · Free in

gpt-oss-20b

openai

gpt-oss-20b is available via AWS Bedrock (us-east-1).

text · free
ctx · Free in