modelstop.top
Home/All Models

AI Model Catalogue

Browse 194 models across providers, modalities, and use cases.

🌐 All Models

194 models Β· Page 2 of 6

Llama 4 Scout 17B Instruct

meta

Llama 4 Scout 17B Instruct β€” available via AWS Bedrock (us-east-1).

textvisionmultimodal
ctxFree in
Explore specs and pricingView details β†’

Nova Canvas

amazon

Nova Canvas β€” available via AWS Bedrock (us-east-1).

textvisionmultimodal
ctxFree in
Explore specs and pricingView details β†’

Nova Premier

amazon

Nova Premier β€” available via AWS Bedrock (us-east-1).

textvisionmultimodal
ctxFree in
Explore specs and pricingView details β†’

Claude 3 Sonnet

anthropic

Claude 3 Sonnet β€” available via AWS Bedrock (us-east-1).

textvisionmultimodal
ctxFree in
Explore specs and pricingView details β†’

Nova Premier

amazon

Nova Premier β€” available via AWS Bedrock (us-east-1).

textvisionmultimodal
ctxFree in
Explore specs and pricingView details β†’

Nova Pro

amazon

Nova Pro β€” available via AWS Bedrock (us-east-1).

textvisionmultimodal
ctxFree in
Explore specs and pricingView details β†’

Pixtral Large (25.02)

mistral

Pixtral Large (25.02) β€” available via AWS Bedrock (us-east-1).

textvisionmultimodal
ctxFree in
Explore specs and pricingView details β†’

Nova Premier

amazon

Nova Premier β€” available via AWS Bedrock (us-east-1).

textvisionmultimodal
ctxFree in
Explore specs and pricingView details β†’

Gemma 3 4B IT

google

Gemma 3 4B IT β€” available via AWS Bedrock (us-east-1).

textvisionmultimodal
131,072 ctxFree in
Explore specs and pricingView details β†’

Claude Opus 4.5

anthropic

Claude Opus 4.5 β€” available via AWS Bedrock (us-east-1).

textvisionmultimodal
ctxFree in
Explore specs and pricingView details β†’

Claude Sonnet 4.5

anthropic

Claude Sonnet 4.5 β€” available via AWS Bedrock (us-east-1).

textvisionmultimodal
ctxFree in
Explore specs and pricingView details β†’

Titan Multimodal Embeddings G1

amazon

Titan Multimodal Embeddings G1 β€” available via AWS Bedrock (us-east-1).

textvisionmultimodal
ctxFree in
Explore specs and pricingView details β†’

Claude 3.7 Sonnet

anthropic

Claude 3.7 Sonnet β€” available via AWS Bedrock (us-east-1).

textvisionmultimodal
ctxFree in
Explore specs and pricingView details β†’

Writer Palmyra Vision 7B

writer

Writer Palmyra Vision 7B β€” available via AWS Bedrock (us-east-1).

textvisionmultimodal
ctxFree in
Explore specs and pricingView details β†’

Claude 3 Haiku

anthropic

Claude 3 Haiku β€” available via AWS Bedrock (us-east-1).

textvisionmultimodal
ctxFree in
Explore specs and pricingView details β†’

Stable Image Inpaint

stability

Stable Image Inpaint β€” available via AWS Bedrock (us-east-1).

textvisionmultimodal
ctxFree in
Explore specs and pricingView details β†’

Titan Multimodal Embeddings G1

amazon

Titan Multimodal Embeddings G1 β€” available via AWS Bedrock (us-east-1).

textvisionmultimodal
ctxFree in
Explore specs and pricingView details β†’

Amazon Nova Pro

amazon

Amazon Nova Pro is a highly capable multimodal model with the best combination of accuracy, speed, and cost across a wide range of tasks. Supports text, image, and video inputs.

visionmultimodallong-context
300,000 ctx$0.80/1M in
Explore specs and pricingView details β†’

Amazon Nova Lite

amazon

Amazon Nova Lite is a very low-cost multimodal model that can process image, video, and text inputs. Fast and accurate for a wide range of tasks requiring visual and language understanding.

visionmultimodalcheap
300,000 ctx$0.06/1M in
Explore specs and pricingView details β†’

Auto Router

openrouter

Your prompt will be processed by a meta-model and routed to one of dozens of models (see below), optimizing for the best possible output. To see which model was used,...

textvisionmultimodal
2,000,000 ctxFree in
Explore specs and pricingView details β†’

Anthropic: Claude 3 Haiku

anthropic

Claude 3 Haiku is Anthropic's fastest and most compact model for near-instant responsiveness. Quick and accurate targeted performance. See the launch announcement and benchmark results [here](https://www.anthropic.com/news/claude-3-haiku) #multimodal

textvisionmultimodal
Run locally
200,000 ctx$0.25/1M in
Explore specs and pricingView details β†’

OpenAI: GPT-4 Turbo

openai

The latest GPT-4 Turbo model with vision capabilities. Vision requests can now use JSON mode and function calling. Training data: up to December 2023.

textvisionmultimodal
Run locally
128,000 ctx$10.00/1M in
Explore specs and pricingView details β†’

OpenAI: GPT-4o

openai

GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of [GPT-4 Turbo](/models/openai/gpt-4-turbo) while being twice as...

textvisionmultimodal
Run locally
128,000 ctx$2.50/1M in
Explore specs and pricingView details β†’

OpenAI: GPT-4o (2024-05-13)

openai

GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of [GPT-4 Turbo](/models/openai/gpt-4-turbo) while being twice as...

textvisionmultimodal
Run locally
128,000 ctx$5.00/1M in
Explore specs and pricingView details β†’

OpenAI: GPT-4o-mini

openai

GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...

textvisionmultimodal
Run locally
128,000 ctx$0.15/1M in
Explore specs and pricingView details β†’

OpenAI: GPT-4o-mini (2024-07-18)

openai

GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...

textvisionmultimodal
128,000 ctx$0.15/1M in
Explore specs and pricingView details β†’

OpenAI: GPT-4o (2024-08-06)

openai

The 2024-08-06 version of GPT-4o offers improved performance in structured outputs, with the ability to supply a JSON schema in the respone_format. Read more [here](https://openai.com/index/introducing-structured-outputs-in-the-api/). GPT-4o ("o" for "omni") is...

textvisionmultimodal
128,000 ctx$2.50/1M in
Explore specs and pricingView details β†’

Meta: Llama 3.2 11B Vision Instruct

meta-llama

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...

textvisionmultimodal
131,072 ctx$0.24/1M in
Explore specs and pricingView details β†’

Anthropic: Claude 3.5 Haiku

anthropic

Claude 3.5 Haiku features offers enhanced capabilities in speed, coding accuracy, and tool use. Engineered to excel in real-time applications, it delivers quick response times that are essential for dynamic...

textvisionmultimodal
Run locally
200,000 ctx$0.80/1M in
Explore specs and pricingView details β†’

Mistral: Pixtral Large 2411

mistralai

Pixtral Large is a 124B parameter, open-weight, multimodal model built on top of [Mistral Large 2](/mistralai/mistral-large-2411). The model is able to understand documents, charts and natural images. The model is...

textvisionmultimodal
131,072 ctx$2.00/1M in
Explore specs and pricingView details β†’

OpenAI: GPT-4o (2024-11-20)

openai

The 2024-11-20 version of GPT-4o offers a leveled-up creative writing ability with more natural, engaging, and tailored writing to improve relevance & readability. It’s also better at working with uploaded...

textvisionmultimodal
Run locally
128,000 ctx$2.50/1M in
Explore specs and pricingView details β†’

Amazon: Nova Pro 1.0

amazon

Amazon Nova Pro 1.0 is a capable multimodal model from Amazon focused on providing a combination of accuracy, speed, and cost for a wide range of tasks. As of December...

textvisionmultimodal
Run locally
300,000 ctx$0.80/1M in
Explore specs and pricingView details β†’

Amazon: Nova Lite 1.0

amazon

Amazon Nova Lite 1.0 is a very low-cost multimodal model from Amazon that focused on fast processing of image, video, and text inputs to generate text output. Amazon Nova Lite...

textvisionimage
Run locally
300,000 ctx$0.06/1M in
Explore specs and pricingView details β†’

OpenAI: o1

openai

The latest and strongest model family from OpenAI, o1 is designed to spend more time thinking before responding. The o1 model series is trained with large-scale reinforcement learning to reason...

textvisionmultimodal
Run locally
200,000 ctx$15.00/1M in
Explore specs and pricingView details β†’

MiniMax: MiniMax-01

minimax

MiniMax-01 is a combines MiniMax-Text-01 for text generation and MiniMax-VL-01 for image understanding. It has 456 billion parameters, with 45.9 billion parameters activated per inference, and can handle a context...

textvisionimage
Run locally
1,000,192 ctx$0.20/1M in
Explore specs and pricingView details β†’

Perplexity: Sonar

perplexity

Sonar is lightweight, affordable, fast, and simple to use β€” now featuring citations and the ability to customize sources. It is designed for companies seeking to integrate lightweight question-and-answer features...

textvisionmultimodal
Run locally
127,072 ctx$1.00/1M in
Explore specs and pricingView details β†’