π All Models
194 models Β· Page 2 of 6
Llama 4 Scout 17B Instruct
Llama 4 Scout 17B Instruct β available via AWS Bedrock (us-east-1).
Nova Canvas
Nova Canvas β available via AWS Bedrock (us-east-1).
Nova Premier
Nova Premier β available via AWS Bedrock (us-east-1).
Claude 3 Sonnet
Claude 3 Sonnet β available via AWS Bedrock (us-east-1).
Nova Premier
Nova Premier β available via AWS Bedrock (us-east-1).
Nova Pro
Nova Pro β available via AWS Bedrock (us-east-1).
Pixtral Large (25.02)
Pixtral Large (25.02) β available via AWS Bedrock (us-east-1).
Nova Premier
Nova Premier β available via AWS Bedrock (us-east-1).
Gemma 3 4B IT
Gemma 3 4B IT β available via AWS Bedrock (us-east-1).
Claude Opus 4.5
Claude Opus 4.5 β available via AWS Bedrock (us-east-1).
Claude Sonnet 4.5
Claude Sonnet 4.5 β available via AWS Bedrock (us-east-1).
Titan Multimodal Embeddings G1
Titan Multimodal Embeddings G1 β available via AWS Bedrock (us-east-1).
Claude 3.7 Sonnet
Claude 3.7 Sonnet β available via AWS Bedrock (us-east-1).
Writer Palmyra Vision 7B
Writer Palmyra Vision 7B β available via AWS Bedrock (us-east-1).
Claude 3 Haiku
Claude 3 Haiku β available via AWS Bedrock (us-east-1).
Stable Image Inpaint
Stable Image Inpaint β available via AWS Bedrock (us-east-1).
Titan Multimodal Embeddings G1
Titan Multimodal Embeddings G1 β available via AWS Bedrock (us-east-1).
Amazon Nova Pro
Amazon Nova Pro is a highly capable multimodal model with the best combination of accuracy, speed, and cost across a wide range of tasks. Supports text, image, and video inputs.
Amazon Nova Lite
Amazon Nova Lite is a very low-cost multimodal model that can process image, video, and text inputs. Fast and accurate for a wide range of tasks requiring visual and language understanding.
Auto Router
Your prompt will be processed by a meta-model and routed to one of dozens of models (see below), optimizing for the best possible output. To see which model was used,...
Anthropic: Claude 3 Haiku
Claude 3 Haiku is Anthropic's fastest and most compact model for near-instant responsiveness. Quick and accurate targeted performance. See the launch announcement and benchmark results [here](https://www.anthropic.com/news/claude-3-haiku) #multimodal
OpenAI: GPT-4 Turbo
The latest GPT-4 Turbo model with vision capabilities. Vision requests can now use JSON mode and function calling. Training data: up to December 2023.
OpenAI: GPT-4o
GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of [GPT-4 Turbo](/models/openai/gpt-4-turbo) while being twice as...
OpenAI: GPT-4o (2024-05-13)
GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of [GPT-4 Turbo](/models/openai/gpt-4-turbo) while being twice as...
OpenAI: GPT-4o-mini
GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...
OpenAI: GPT-4o-mini (2024-07-18)
GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...
OpenAI: GPT-4o (2024-08-06)
The 2024-08-06 version of GPT-4o offers improved performance in structured outputs, with the ability to supply a JSON schema in the respone_format. Read more [here](https://openai.com/index/introducing-structured-outputs-in-the-api/). GPT-4o ("o" for "omni") is...
Meta: Llama 3.2 11B Vision Instruct
Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...
Anthropic: Claude 3.5 Haiku
Claude 3.5 Haiku features offers enhanced capabilities in speed, coding accuracy, and tool use. Engineered to excel in real-time applications, it delivers quick response times that are essential for dynamic...
Mistral: Pixtral Large 2411
Pixtral Large is a 124B parameter, open-weight, multimodal model built on top of [Mistral Large 2](/mistralai/mistral-large-2411). The model is able to understand documents, charts and natural images. The model is...
OpenAI: GPT-4o (2024-11-20)
The 2024-11-20 version of GPT-4o offers a leveled-up creative writing ability with more natural, engaging, and tailored writing to improve relevance & readability. Itβs also better at working with uploaded...
Amazon: Nova Pro 1.0
Amazon Nova Pro 1.0 is a capable multimodal model from Amazon focused on providing a combination of accuracy, speed, and cost for a wide range of tasks. As of December...
Amazon: Nova Lite 1.0
Amazon Nova Lite 1.0 is a very low-cost multimodal model from Amazon that focused on fast processing of image, video, and text inputs to generate text output. Amazon Nova Lite...
OpenAI: o1
The latest and strongest model family from OpenAI, o1 is designed to spend more time thinking before responding. The o1 model series is trained with large-scale reinforcement learning to reason...
MiniMax: MiniMax-01
MiniMax-01 is a combines MiniMax-Text-01 for text generation and MiniMax-VL-01 for image understanding. It has 456 billion parameters, with 45.9 billion parameters activated per inference, and can handle a context...
Perplexity: Sonar
Sonar is lightweight, affordable, fast, and simple to use β now featuring citations and the ability to customize sources. It is designed for companies seeking to integrate lightweight question-and-answer features...
