modelstop.top — Every AI Model, One Place

llama-3.2-11b-vision-instruct

meta

The Llama 3.2-Vision instruction-tuned models are optimized for visual recognition, image reasoning, captioning, and answering general questions about an image.

textvisionreasoning

Input$0.0490/1M

Output$0.6800/1M

📏128kcontext

Explore specs and pricingView details →

llama-4-scout-17b-16e-instruct

meta

Meta's Llama 4 Scout is a 17 billion parameter model with 16 experts that is natively multimodal. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text and image understanding.

textvisioninstruct

Input$0.2700/1M

Output$0.8500/1M

📏131kcontext

⚡418msp50

Explore specs and pricingView details →

Llama 3.2 11B Instruct

meta

Llama 3.2 11B Instruct — available via AWS Bedrock (us-east-1).

textvisionmultimodal

InputFree

Explore specs and pricingView details →

Llama 3.2 90B Instruct

meta

Llama 3.2 90B Instruct — available via AWS Bedrock (us-east-1).

textvisionmultimodal

InputFree

Explore specs and pricingView details →

Llama 4 Scout 17B Instruct

meta

Llama 4 Scout 17B Instruct — available via AWS Bedrock (us-east-1).

textvisionmultimodal

InputFree

Explore specs and pricingView details →

Llama 4 Maverick 17B Instruct

meta

Llama 4 Maverick 17B Instruct — available via AWS Bedrock (us-east-1).

textvisionmultimodal

InputFree

Explore specs and pricingView details →

AI Model Catalogue

llama-3.2-11b-vision-instruct

llama-4-scout-17b-16e-instruct

Llama 3.2 11B Instruct

Llama 3.2 90B Instruct

Llama 4 Scout 17B Instruct

Llama 4 Maverick 17B Instruct