๐ All Models
26 models ยท Page 1 of 1
llama-3.2-11b-vision-instruct
The Llama 3.2-Vision instruction-tuned models are optimized for visual recognition, image reasoning, captioning, and answering general questions about an image.
llama-3-8b-instruct-awq
Quantized (int4) generative text model with 8 billion parameters from Meta.
llama-4-scout-17b-16e-instruct
Meta's Llama 4 Scout is a 17 billion parameter model with 16 experts that is natively multimodal. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text and image understanding.
llama-3.1-8b-instruct-awq
Quantized (int4) generative text model with 8 billion parameters from Meta.
llama-3.2-1b-instruct
The Llama 3.2 instruction-tuned text only models are optimized for multilingual dialogue use cases, including agentic retrieval and summarization tasks.
llama-3-8b-instruct
Generation over generation, Meta Llama 3 demonstrates state-of-the-art performance on a wide range of industry benchmarks and offers new capabilities, including improved reasoning.
llama-3.2-3b-instruct
The Llama 3.2 instruction-tuned text only models are optimized for multilingual dialogue use cases, including agentic retrieval and summarization tasks.
llama-guard-3-8b
Llama Guard 3 is a Llama-3.1-8B pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classify content in both LLM inputs (prompt classification) and in LLM responses (response classification). It acts as an LLM โ it generates text in its output that indicates whether a given prompt or response is safe or unsafe, and if unsafe, it also lists the content categories violated.
llama-2-7b-chat-fp16
Full precision (fp16) generative text model with 7 billion parameters from Meta
llama-2-7b-chat-int8
Quantized (int8) generative text model with 7 billion parameters from Meta
llama-3.1-8b-instruct-fp8
Llama 3.1 8B quantized to FP8 precision
llama-3.3-70b-instruct-fp8-fast
Llama 3.3 70B quantized to fp8 precision, optimized to be faster.
m2m100-1.2b
Multilingual encoder-decoder (seq-to-seq) model trained for Many-to-Many multilingual translation
Llama 3.3 70B Instruct
Llama 3.3 70B Instruct โ available via AWS Bedrock (us-east-1).
Llama 3.2 3B Instruct
Llama 3.2 3B Instruct โ available via AWS Bedrock (us-east-1).
Llama 4 Scout 17B Instruct
Llama 4 Scout 17B Instruct โ available via AWS Bedrock (us-east-1).
Llama 3 70B Instruct
Llama 3 70B Instruct โ available via AWS Bedrock (us-east-1).
Llama 3 8B Instruct
Llama 3 8B Instruct โ available via AWS Bedrock (us-east-1).
Llama 3.2 1B Instruct
Llama 3.2 1B Instruct โ available via AWS Bedrock (us-east-1).
Llama 4 Maverick 17B Instruct
Llama 4 Maverick 17B Instruct โ available via AWS Bedrock (us-east-1).
Llama 3.1 8B Instruct
Llama 3.1 8B Instruct โ available via AWS Bedrock (us-east-1).
Llama 3.1 70B Instruct
Llama 3.1 70B Instruct โ available via AWS Bedrock (us-east-1).
Llama 3.2 11B Instruct
Llama 3.2 11B Instruct โ available via AWS Bedrock (us-east-1).
Llama 3.2 90B Instruct
Llama 3.2 90B Instruct โ available via AWS Bedrock (us-east-1).
llama-4-maverick-instruct
A 17 billion parameter model with 128 experts
llama-4-scout-instruct
A 17 billion parameter model with 16 experts
