SEA-LION stands for Southeast Asian Languages In One Network, which is a collection of Large Language Models (LLMs) which have been pretrained and instruct-tuned for the Southeast Asia (SEA) region.

textinstructcheap

Input$0.3500/1M

Output$0.5600/1M

📏128kcontext

Explore specs and pricingView details →

llama-3.1-8b-instruct-awq

meta

Quantized (int4) generative text model with 8 billion parameters from Meta.

Explore specs and pricingView details →

llama-4-scout-17b-16e-instruct

meta

Meta's Llama 4 Scout is a 17 billion parameter model with 16 experts that is natively multimodal. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text and image understanding.

Explore specs and pricingView details →

llama-3-8b-instruct-awq

meta

Quantized (int4) generative text model with 8 billion parameters from Meta.

Explore specs and pricingView details →

llama-3-8b-instruct

meta

Generation over generation, Meta Llama 3 demonstrates state-of-the-art performance on a wide range of industry benchmarks and offers new capabilities, including improved reasoning.

textreasoninginstruct

Input$0.2800/1M

Output$0.8300/1M

📏8kcontext

Explore specs and pricingView details →

llama-3.2-3b-instruct

meta

The Llama 3.2 instruction-tuned text only models are optimized for multilingual dialogue use cases, including agentic retrieval and summarization tasks.

textagentsmultilingual

Input$0.0510/1M

Output$0.3400/1M

📏80kcontext

Explore specs and pricingView details →

mistral-7b-instruct-v0.1

mistral

Instruct fine-tuned version of the Mistral-7b generative text model with 7 billion parameters

Explore specs and pricingView details →

mistral-7b-instruct-v0.2-lora

mistral

The Mistral-7B-Instruct-v0.2 Large Language Model (LLM) is an instruct fine-tuned version of the Mistral-7B-v0.2.

Explore specs and pricingView details →

llama-3.1-8b-instruct-fp8

meta

Llama 3.1 8B quantized to FP8 precision

Explore specs and pricingView details →

llama-3.3-70b-instruct-fp8-fast

meta

Llama 3.3 70B quantized to fp8 precision, optimized to be faster.

Explore specs and pricingView details →

granite-4.0-h-micro

ibm-granite

Granite 4.0 instruct models deliver strong performance across benchmarks, achieving industry-leading results in key agentic tasks like instruction following and function calling. These efficiencies make the models well-suited for a wide range of use cases like retrieval-augmented generation (RAG), multi-agent workflows, and edge deployments.

Explore specs and pricingView details →

qwen2.5-coder-32b-instruct

qwen

Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly known as CodeQwen). As of now, Qwen2.5-Coder has covered six mainstream model sizes, 0.5, 1.5, 3, 7, 14, 32 billion parameters, to meet the needs of different developers. Qwen2.5-Coder brings the following improvements upon CodeQwen1.5:

Explore specs and pricingView details →

deepseek-math-7b-instruct

deepseek-ai

DeepSeekMath-Instruct 7B is a mathematically instructed tuning model derived from DeepSeekMath-Base 7B. DeepSeekMath is initialized with DeepSeek-Coder-v1.5 7B and continues pre-training on math-related tokens sourced from Common Crawl, together with natural language and code data for 500B tokens.

Explore specs and pricingView details →

falcon-7b-instruct

tiiuae

Falcon-7B-Instruct is a 7B parameters causal decoder-only model built by TII based on Falcon-7B and finetuned on a mixture of chat/instruct datasets.

Explore specs and pricingView details →

llama-3.2-1b-instruct

meta

The Llama 3.2 instruction-tuned text only models are optimized for multilingual dialogue use cases, including agentic retrieval and summarization tasks.

textagentsmultilingual

Input$0.0270/1M

Output$0.2000/1M

📏60kcontext

Explore specs and pricingView details →

⭐Top Rated