π Long Context
454 models Β· Page 1 of 13
llama-4-scout-17b-16e-instruct
Meta's Llama 4 Scout is a 17 billion parameter model with 16 experts that is natively multimodal. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text and image understanding.
gemma-4-26b-a4b-it
Gemma 4 is Google's most intelligent family of open models, built from Gemini 3 research to maximize intelligence-per-parameter.
llama-3.2-11b-vision-instruct
The Llama 3.2-Vision instruction-tuned models are optimized for visual recognition, image reasoning, captioning, and answering general questions about an image.
bge-base-en-v1.5
BAAI general embedding (Base) model that transforms any given text into a 768-dimensional vector
mistral-small-3.1-24b-instruct
Building upon Mistral Small 3 (2501), Mistral Small 3.1 (2503) adds state-of-the-art vision understanding and enhances long context capabilities up to 128k tokens without compromising text performance. With 24 billion parameters, this model achieves top-tier capabilities in both text and vision tasks.
gemma-sea-lion-v4-27b-it
SEA-LION stands for Southeast Asian Languages In One Network, which is a collection of Large Language Models (LLMs) which have been pretrained and instruct-tuned for the Southeast Asia (SEA) region.
gpt-oss-20b
OpenAIβs open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases β gpt-oss-20b is for lower latency, and local or specialized use-cases.
kimi-k2.6
Kimi K2.6 is a frontier-scale open-source 1T parameter model with a 262.1k context window, multi-turn tool calling, vision inputs, and structured outputs for agentic workloads.
llama-guard-3-8b
Llama Guard 3 is a Llama-3.1-8B pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classify content in both LLM inputs (prompt classification) and in LLM responses (response classification). It acts as an LLM β it generates text in its output that indicates whether a given prompt or response is safe or unsafe, and if unsafe, it also lists the content categories violated.
nemotron-3-120b-a12b
NVIDIA Nemotron 3 Super is a hybrid MoE model with leading accuracy for multi-agent applications and specialized agentic AI systems.
granite-4.0-h-micro
Granite 4.0 instruct models deliver strong performance across benchmarks, achieving industry-leading results in key agentic tasks like instruction following and function calling. These efficiencies make the models well-suited for a wide range of use cases like retrieval-augmented generation (RAG), multi-agent workflows, and edge deployments.
gpt-oss-120b
OpenAIβs open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases β gpt-oss-120b is for production, general purpose, high reasoning use-cases.
kimi-k2.5
Kimi K2.5 is a frontier-scale open-source model with a 256k context window, multi-turn tool calling, vision inputs, and structured outputs for agentic workloads.
glm-4.7-flash
GLM-4.7-Flash is a fast and efficient multilingual text generation model with a 131,072 token context window. Optimized for dialogue, instruction-following, and multi-turn tool calling across 100+ languages.
Gemma 3 27B PT
Gemma 3 27B PT β available via AWS Bedrock (us-east-1).
GPT OSS Safeguard 20B
GPT OSS Safeguard 20B β available via AWS Bedrock (us-east-1).
Gemma 3 4B IT
Gemma 3 4B IT β available via AWS Bedrock (us-east-1).
glm-5p1
glm-5
Qwen3-VL-30B-A3B-Instruct
Open-source Qwen3-VL-30B-A3B-Instruct model from qwen β available for download and self-hosting on Hugging Face.
Qwen3.5-35B-A3B
Open-source Qwen3.5-35B-A3B model from qwen β available for download and self-hosting on Hugging Face.
Qwen3-VL-32B-Instruct
Open-source Qwen3-VL-32B-Instruct model from qwen β available for download and self-hosting on Hugging Face.
Qwen3.5-9B
Open-source Qwen3.5-9B model from qwen β available for download and self-hosting on Hugging Face.
Qwen3-VL-8B-Instruct
Open-source Qwen3-VL-8B-Instruct model from qwen β available for download and self-hosting on Hugging Face.
Qwen3.5-27B
Open-source Qwen3.5-27B model from qwen β available for download and self-hosting on Hugging Face.
Llama-Guard-4-12B
Open-source Llama-Guard-4-12B model from meta-llama β available for download and self-hosting on Hugging Face.
Llama-3.2-11B-Vision-Instruct
Open-source Llama-3.2-11B-Vision-Instruct model from meta-llama β available for download and self-hosting on Hugging Face.
Llama-Guard-3-8B
Open-source Llama-Guard-3-8B model from meta-llama β available for download and self-hosting on Hugging Face.
Qwen3-235B-A22B
Open-source Qwen3-235B-A22B model from qwen β available for download and self-hosting on Hugging Face.
Llama-3.3-70B-Instruct
Open-source Llama-3.3-70B-Instruct model from meta-llama β available for download and self-hosting on Hugging Face.
Qwen3-Coder-Next
Open-source Qwen3-Coder-Next model from qwen β available for download and self-hosting on Hugging Face.
