๐ Long Context
408 models ยท Page 3 of 12
openai/gpt-oss-120b
gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized...
nvidia/Nemotron-3-Nano-30B-A3B
NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...
Qwen/Qwen3-Max-Thinking
Qwen3-Max-Thinking is the flagship reasoning model in the Qwen3 series, designed for high-stakes cognitive tasks that require deep, multi-step reasoning. By significantly scaling model capacity and reinforcement learning compute, it...
Qwen/Qwen3-235B-A22B-Thinking-2507
Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tasks. It activates 22B of its 235B parameters per forward pass and natively supports up to 262,144...
google/gemma-3-27b-it
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...
Qwen/Qwen3-VL-30B-A3B-Instruct
Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It excels in perception...
command-a-vision-07-2025
mistral-small-2603
Mistral Small 4.
open-mistral-nemo
Our best multilingual open source model released July 2024.
magistral-small-2509
Our efficient reasoning model released September 2025.
magistral-medium-2509
Our frontier-class reasoning model release candidate September 2025.
mistral-large-2512
Official mistral-large-2512 Mistral AI model
pixtral-large-2411
Official pixtral-large-2411 Mistral AI model
groq/compound
devstral-2512
Official devstral-2512 Mistral AI model
magistral-medium-2509
Our frontier-class reasoning model release candidate September 2025.
mistral-medium-2508
Update on Mistral Medium 3 with improved capabilities.
mistral-small-2603
Mistral Small 4.
mistral-medium-2508
Update on Mistral Medium 3 with improved capabilities.
devstral-small-2507
Our small open-source code-agentic model.
mistral-medium-2505
Our frontier-class multimodal model released May 2025.
llama-3.1-8b-instant
Meta's Llama 3.1 8B served on Groq's LPU for ultra-low latency โ ideal for fast, lightweight text tasks.
devstral-medium-2507
Our medium code-agentic model.
open-mistral-nemo
Our best multilingual open source model released July 2024.
mistral-medium-2508
Update on Mistral Medium 3 with improved capabilities.
mistral-medium-2508
Update on Mistral Medium 3 with improved capabilities.
mistral-vibe-cli-latest
Devstral 2512 release model
ministral-8b-2512
Ministral 3 (a.k.a. Tinystral) 8B Instruct.
labs-leanstral-2603
A mid & post-trained version of mistral small 4 for Lean
mistral-large-2512
Official mistral-large-2512 Mistral AI model
magistral-small-2509
Our efficient reasoning model released September 2025.
mistral-moderation-2603
Official mistral-moderation-2603 Mistral AI model
codestral-2508
Our cutting-edge language model for coding released August 2025.
mistral-small-2506
Our latest enterprise-grade small model with the latest version released June 2025.
open-mistral-nemo
Our best multilingual open source model released July 2024.
ministral-8b-2512
Ministral 3 (a.k.a. Tinystral) 8B Instruct.
