Long Context
454 models · Page 5 of 13
openai/gpt-oss-120b
gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized...
openai/gpt-oss-20b
gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 license. It uses a Mixture-of-Experts (MoE) architecture with 3.6B active parameters per forward pass, optimized for...
openai/gpt-oss-safeguard-20b
gpt-oss-safeguard-20b is a safety reasoning model from OpenAI built upon gpt-oss-20b. This open-weight, 21B-parameter Mixture-of-Experts (MoE) model offers lower latency for safety tasks like content classification, LLM filtering, and trust...
devstral-2512
Official devstral-2512 Mistral AI model
magistral-medium-2509
Our frontier-class reasoning model, release candidate from September 2025.
devstral-small-2507
Our small open-source code-agentic model.
ministral-8b-2512
Ministral 3 (a.k.a. Tinystral) 8B Instruct.
devstral-2512
Official devstral-2512 Mistral AI model
ministral-3b-2512
Ministral 3 (a.k.a. Tinystral) 3B Instruct.
mistral-small-2603
Mistral Small 4.
mistral-medium-2508
Update on Mistral Medium 3 with improved capabilities.
mistral-small-2506
Our enterprise-grade small model, with the latest version released in June 2025.
c4ai-aya-expanse-32b
pixtral-large-2411
Official pixtral-large-2411 Mistral AI model
mistral-large-2411
Our top-tier reasoning model for high-complexity tasks, with the latest version released in November 2024.
pixtral-large-2411
Official pixtral-large-2411 Mistral AI model
devstral-2512
Official devstral-2512 Mistral AI model
mistral-medium-2508
Update on Mistral Medium 3 with improved capabilities.
qwen/qwen3-32b
Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for...
gpt-oss-120b
120b open-weight language model from OpenAI
gpt-oss-20b
20b open-weight language model from OpenAI
kimi-k2-thinking
Kimi K2 Thinking is the latest, most capable version of an open-source thinking model.
qwen3-coder-next
qwen3-coder-next, available to run locally via Ollama on CPU and GPU hardware.
Llama-3.1-70B-Instruct
Open-source Llama-3.1-70B-Instruct model from meta-llama, available for download and self-hosting on Hugging Face.
gpt-oss:20b
gpt-oss:20b, available to run locally via Ollama on CPU and GPU hardware.
gpt-oss:120b
gpt-oss:120b, available to run locally via Ollama on CPU and GPU hardware.
gemini-3-flash-preview
gemini-3-flash-preview, available to run locally via Ollama on CPU and GPU hardware.
kimi-k2-thinking
kimi-k2-thinking, available to run locally via Ollama on CPU and GPU hardware.
minimax-m2
minimax-m2, available to run locally via Ollama on CPU and GPU hardware.
Llama-3.2-3B-Instruct
Open-source Llama-3.2-3B-Instruct model from meta-llama, available for download and self-hosting on Hugging Face.
Llama-3.1-8B-Instruct
Open-source Llama-3.1-8B-Instruct model from meta-llama, available for download and self-hosting on Hugging Face.
Elephant
Elephant Alpha is a 100B-parameter text model focused on intelligence efficiency, delivering strong performance while minimizing token usage. It supports a 256K context window with up to 32K output tokens,...
Qwen3-30B-A3B-Instruct-2507
Open-source Qwen3-30B-A3B-Instruct-2507 model from qwen, available for download and self-hosting on Hugging Face.
Qwen3-Coder-30B-A3B-Instruct
Open-source Qwen3-Coder-30B-A3B-Instruct model from qwen, available for download and self-hosting on Hugging Face.
Qwen3-30B-A3B
Open-source Qwen3-30B-A3B model from qwen, available for download and self-hosting on Hugging Face.
