Long Context
408 models · Page 1 of 12
Qwen2.5 1.5B
Qwen2.5 1.5B: Alibaba's Qwen series language model with strong multilingual and coding capabilities.
Qwen3.5 397B A17b Fp8
Qwen3.5 397B A17b Fp8: Alibaba's Qwen series language model with strong multilingual and coding capabilities.
Ministral 3 14B Instruct 2512
Minimax M1 40K
Llama 4 Maverick 17B 128E
Nvidia Nemotron Nano 9B V2
Meta Llama 3.2 3B
Qwen QwQ-32B
QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ, which is capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks,...
Nvidia Nemotron 3 Super 120B A12b Fp8
Deepcoder 14B Preview
Deepseek V3.1 Terminus
MiniMax M2
Nvidia Nemotron 3 Nano 30B A3b Bf16
Qwen2.5 7B
Qwen2.5 7B: Alibaba's Qwen series language model with strong multilingual and coding capabilities.
Qwen3.5 9B Fp8
Minimax M1 80K
Meta Llama 3.2 1B Instruct
Qwen3-VL-235B-A22B-Instruct-FP8
Qwen2.5 14B
DeepSeek R1 0528
Qwen2.5 72B
Glm 4.5 Air Fp8
Qwen2.5 72B Instruct Turbo
Kimi K2 Thinking
Kimi K2 Thinking is Moonshot AI's most advanced open reasoning model to date, extending the K2 series into agentic, long-horizon reasoning. Built on the trillion-parameter Mixture-of-Experts (MoE) architecture introduced in...
Qwen3 30B A3B Instruct 2507 Lora
Nvidia Nemotron 3 Super 120B A12b Bf16
Deepseek V3.2
Deepseek V3.2 Exp
DeepSeek R1 (Original)
Llama 4 Maverick Instruct (17Bx128E) FP8
Llama 3.2 1B
Qwen3 4B Instruct 2507
Meta Llama 3.1 8B Instruct Turbo
Llama 3.1 405B
Devstral Small 2505
Llama 3.1 70B
Llama 3.1 70B: Meta's Llama open-source language model, one of the most widely deployed open models.
