๐ All Models
1,316 models ยท Page 24 of 37
Gemma-4-31B-IT-NVFP4
Open-source Gemma-4-31B-IT-NVFP4 model from nvidia โ available for download and self-hosting on Hugging Face.
NVIDIA-Nemotron-3-Nano-30B-A3B-FP8
Open-source NVIDIA-Nemotron-3-Nano-30B-A3B-FP8 model from nvidia โ available for download and self-hosting on Hugging Face.
qwen3-vl:235b
qwen3-vl:235b โ available to run locally via Ollama on CPU and GPU hardware.
qwen3-coder:480b
qwen3-coder:480b โ available to run locally via Ollama on CPU and GPU hardware.
deepseek-v3.2
deepseek-v3.2 โ available to run locally via Ollama on CPU and GPU hardware.
Qwen2.5-Coder-7B-Instruct
Qwen2.5-Coder-7B-Instruct โ Alibaba's Qwen series language model with strong multilingual and coding capabilities.
Qwen2.5-0.5B
Qwen2.5-0.5B โ Alibaba's Qwen series language model with strong multilingual and coding capabilities.
Qwen2.5-14B-Instruct
Qwen2.5-14B-Instruct โ Alibaba's Qwen series language model with strong multilingual and coding capabilities.
OpenELM-1_1B-Instruct
OpenELM-1_1B-Instruct โ open-source model from apple, available for self-hosting on Hugging Face.
Qwen2.5-Coder-7B
Qwen2.5-Coder-7B โ Alibaba's Qwen series language model with strong multilingual and coding capabilities.
Qwen2.5-32B-Instruct-AWQ
Qwen2.5-32B-Instruct-AWQ โ Alibaba's Qwen series language model with strong multilingual and coding capabilities.
gemma3:4b
gemma3:4b โ available to run locally via Ollama on CPU and GPU hardware.
qwen3-vl:235b-instruct
qwen3-vl:235b-instruct โ Alibaba's Qwen series language model with strong multilingual and coding capabilities.
minimax-m2.5
minimax-m2.5 โ available to run locally via Ollama on CPU and GPU hardware.
ministral-3:3b
ministral-3:3b โ available to run locally via Ollama on CPU and GPU hardware.
gpt-oss:20b
gpt-oss:20b โ available to run locally via Ollama on CPU and GPU hardware.
gemma4:31b
gemma4:31b โ available to run locally via Ollama on CPU and GPU hardware.
minimax-m2
minimax-m2 โ available to run locally via Ollama on CPU and GPU hardware.
TinyLlama-1.1B-Chat-v1.0
Open-source TinyLlama-1.1B-Chat-v1.0 model from tinyllama โ available for download and self-hosting on Hugging Face.
Qwen2-1.5B-Instruct
Qwen2-1.5B-Instruct โ Alibaba's Qwen series language model with strong multilingual and coding capabilities.
Qwen2.5-32B-Instruct
Open-source Qwen2.5-32B-Instruct model from qwen โ available for download and self-hosting on Hugging Face.
Llama-3.2-1B-Instruct
Open-source Llama-3.2-1B-Instruct model from meta-llama โ available for download and self-hosting on Hugging Face.
dolphin-2.9.1-yi-1.5-34b
dolphin-2.9.1-yi-1.5-34b โ open-source model from dphn, available for self-hosting on Hugging Face.
Qwen2.5-0.5B-Instruct
Open-source Qwen2.5-0.5B-Instruct model from qwen โ available for download and self-hosting on Hugging Face.
Llama-3.2-3B-Instruct
Open-source Llama-3.2-3B-Instruct model from meta-llama โ available for download and self-hosting on Hugging Face.
tiny-Qwen2ForCausalLM-2.5
Open-source tiny-Qwen2ForCausalLM-2.5 model from trl-internal-testing โ available for download and self-hosting on Hugging Face.
Qwen3-1.7B
Open-source Qwen3-1.7B model from qwen โ available for download and self-hosting on Hugging Face.
Llama-3.1-8B-Instruct
Open-source Llama-3.1-8B-Instruct model from meta-llama โ available for download and self-hosting on Hugging Face.
DeepSeek-V3.2
Open-source DeepSeek-V3.2 model from deepseek-ai โ available for download and self-hosting on Hugging Face.
Qwen2.5-3B-Instruct
Open-source Qwen2.5-3B-Instruct model from qwen โ available for download and self-hosting on Hugging Face.
Qwen2.5-1.5B-Instruct
Qwen2.5-1.5B-Instruct โ Alibaba's Qwen series language model with strong multilingual and coding capabilities.
Qwen2.5-7B-Instruct
Open-source Qwen2.5-7B-Instruct model from qwen โ available for download and self-hosting on Hugging Face.
Qwen3-0.6B
Open-source Qwen3-0.6B model from qwen โ available for download and self-hosting on Hugging Face.
Elephant
Elephant Alpha is a 100B-parameter text model focused on intelligence efficiency, delivering strong performance while minimizing token usage. It supports a 256K context window with up to 32K output tokens,...
autogluon/chronos-t5-tiny
autogluon/chronos-t5-tiny is a time series forecasting model on Hugging Face with ~74,825 monthly downloads. Open access.
facebook/nllb-200-3.3B
facebook/nllb-200-3.3B is a translation model on Hugging Face with ~77,317 monthly downloads. Open access.
