๐ฌ Text Generation
850 models ยท Page 9 of 24
meta-llama/Meta-Llama-3.1-8B-Instruct
Meta Llama 3.1 8B Instruct on DeepInfra โ fast, affordable open-source model with 128K context.
Gryphe/MythoMax-L2-13b
One of the highest performing and most popular fine-tunes of Llama 2 13B, with rich descriptions and roleplay. #merge
ByteDance/Seedream-4.5
Qwen/Qwen3.5-9B
intfloat/multilingual-e5-large-instruct
stepfun-ai/Step-3.5-Flash
Bria/remove_background
sentence-transformers/all-MiniLM-L12-v2
Sao10K/L3-8B-Lunaris-v1-Turbo
zai-org/GLM-4.7-Flash
Qwen/Qwen-Image-Max
microsoft/phi-4
Microsoft Phi-4 14B โ small language model achieving state-of-the-art results on reasoning tasks.
stabilityai/sdxl-turbo
BAAI/bge-m3-multi
nvidia/NVIDIA-Nemotron-Nano-9B-v2
ClarityAI/creative
google/gemini-2.5-flash
black-forest-labs/FLUX-2-klein-4b
meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8
Qwen/Qwen3.5-35B-A3B
embed-english-light-v3.0
google/gemini-1.5-flash-8b
Bria/replace_background
ClarityAI/crystal
deepseek-ai/DeepSeek-V3
DeepSeek V3 โ 671B MoE model with exceptional coding and math performance at very low cost.
embed-multilingual-light-v3.0
deepseek-ai/DeepSeek-R1-0528-Turbo
Qwen/Qwen3-Max-Thinking
Qwen3-Max-Thinking is the flagship reasoning model in the Qwen3 series, designed for high-stakes cognitive tasks that require deep, multi-step reasoning. By significantly scaling model capacity and reinforcement learning compute, it...
nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL
PaddlePaddle/PaddleOCR-VL-0.9B
zai-org/GLM-4.7
Bria/expand
PrunaAI/p-image-Edit
openai/gpt-oss-120b
gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized...
