๐ฌ Text Generation
1,750 models ยท Page 35 of 49
openai/gpt-oss-120b
gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized...
Qwen/Qwen3-VL-30B-A3B-Instruct
Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It excels in perception...
microsoft/phi-4
Microsoft Phi-4 14B โ small language model achieving state-of-the-art results on reasoning tasks.
Qwen/Qwen3-14B
Qwen3-14B is a dense 14.8B parameter causal language model from the Qwen3 series, designed for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for...
Qwen/Qwen3-Embedding-0.6B
nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL
black-forest-labs/FLUX-2-pro
Qwen/Qwen3-Embedding-8B-batch
zai-org/GLM-5.1
Bria/expand
black-forest-labs/FLUX-2-klein-4b
google/gemma-4-31B-it
Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...
zai-org/GLM-4.7-Flash
nvidia/llama-nemotron-embed-vl-1b-v2
Qwen/Qwen3.5-9B
meta-llama/Meta-Llama-3.1-8B-Instruct
Meta Llama 3.1 8B Instruct on DeepInfra โ fast, affordable open-source model with 128K context.
anthropic/claude-3-7-sonnet-latest
stepfun-ai/Step-3.5-Flash
Wan-AI/Wan2.7-Image-Edit
Wan-AI/Wan2.7-Image-Edit โ served on DeepInfra's GPU cloud for scalable, cost-efficient inference.
Wan-AI/Wan2.6-T2I
Sao10K/L3-8B-Lunaris-v1-Turbo
zai-org/GLM-4.7
Qwen/Qwen3-Embedding-4B-batch
Bria/erase_foreground
Bria/erase_foreground โ served on DeepInfra's GPU cloud for scalable, cost-efficient inference.
Qwen/Qwen3-Coder-480B-A35B-Instruct
deepseek-ai/DeepSeek-V3
DeepSeek V3 โ 671B MoE model with exceptional coding and math performance at very low cost.
Wan-AI/Wan2.6-Image-Edit
Qwen/Qwen3-235B-A22B-Thinking-2507
Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tasks. It activates 22B of its 235B parameters per forward pass and natively supports up to 262,144...
Qwen/Qwen3-Embedding-8B
anthropic/claude-4-opus
Bria/remove_background
mistralai/Mixtral-8x7B-Instruct-v0.1
Mixtral 8ร7B Instruct on DeepInfra โ popular MoE model with 32K context and strong multilingual performance.
embed-v4.0
Cohere's latest multimodal embedding model supporting text and images for advanced semantic search.
