All Models
408 models · Page 4 of 12
ministral-8b-2512
Ministral 3 (a.k.a. Tinystral) 8B Instruct.
magistral-medium-2509
Our frontier-class reasoning model release candidate, September 2025.
pixtral-large-2411
Official pixtral-large-2411 model from Mistral AI.
command-r7b-arabic-02-2025
command-a-reasoning-08-2025
llama-3.3-70b-versatile
Meta's Llama 3.3 70B – latest iteration with improved instruction following, served on Groq's LPU.
groq/compound-mini
magistral-small-2509
Our efficient reasoning model released September 2025.
openai/gpt-oss-safeguard-20b
gpt-oss-safeguard-20b is a safety reasoning model from OpenAI built upon gpt-oss-20b. This open-weight, 21B-parameter Mixture-of-Experts (MoE) model offers lower latency for safety tasks like content classification, LLM filtering, and trust...
ministral-14b-2512
Ministral 3 (a.k.a. Tinystral) 14B Instruct.
open-mistral-nemo
Our best multilingual open-source model, released July 2024.
command-a-vision-07-2025
groq/compound
llama-3.1-8b-instant
Meta's Llama 3.1 8B served on Groq's LPU for ultra-low latency – ideal for fast, lightweight text tasks.
mistral-vibe-cli-latest
Devstral 2512 release model
mistral-small-2506
Our enterprise-grade small model; latest version released June 2025.
mistral-medium-2508
An update to Mistral Medium 3 with improved capabilities.
devstral-2512
Official devstral-2512 model from Mistral AI.
devstral-medium-2507
Our medium code-agentic model.
openai/gpt-oss-120b
gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized...
gpt-oss-120b
120B open-weight language model from OpenAI.
gpt-oss-20b
20B open-weight language model from OpenAI.
kimi-k2-thinking
Kimi K2 Thinking is the latest, most capable version of an open-source thinking model.
kimi-k2-thinking
kimi-k2-thinking – available to run locally via Ollama on CPU and GPU hardware.
gpt-oss:20b
gpt-oss:20b – available to run locally via Ollama on CPU and GPU hardware.
qwen3-coder-next
qwen3-coder-next – available to run locally via Ollama on CPU and GPU hardware.
gemini-3-flash-preview
gemini-3-flash-preview – available to run locally via Ollama on CPU and GPU hardware.
gpt-oss:120b
gpt-oss:120b – available to run locally via Ollama on CPU and GPU hardware.
Llama-3.1-70B-Instruct
Open-source Llama-3.1-70B-Instruct model from meta-llama – available for download and self-hosting on Hugging Face.
minimax-m2
minimax-m2 – available to run locally via Ollama on CPU and GPU hardware.
Llama-3.2-3B-Instruct
Open-source Llama-3.2-3B-Instruct model from meta-llama – available for download and self-hosting on Hugging Face.
Elephant
Elephant Alpha is a 100B-parameter text model focused on intelligence efficiency, delivering strong performance while minimizing token usage. It supports a 256K context window with up to 32K output tokens,...
Qwen3-30B-A3B-Instruct-2507
Open-source Qwen3-30B-A3B-Instruct-2507 model from qwen – available for download and self-hosting on Hugging Face.
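Several entries above are flagged as runnable locally via Ollama. As a minimal sketch of what that looks like in practice (assuming a stock Ollama install serving its documented HTTP API on the default port 11434, and using one model tag from the list purely as an example), the JSON body for Ollama's `/api/generate` endpoint can be built like this; nothing is sent over the network here:

```python
import json

# Ollama's default local endpoint (assumption: stock install, default port 11434).
OLLAMA_URL = "http://localhost:11434/api/generate"

def build_generate_request(model: str, prompt: str) -> str:
    """Build the JSON body for a non-streaming call to Ollama's /api/generate."""
    return json.dumps({"model": model, "prompt": prompt, "stream": False})

# Example: target the gpt-oss:20b tag listed above.
body = build_generate_request("gpt-oss:20b", "Say hello in one sentence.")
print(body)
```

Once an Ollama server is running and the tag has been pulled (`ollama pull gpt-oss:20b`), this body can be POSTed to `OLLAMA_URL` with curl or `urllib.request`.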
