Home/All Models

AI Model Catalogue

Browse 1,316 models across providers, modalities, and use cases.

🌐All Models 💬Text Generation 💻Code & Reasoning 👁️Vision & Multimodal 🎨Image Generation 🎙️Audio & Speech 🤖Agents & Tools 📄Long Context 🆓Free & Open 🧠Reasoning 🌍Multilingual

Filter & Sort

🌐 All Models

1,316 models · Page 31 of 37

Anthropic: Claude 3.7 Sonnet

anthropic

Claude 3.7 Sonnet is an advanced large language model with improved reasoning, coding, and problem-solving capabilities. It introduces a hybrid reasoning approach, allowing users to choose between rapid responses and...

textvisionmultimodal

200,000 ctx$3.00/1M in

Explore specs and pricingView details →

Google: Gemini 2.0 Flash Lite

google

Gemini 2.0 Flash Lite offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while maintaining quality on par with larger models like [Gemini Pro 1.5](/google/gemini-pro-1.5),...

textvisionmultimodal

1,048,576 ctx$0.07/1M in

Explore specs and pricingView details →

Qwen: QwQ 32B

qwen

QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ, which is capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks,...

textreasoningcheap

131,072 ctx$0.15/1M in

Explore specs and pricingView details →

Perplexity: Sonar Deep Research

perplexity

Sonar Deep Research is a research-focused model designed for multi-step retrieval, synthesis, and reasoning across complex topics. It autonomously searches, reads, and evaluates sources, refining its approach as it gathers...

textreasoninglong-context

128,000 ctx$2.00/1M in

Explore specs and pricingView details →

Perplexity: Sonar Pro

perplexity

Note: Sonar Pro pricing includes Perplexity search pricing. See [details here](https://docs.perplexity.ai/guides/pricing#detailed-pricing-breakdown-for-sonar-reasoning-pro-and-sonar-pro) For enterprises seeking more advanced capabilities, the Sonar Pro API can handle in-depth, multi-step queries with added extensibility, like...

textvisionmultimodal

200,000 ctx$3.00/1M in

Explore specs and pricingView details →

Perplexity: Sonar Reasoning Pro

perplexity

Note: Sonar Pro pricing includes Perplexity search pricing. See [details here](https://docs.perplexity.ai/guides/pricing#detailed-pricing-breakdown-for-sonar-reasoning-pro-and-sonar-pro) Sonar Reasoning Pro is a premier reasoning model powered by DeepSeek R1 with Chain of Thought (CoT). Designed for...

textvisionmultimodal

128,000 ctx$2.00/1M in

Explore specs and pricingView details →

TheDrummer: Skyfall 36B V2

thedrummer

Skyfall 36B v2 is an enhanced iteration of Mistral Small 2501, specifically fine-tuned for improved creativity, nuanced writing, role-playing, and coherent storytelling.

textcheap

32,768 ctx$0.55/1M in

Explore specs and pricingView details →

Google: Gemma 3 27B (free)

google

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

textvisionmultimodal

131,072 ctx$0.08/1M in

Explore specs and pricingView details →

Reka Flash 3

rekaai

Reka Flash 3 is a general-purpose, instruction-tuned large language model with 21 billion parameters, developed by Reka. It excels at general chat, coding tasks, instruction-following, and function calling. Featuring a...

textcodecheap

65,536 ctx$0.10/1M in

Explore specs and pricingView details →

OpenAI: GPT-4o Search Preview

openai

GPT-4o Search Previewis a specialized model for web search in Chat Completions. It is trained to understand and execute web search queries.

textlong-context

128,000 ctx$2.50/1M in

Explore specs and pricingView details →

OpenAI: GPT-4o-mini Search Preview

openai

GPT-4o mini Search Preview is a specialized model for web search in Chat Completions. It is trained to understand and execute web search queries.

textcheaplong-context

128,000 ctx$0.15/1M in

Explore specs and pricingView details →

Cohere: Command A

cohere

Command A is an open-weights 111B parameter model with a 256k context window focused on delivering great performance across agentic, multilingual, and coding use cases. Compared to other leading proprietary...

textcodeagents

256,000 ctx$2.50/1M in

Explore specs and pricingView details →

Google: Gemma 3 12B (free)

google

textvisionmultimodal

32,768 ctx$0.04/1M in

Explore specs and pricingView details →

Google: Gemma 3 4B (free)

google

textvisionmultimodal

32,768 ctx$0.04/1M in

Explore specs and pricingView details →

AllenAI: Olmo 2 32B Instruct

allenai

OLMo-2 32B Instruct is a supervised instruction-finetuned variant of the OLMo-2 32B March 2025 base model. It excels in complex reasoning and instruction-following tasks across diverse benchmarks such as GSM8K,...

textreasoninginstruct

128,000 ctx$0.05/1M in

Explore specs and pricingView details →

Mistral: Mistral Small 3.1 24B

mistralai

Mistral Small 3.1 24B Instruct is an upgraded variant of Mistral Small 3 (2501), featuring 24 billion parameters with advanced multimodal capabilities. It provides state-of-the-art performance in text-based reasoning and...

textvisionmultimodal

128,000 ctx$0.03/1M in

Explore specs and pricingView details →

OpenAI: o1-pro

openai

The o1 series of models are trained with reinforcement learning to think before they answer and perform complex reasoning. The o1-pro model uses more compute to think harder and provide...

textvisionmultimodal

200,000 ctx$150.00/1M in

Explore specs and pricingView details →

DeepSeek: DeepSeek V3 0324

deepseek

DeepSeek V3, a 685B-parameter, mixture-of-experts model, is the latest iteration of the flagship chat model family from the DeepSeek team. It succeeds the [DeepSeek V3](/deepseek/deepseek-chat-v3) model and performs really well...

textcheaplong-context

163,840 ctx$0.20/1M in

Explore specs and pricingView details →

Qwen: Qwen2.5 VL 32B Instruct

qwen

Qwen2.5-VL-32B is a multimodal vision-language model fine-tuned through reinforcement learning for enhanced mathematical reasoning, structured outputs, and visual problem-solving capabilities. It excels at visual analysis tasks, including object recognition, textual...

textvisionmultimodal

128,000 ctx$0.20/1M in

Explore specs and pricingView details →

Meta: Llama 4 Scout

meta-llama

Llama 4 Scout 17B Instruct (16E) is a mixture-of-experts (MoE) language model developed by Meta, activating 17 billion parameters out of a total of 109B. It supports native multimodal input...

textvisionmultimodal

327,680 ctx$0.08/1M in

Explore specs and pricingView details →

Meta: Llama 4 Maverick

meta-llama

Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward...

textvisionmultimodal

1,048,576 ctx$0.15/1M in

Explore specs and pricingView details →

NVIDIA: Llama 3.1 Nemotron Ultra 253B v1

nvidia

Llama-3.1-Nemotron-Ultra-253B-v1 is a large language model (LLM) optimized for advanced reasoning, human-interactive chat, retrieval-augmented generation (RAG), and tool-calling tasks. Derived from Meta’s Llama-3.1-405B-Instruct, it has been significantly customized using Neural...

reasoninginstructcheap

131,072 ctx$0.60/1M in

Explore specs and pricingView details →

xAI: Grok 3 Beta

x-ai

Grok 3 is the latest model from xAI. It's their flagship model that excels at enterprise use cases like data extraction, coding, and text summarization. Possesses deep domain knowledge in...

textcodelong-context

131,072 ctx$3.00/1M in

Explore specs and pricingView details →

xAI: Grok 3 Mini Beta

x-ai

Grok 3 Mini is a lightweight, smaller thinking model. Unlike traditional models that generate answers immediately, Grok 3 Mini thinks before responding. It’s ideal for reasoning-heavy tasks that don’t demand...

textreasoningcheap

131,072 ctx$0.30/1M in

Explore specs and pricingView details →

AlfredPros: CodeLLaMa 7B Instruct Solidity

alfredpros

A finetuned 7 billion parameters Code LLaMA - Instruct model to generate Solidity smart contract using 4-bit QLoRA finetuning provided by PEFT library.

textcodeinstruct

4,096 ctx$0.80/1M in

Explore specs and pricingView details →

EleutherAI: Llemma 7b

eleutherai

Llemma 7B is a language model for mathematics. It was initialized with Code Llama 7B weights, and trained on the Proof-Pile-2 for 200B tokens. Llemma models are particularly strong at...

codecheap

4,096 ctx$0.80/1M in

Explore specs and pricingView details →

OpenAI: GPT-4.1 Nano

openai

For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1 million...

textvisionmultimodal

1,047,576 ctx$0.10/1M in

Explore specs and pricingView details →

OpenAI: GPT-4.1 Mini

openai

GPT-4.1 Mini is a mid-sized model delivering performance competitive with GPT-4o at substantially lower latency and cost. It retains a 1 million token context window and scores 45.1% on hard...

textvisionmultimodal

1,047,576 ctx$0.40/1M in

Explore specs and pricingView details →

OpenAI: GPT-4.1

openai

GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and...

textvisionmultimodal

1,047,576 ctx$2.00/1M in

Explore specs and pricingView details →

Qwen: Qwen2.5 Coder 7B Instruct

qwen

Qwen2.5-Coder-7B-Instruct is a 7B parameter instruction-tuned language model optimized for code-related tasks such as code generation, reasoning, and bug fixing. Based on the Qwen2.5 architecture, it incorporates enhancements like RoPE,...

codereasoninginstruct

32,768 ctx$0.03/1M in

Explore specs and pricingView details →

OpenAI: o4 Mini

openai

OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-efficient performance while retaining strong multimodal and agentic capabilities. It supports tool use and demonstrates competitive reasoning...

textvisionmultimodal

200,000 ctx$1.10/1M in

Explore specs and pricingView details →

OpenAI: o3

openai

o3 is a well-rounded and powerful model across domains. It sets a new standard for math, science, coding, and visual reasoning tasks. It also excels at technical writing and instruction-following....

textvisionmultimodal

200,000 ctx$2.00/1M in

Explore specs and pricingView details →

OpenAI: o4 Mini High

openai

OpenAI o4-mini-high is the same model as [o4-mini](/openai/o4-mini) with reasoning_effort set to high. OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-efficient performance while retaining...

textvisionmultimodal

200,000 ctx$1.10/1M in

Explore specs and pricingView details →

Qwen: Qwen3 235B A22B

qwen

Qwen3-235B-A22B is a 235B parameter mixture-of-experts (MoE) model developed by Qwen, activating 22B parameters per forward pass. It supports seamless switching between a "thinking" mode for complex reasoning, math, and...

textreasoningcheap

131,072 ctx$0.46/1M in

Explore specs and pricingView details →

Qwen: Qwen3 32B

qwen

Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for...

textreasoningcheap

40,960 ctx$0.08/1M in

Explore specs and pricingView details →

Qwen: Qwen3 14B

qwen

Qwen3-14B is a dense 14.8B parameter causal language model from the Qwen3 series, designed for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for...

textreasoningcheap

40,960 ctx$0.06/1M in

Explore specs and pricingView details →