All Models
105 models · Page 1 of 3
Qwen3 Next 80B A3b Instruct
Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without "thinking" traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual...
Deepseek Coder 33B Instruct
Qwen 2.5 Coder 32B Instruct
Qwen3 Coder 30B A3b Instruct
Qwen3-Coder-30B-A3B-Instruct is a 30.5B parameter Mixture-of-Experts (MoE) model with 128 experts (8 active per forward pass), designed for advanced code generation, repository-scale understanding, and agentic tool use. Built on the...
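The expert arithmetic in the entry above (128 experts, only 8 active per forward pass) can be illustrated with a toy routing layer. This is a hedged sketch with made-up dimensions and random weights, not the model's actual implementation:

```python
import numpy as np

# Toy Mixture-of-Experts routing: 128 experts, top-8 active per token.
# Dimensions and weights are illustrative placeholders.
rng = np.random.default_rng(0)
num_experts, top_k, d_model = 128, 8, 16

router_w = rng.normal(size=(d_model, num_experts))           # router projection
expert_w = rng.normal(size=(num_experts, d_model, d_model))  # one matrix per expert

def moe_forward(x):
    """Route a single token vector x through the top-k scored experts."""
    logits = x @ router_w                 # (num_experts,) router scores
    top = np.argsort(logits)[-top_k:]     # indices of the 8 active experts
    gates = np.exp(logits[top] - logits[top].max())
    gates /= gates.sum()                  # softmax over the selected experts only
    # Weighted sum of the active experts' outputs; the other 120 stay idle.
    return sum(g * (x @ expert_w[i]) for g, i in zip(gates, top))

token = rng.normal(size=d_model)
out = moe_forward(token)
print(out.shape)  # (16,)
```

Only the selected experts' weights touch the token, which is why a 30.5B-parameter model can run with roughly 3B active parameters per token.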
Qwen3 Next 80B A3b Thinking
Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured "thinking" traces by default. It's designed for hard multi-step problems: math proofs, code synthesis/debugging, logic, and agentic...
Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo
Qwen/Qwen3-Coder-480B-A35B-Instruct
deepseek-ai/DeepSeek-V3
DeepSeek V3 – 671B MoE model with exceptional coding and math performance at very low cost.
Qwen/Qwen2.5-72B-Instruct
Alibaba Qwen2.5 72B Instruct on DeepInfra – top open-source model with multilingual and coding strengths.
codestral-2508
Our cutting-edge language model for coding released August 2025.
codestral-2508
Our cutting-edge language model for coding released August 2025.
devstral-medium-2507
Our medium code-agentic model.
devstral-small-2507
Our small open-source code-agentic model.
claude-opus-4.6
Anthropic's most intelligent model with state-of-the-art coding, reasoning, and agentic capabilities
gpt-5.4
OpenAI's most capable frontier model for complex professional work, coding, and multi-step reasoning.
Qwen2.5-Coder-7B-Instruct
Qwen2.5-Coder-7B-Instruct – Alibaba's Qwen series language model with strong multilingual and coding capabilities.
Qwen2.5-Coder-7B-Instruct-GPTQ-Int4
Open-source Qwen2.5-Coder-7B-Instruct-GPTQ-Int4 model from qwen – available for download and self-hosting on Hugging Face.
qwen3-coder-next
qwen3-coder-next – available to run locally via Ollama on CPU and GPU hardware.
qwen3-coder:480b
qwen3-coder:480b – available to run locally via Ollama on CPU and GPU hardware.
Qwen3-Coder-Next-FP8
Open-source Qwen3-Coder-Next-FP8 model from qwen – available for download and self-hosting on Hugging Face.
Qwen2.5-Coder-32B-Instruct
Open-source Qwen2.5-Coder-32B-Instruct model from qwen – available for download and self-hosting on Hugging Face.
Qwen2.5-Coder-14B-Instruct
Open-source Qwen2.5-Coder-14B-Instruct model from qwen – available for download and self-hosting on Hugging Face.
Qwen2.5-Coder-7B
Qwen2.5-Coder-7B – Alibaba's Qwen series language model with strong multilingual and coding capabilities.
Qwen/Qwen2.5-Coder-32B-Instruct
Qwen/Qwen2.5-Coder-32B-Instruct is a text generation model on Hugging Face with ~1,107,488 monthly downloads. Open access.
Qwen/Qwen2.5-Coder-7B-Instruct-GPTQ-Int4
Qwen/Qwen2.5-Coder-7B-Instruct-GPTQ-Int4 is a text generation model on Hugging Face with ~1,124,729 monthly downloads. Open access.
Qwen3-Coder-30B-A3B-Instruct
Open-source Qwen3-Coder-30B-A3B-Instruct model from qwen – available for download and self-hosting on Hugging Face.
Qwen/Qwen2.5-Coder-7B-Instruct
Qwen/Qwen2.5-Coder-7B-Instruct is a text generation model on Hugging Face with ~2,225,884 monthly downloads. Open access.
Samsung Gauss 2 54B Instruct
Samsung Gauss 2 is Samsung's large language model optimized for on-device and cloud workloads. Trained on multilingual data with a focus on Korean and English, covering general conversation, summarization, and code assistance.
Qwen3 235B A22B
Qwen3 235B A22B is Alibaba's flagship mixture-of-experts model with 235B total parameters and 22B active per token. Delivers frontier-level performance on coding, reasoning, and multilingual tasks at significantly lower inference cost.
Databricks DBRX Instruct
DBRX Instruct is an open, general-purpose LLM from Databricks. Built with a fine-grained mixture-of-experts (MoE) architecture, it was the most capable open LLM at launch and excels at code, math, and language tasks.
Microsoft Phi-4 Mini
Microsoft Phi-4 Mini is a 3.8B parameter compact model from Microsoft. Delivers impressive reasoning capabilities for edge and mobile deployment scenarios, with strong performance on math and coding tasks relative to its size.
IBM Granite 3.0 8B Instruct
IBM Granite 3.0 8B Instruct is a lightweight enterprise-grade language model trained on a carefully curated enterprise corpus and optimized for RAG, summarization, classification, and code generation in business contexts.
OpenAI: GPT-3.5 Turbo
GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.
OpenAI: GPT-3.5 Turbo (older v0613)
GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.
Mistral Large
This is Mistral AI's flagship model, Mistral Large 2 (version `mistral-large-2407`). It's a proprietary weights-available model and excels at reasoning, code, JSON, chat, and more. Read the launch announcement [here](https://mistral.ai/news/mistral-large-2407/)....
Mistral: Mixtral 8x22B Instruct
Mistral's official instruct fine-tuned version of [Mixtral 8x22B](/models/mistralai/mixtral-8x22b). It uses 39B active parameters out of 141B, offering unparalleled cost efficiency for its size. Its strengths include strong math, coding,...
