Gemma 2 27B by Google is an open model built from the same research and technology used to create the [Gemini models](/models?q=gemini). Gemma models are well-suited for a variety of...

textinstructcheap

8,192 ctx$0.80/1M in

Explore specs and pricingView details →

Qwen 2.5 Coder 32B Instruct

qwen

textcodeinstruct

16,384 ctx$0.80/1M in

Explore specs and pricingView details →

Meta Llama 3 8B Instruct

meta-llama

Meta Llama 3 8B Instruct — Meta's Llama open-source language model, one of the most widely deployed open models.

textinstructcheap

8,192 ctx$0.20/1M in

Explore specs and pricingView details →

Qwen3 Next 80B A3b Instruct

qwen

Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without “thinking” traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual...

textcodereasoning

262,144 ctx$0.15/1M in

Explore specs and pricingView details →

Meta Llama 3 70B Instruct Turbo

meta-llama

Meta Llama 3 70B Instruct Turbo — Meta's Llama open-source language model, one of the most widely deployed open models.

textinstructcheap

8,192 ctx$0.88/1M in

Explore specs and pricingView details →

Meta Llama 3.1 8B Instruct Turbo

meta-llama

textinstructcheap

131,072 ctx$0.18/1M in

Explore specs and pricingView details →

Mistral (7B) Instruct v0.1

mistralai

textinstructcheap

32,768 ctx$0.20/1M in

Explore specs and pricingView details →

Llama 4 Maverick Instruct (17Bx128E) FP8

meta-llama

textinstructcheap

1,048,576 ctx$0.27/1M in

Explore specs and pricingView details →

Meta Llama 3.2 1B Instruct

meta-llama

textinstructcheap

131,072 ctx$0.06/1M in

Explore specs and pricingView details →

Nvidia Nemotron Nano 9B V2

nvidia

textcheaplong-context

131,072 ctx$0.06/1M in

Explore specs and pricingView details →

GLM 4.6 Fp8

zai-org

textcheaplong-context

202,752 ctx$0.60/1M in

Explore specs and pricingView details →

Deepseek Coder 33B Instruct

deepseek-ai

textcodeinstruct

16,384 ctx$0.80/1M in

Explore specs and pricingView details →

Llama 3.1 Nemotron 70B Instruct HF

nvidia

Llama 3.1 Nemotron 70B Instruct HF — Meta's Llama open-source language model, one of the most widely deployed open models.

textinstructcheap

32,768 ctx$0.88/1M in

Explore specs and pricingView details →

Qwen 2.5 14B Instruct

qwen

textinstructcheap

32,768 ctx$0.80/1M in

Explore specs and pricingView details →

Qwen 2 Instruct (1.5B)

qwen

textinstructcheap

32,768 ctx$0.02/1M in

Explore specs and pricingView details →

Qwen3 30B A3b

qwen

Qwen3, the latest generation in the Qwen large language model series, features both dense and mixture-of-experts (MoE) architectures to excel in reasoning, multilingual support, and advanced agent tasks. Its unique...

textreasoningagents

40,960 ctxFree in

Explore specs and pricingView details →

Glm 4.5 Air Fp8

zai-org

textcheaplong-context

131,072 ctx$0.20/1M in

Explore specs and pricingView details →

Qwen3 8B

qwen

Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math,...

textreasoningcheap

40,960 ctxFree in

Explore specs and pricingView details →

Meta Llama 3 8B Instruct Reference

meta-llama

textinstructcheap

8,192 ctx$0.20/1M in

Explore specs and pricingView details →

Rime Labs Arcana v2

rime-labs

textcheap

ctx$0.27/1M in

Explore specs and pricingView details →

DeepSeek R1 Distill Qwen 1.5B

deepseek-ai

textcheaplong-context

131,072 ctx$0.18/1M in

Explore specs and pricingView details →

Qwen3 235B A22B Thinking 2507 FP8

qwen

Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tasks. It activates 22B of its 235B parameters per forward pass and natively supports up to 262,144...

textreasoningcheap

262,144 ctx$0.65/1M in

Explore specs and pricingView details →

Multilingual E5 Large Instruct

intfloat

textmultilingualinstruct

514 ctx$0.02/1M in

Explore specs and pricingView details →

Qwen3-VL-8B-Instruct

qwen

Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interleaved-MRoPE for long-horizon...

textvisionreasoning

262,144 ctx$0.18/1M in

Explore specs and pricingView details →

Meta Llama 3.1 70B Instruct Turbo

meta-llama

textinstructcheap

131,072 ctx$0.88/1M in

Explore specs and pricingView details →

Nous Hermes 2 Mixtral 8X7B Dpo

nousresearch

textcheap

32,768 ctx$0.60/1M in

Explore specs and pricingView details →

Llama Guard 4 12B

meta-llama

Llama Guard 4 is a Llama 4 Scout-derived multimodal pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classify content in both LLM...

textvisioncheap

1,048,576 ctx$0.20/1M in

Explore specs and pricingView details →

Qwen3 Coder 30B A3b Instruct

qwen

Qwen3-Coder-30B-A3B-Instruct is a 30.5B parameter Mixture-of-Experts (MoE) model with 128 experts (8 active per forward pass), designed for advanced code generation, repository-scale understanding, and agentic tool use. Built on the...

textcodeagents

262,144 ctxFree in

Explore specs and pricingView details →

Mixtral-8x7B Instruct v0.1

mistralai

textinstructcheap

32,768 ctx$0.60/1M in

Explore specs and pricingView details →

LFM2-24B-A2B

liquidai

textcheap

32,768 ctx$0.03/1M in

Explore specs and pricingView details →

Qwen3 Next 80B A3b Thinking

qwen

Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured “thinking” traces by default. It’s designed for hard multi-step problems; math proofs, code synthesis/debugging, logic, and agentic...

textcodereasoning

262,144 ctx$0.15/1M in

Explore specs and pricingView details →

Arize AI Qwen 2 1.5B Instruct

arize-ai

textinstructcheap

32,768 ctx$0.10/1M in

Explore specs and pricingView details →