modelstop.top

AI Model Catalogue

Browse 9 models across providers, modalities, and use cases.

Long Context

9 models · Page 1 of 1

DeepSeek: DeepSeek V3

deepseek

DeepSeek-V3 is the latest model from the DeepSeek team, building upon the instruction following and coding abilities of the previous versions. Pre-trained on nearly 15 trillion tokens, the reported evaluations...

text · code · cheap
Run locally
131,072 ctx · $0.32/1M input tokens

DeepSeek: R1 Distill Llama 70B

deepseek

DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). The model combines advanced distillation techniques to achieve high performance across...

text · instruct · cheap
Run locally
131,072 ctx · $0.70/1M input tokens

DeepSeek: R1 Distill Qwen 32B

deepseek

DeepSeek R1 Distill Qwen 32B is a distilled large language model based on [Qwen 2.5 32B](https://huggingface.co/Qwen/Qwen2.5-32B), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). It outperforms OpenAI's o1-mini across various benchmarks, achieving new...

text · cheap · long-context
Run locally
128,000 ctx · $0.29/1M input tokens

DeepSeek: DeepSeek V3 0324

deepseek

DeepSeek V3, a 685B-parameter mixture-of-experts model, is the latest iteration of the flagship chat model family from the DeepSeek team. It succeeds the [DeepSeek V3](/deepseek/deepseek-chat-v3) model and performs really well...

text · cheap · long-context
163,840 ctx · $0.20/1M input tokens

DeepSeek: R1 0528

deepseek

May 28th update to the [original DeepSeek R1](/deepseek/deepseek-r1). Performance is on par with [OpenAI o1](/openai/o1), but the model is open-sourced and has fully open reasoning tokens. It's 671B parameters in size, with 37B active...

text · reasoning · cheap
Run locally
163,840 ctx · $0.45/1M input tokens

DeepSeek: DeepSeek V3.1 Terminus

deepseek

DeepSeek-V3.1 Terminus is an update to [DeepSeek V3.1](/deepseek/deepseek-chat-v3.1) that maintains the model's original capabilities while addressing issues reported by users, including language consistency and agent capabilities, further optimizing the model's...

text · agents · cheap
163,840 ctx · $0.21/1M input tokens

DeepSeek: DeepSeek V3.2 Exp

deepseek

DeepSeek-V3.2-Exp is an experimental large language model released by DeepSeek as an intermediate step between V3.1 and future architectures. It introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism...

text · cheap · long-context
163,840 ctx · $0.27/1M input tokens

DeepSeek: DeepSeek V3.2

deepseek

DeepSeek-V3.2 is a large language model designed to harmonize high computational efficiency with strong reasoning and agentic tool-use performance. It introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism...

text · reasoning · agents
163,840 ctx · $0.26/1M input tokens

DeepSeek: DeepSeek V3.2 Speciale

deepseek

DeepSeek-V3.2-Speciale is a high-compute variant of DeepSeek-V3.2 optimized for maximum reasoning and agentic performance. It builds on DeepSeek Sparse Attention (DSA) for efficient long-context processing, then scales post-training reinforcement learning...

text · reasoning · agents
163,840 ctx · $0.40/1M input tokens