trending

2mo ago

DeepSeek-V4 - The open-source era of 1M context intelligence

DeepSeek-V4 Preview is a new series of highly efficient MoE language models, featuring V4-Pro (1.6T params) and V4-Flash (284B params). Both models support a 1 million token context window by default, utilizing a novel hybrid attention architecture to drastically reduce compute and memory costs.

8mo ago

DeepSeek-OCR - Read documents like an image

DeepSeek-OCR is a model that compresses long text by treating it as an image. This optical compression uses far fewer vision tokens to represent documents, unlocking new levels of efficiency for long-context tasks while delivering powerful OCR capabilities.

10mo ago

DeepSeek-V3.1 - Our first step toward the agent era

DeepSeek-V3.1 is a hybrid model that supports both thinking mode and non-thinking mode.

7mo ago

DeepSeek-V3.2 - Reasoning-first models built for agents

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

7mo ago

DeepSeek V3.2 - Open-Source LLM matching GPT-5

DeepSeek V3.2 and V3.2-Speciale are breakthrough open-source AI models from China, built to rival top closed models like OpenAI’s GPT-5 and Google Gemini 3 Pro. Delivering gold-level results in both math and programming benchmarks, DeepSeek V3.2-Speciale even outperforms GPT-5 on AIME 2025. With efficient architecture and exceptional reasoning abilities, these models are perfect for both enthusiasts and enterprise applications

1yr ago

DeepSeek for iOS - Your Al assistant powered by DeepSeek-V3

Powered by the groundbreaking DeepSeek-V3 model with over 600B parameters, this state-of-the-art AI leads global standards and matches top-tier international models across multiple benchmarks.

9mo ago

DeepSeek-V3.1-Terminus - A refined agentic model for developers

DeepSeek-V3.1-Terminus is the latest update to the DeepSeek-V3.1 model. This "Terminus" version focuses on stability and refinement, fixing issues like language mixing and improving agent capabilities, while retaining the core strengths of the V3.1 series.

9mo ago

DeepSeek-V3.2-Exp - Long-context efficiency with DeepSeek Sparse Attention

DeepSeek-V3.2-Exp is a new experimental model introducing DeepSeek Sparse Attention (DSA). This new architecture boosts long-context efficiency for training and inference while maintaining the performance of V3.1-Terminus. API prices have been cut by over 50%.

7mo ago

DeepSeekMath-V2 - IMO Gold level reasoning, fully open.

DeepSeekMath-V2 is a new open-source model specialized in mathematical reasoning. It introduces a self-verification mechanism where the model acts as both generator and verifier to refine its own proofs. It achieved Gold-level scores in IMO 2025 and a near-perfect 118/120 in Putnam 2024.

1yr ago

DeepSeek-R1-0528 - New open-source LLM that rivals o3 in coding & reasoning

DeepSeek's new R1-0528 open-source LLM reportedly rivals OpenAI's o3 in coding & reasoning. Features a long context window & improved long-text accuracy.
12
Next
Last