trending
Zac Zuo

11d ago

DeepSeek-V4 - The open-source era of 1M context intelligence

DeepSeek-V4 Preview is a new series of highly efficient MoE language models, featuring V4-Pro (1.6T params) and V4-Flash (284B params). Both models support a 1 million token context window by default, utilizing a novel hybrid attention architecture to drastically reduce compute and memory costs.
Zac Zuo

7mo ago

DeepSeek-OCR - Read documents like an image

DeepSeek-OCR is a model that compresses long text by treating it as an image. This optical compression uses far fewer vision tokens to represent documents, unlocking new levels of efficiency for long-context tasks while delivering powerful OCR capabilities.
Ankit Sharma

5mo ago

DeepSeek-V3.2 - Reasoning-first models built for agents

We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Ankit Sharma

9mo ago

DeepSeek-V3.1 - Our first step toward the agent era

DeepSeek-V3.1 is a hybrid model that supports both thinking mode and non-thinking mode.
Jim Engine

5mo ago

DeepSeek V3.2 - Open-Source LLM matching GPT-5

DeepSeek V3.2 and V3.2-Speciale are breakthrough open-source AI models from China, built to rival top closed models like OpenAI’s GPT-5 and Google Gemini 3 Pro. Delivering gold-level results in both math and programming benchmarks, DeepSeek V3.2-Speciale even outperforms GPT-5 on AIME 2025. With efficient architecture and exceptional reasoning abilities, these models are perfect for both enthusiasts and enterprise applications
Zac Zuo

7mo ago

DeepSeek-V3.1-Terminus - A refined agentic model for developers

DeepSeek-V3.1-Terminus is the latest update to the DeepSeek-V3.1 model. This "Terminus" version focuses on stability and refinement, fixing issues like language mixing and improving agent capabilities, while retaining the core strengths of the V3.1 series.
Zac Zuo

5mo ago

DeepSeekMath-V2 - IMO Gold level reasoning, fully open.

DeepSeekMath-V2 is a new open-source model specialized in mathematical reasoning. It introduces a self-verification mechanism where the model acts as both generator and verifier to refine its own proofs. It achieved Gold-level scores in IMO 2025 and a near-perfect 118/120 in Putnam 2024.
Chris Messina

1yr ago

DeepSeek for iOS - Your Al assistant powered by DeepSeek-V3

Powered by the groundbreaking DeepSeek-V3 model with over 600B parameters, this state-of-the-art AI leads global standards and matches top-tier international models across multiple benchmarks.
Zac Zuo

7mo ago

DeepSeek-V3.2-Exp - Long-context efficiency with DeepSeek Sparse Attention

DeepSeek-V3.2-Exp is a new experimental model introducing DeepSeek Sparse Attention (DSA). This new architecture boosts long-context efficiency for training and inference while maintaining the performance of V3.1-Terminus. API prices have been cut by over 50%.
Zac Zuo

11mo ago

DeepSeek-R1-0528 - New open-source LLM that rivals o3 in coding & reasoning

DeepSeek's new R1-0528 open-source LLM reportedly rivals OpenAI's o3 in coding & reasoning. Features a long context window & improved long-text accuracy.
12
Next
Last