DeepSeek Research Papers: Complete Guide to DeepSeek AI Research

Last updated: May 10, 2026

DeepSeek Research has become one of the most closely watched areas in open-weight AI because it connects large language model architecture, efficient training, reasoning, coding, math, multimodal systems, and long-context engineering in a fast-moving public research stream. This guide collects the most important DeepSeek Research papers, technical reports, model cards, repositories, and release notes so readers can understand what to read first, how the papers connect, and where to find official sources.

DeepSeek’s verified Hugging Face organization describes DeepSeek as a Chinese company founded in 2023 and dedicated to AGI research, while its official model pages and repositories host many of the papers, model cards, weights, and implementation notes discussed in this guide.


Quick Answer: What Are the Most Important DeepSeek Research Papers?

The most important DeepSeek papers to start with are DeepSeek LLM, DeepSeekMoE, DeepSeekMath, DeepSeek-Coder, DeepSeek-V2, DeepSeek-V3, DeepSeek-R1, DeepSeek-V3.2, and the official DeepSeek-V4 technical materials. Together, these sources explain DeepSeek’s path from scaling open language models to MoE architectures, Multi-head Latent Attention, GRPO-based reasoning, FP8 training, long-context efficiency, agentic coding, mathematical reasoning, theorem proving, multimodal research, and OCR-based context compression.



What Is DeepSeek Research?

DeepSeek Research refers to the body of AI research, technical reports, model releases, and open model infrastructure published by DeepSeek-AI. It covers general-purpose language models, mixture-of-experts models, reasoning models, code models, math models, theorem-proving systems, vision-language models, OCR/context-compression research, and AI infrastructure.

The research line began with open language and code models such as DeepSeek LLM and DeepSeek-Coder, then moved into efficiency-focused architectures such as DeepSeekMoE, DeepSeek-V2, DeepSeek-V3, and later DeepSeek-V3.2 and DeepSeek-V4. DeepSeek’s model cards and reports repeatedly emphasize efficient scaling, sparse computation, long-context inference, reinforcement learning, and open model access.

The phrase “DeepSeek Research papers” usually refers to official arXiv papers, DeepSeek technical reports, Hugging Face model cards, GitHub repositories, and peer-reviewed publications associated with DeepSeek-AI. Some sources are full research papers, while others are release notes or model cards. This distinction matters because a model card can document usage, weights, license, chat templates, or deployment details without being a full peer-reviewed paper.


Why DeepSeek Research Papers Matter

DeepSeek papers matter for five practical reasons.

First, they show how frontier-style language model development can be made more compute-efficient through sparse architectures. DeepSeekMoE proposed fine-grained expert segmentation and shared expert isolation to improve expert specialization in MoE language models. DeepSeek-V2 then applied DeepSeekMoE together with Multi-head Latent Attention to reduce inference costs and KV-cache requirements while scaling to 236B total parameters with 21B activated per token.

Second, DeepSeek’s reasoning papers helped popularize reinforcement-learning-centered reasoning research. DeepSeekMath introduced Group Relative Policy Optimization, or GRPO, in the context of mathematical reasoning, and DeepSeek-R1 later used large-scale reinforcement learning to incentivize reasoning behaviors. The R1 work was also published in Nature, where the authors describe using reinforcement learning to elicit reasoning patterns such as self-reflection and verification without relying on human-labeled reasoning trajectories.

Third, DeepSeek papers are useful for engineers because many releases include weights, model cards, GitHub repositories, and deployment notes. The DeepSeek-R1 repository, for example, provides R1, R1-Zero, and distilled models, while the DeepSeek-V3 and DeepSeek-V4 model cards document architecture, model sizes, context length, and local-running pathways.

Fourth, DeepSeek research connects multiple AI domains. The same research family includes code intelligence through DeepSeek-Coder and DeepSeek-Coder-V2, mathematical reasoning through DeepSeekMath and DeepSeekMath-V2, theorem proving through DeepSeek-Prover, multimodal systems through DeepSeek-VL, DeepSeek-VL2, Janus, and Janus-Pro, and long-context compression through DeepSeek-OCR.

Fifth, the research is timely. As of this update, DeepSeek’s official API documentation lists DeepSeek-V4-Pro and DeepSeek-V4-Flash as supported API models, with the legacy deepseek-chat and deepseek-reasoner names scheduled for discontinuation on July 24, 2026.


Complete List of Important DeepSeek Research Papers

The table below focuses on official or high-authority sources. Some entries are full arXiv papers; others are official release notes or Hugging Face/GitHub model cards. When an item is not a conventional paper, it is labeled accordingly.

Paper / Technical ReportYearMain TopicModel or SystemWhy It MattersOfficial Source TypeBest For
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism2024General LLM scalingDeepSeek LLM 7B/67BEstablishes the early DeepSeek LLM line, scaling-law study, SFT/DPO chat models, and 2T-token pretraining dataset.arXiv + GitHubBeginners, LLM foundations
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models2024MoE architectureDeepSeekMoEIntroduces fine-grained expert segmentation and shared expert isolation, which become central to later DeepSeek MoE models.arXivArchitecture researchers
DeepSeek-Coder2024Code LLMsDeepSeek-Coder 1.3B–33BIntroduces open code models trained on project-level code data, with code generation and infilling focus.arXivSoftware engineers, coding agents
DeepSeekMath2024Math reasoningDeepSeekMath 7BIntroduces a math-focused model and GRPO, a key method later associated with DeepSeek reasoning work.arXiv + GitHubReasoning researchers, students
DeepSeek-VL2024Vision-language understandingDeepSeek-VLApplies DeepSeek research to real-world multimodal tasks such as screenshots, PDFs, OCR, charts, and scientific documents.arXiv + GitHubMultimodal engineers
DeepSeek-V22024Efficient MoE LLMDeepSeek-V2Combines DeepSeekMoE and Multi-head Latent Attention; supports 128K context and activates 21B of 236B parameters per token.arXiv + GitHubLLM infra teams
DeepSeek-Prover2024Formal theorem provingDeepSeek-ProverUses synthetic Lean 4 theorem-proving data to improve formal proof generation.arXivFormal math researchers
DeepSeek-Coder-V22024MoE code intelligenceDeepSeek-Coder-V2Continues from DeepSeek-V2 with code-focused training, 128K context, and support for hundreds of programming languages.arXiv + GitHubCoding model builders
DeepSeek-Prover-V1.52024 / 2025Lean theorem provingDeepSeek-Prover-V1.5Uses proof assistant feedback, reinforcement learning, and Monte Carlo tree search; published as an ICLR 2025 conference paper.arXiv + ICLRTheorem proving
Janus2024Unified multimodal understanding and generationJanusDecouples visual encoding for multimodal understanding and generation in a unified autoregressive framework.arXiv + GitHubMultimodal research
JanusFlow2024 / 2025Multimodal generationJanusFlowCombines autoregressive modeling with rectified flow; accepted in CVPR 2025 open access proceedings.arXiv + CVPRVision-generation researchers
DeepSeek-VL22024MoE vision-language modelDeepSeek-VL2Adds dynamic tiling and DeepSeekMoE/MLA-based language modeling for advanced multimodal understanding.arXiv + GitHubVLM builders
DeepSeek-V3 Technical Report2024 / 2025Efficient frontier-scale MoE LLMDeepSeek-V3Introduces a 671B-parameter MoE with 37B activated per token, MLA, DeepSeekMoE, auxiliary-loss-free balancing, multi-token prediction, and FP8 training.arXiv + GitHubLLM architecture and training teams
DeepSeek-R12025Reasoning via RLDeepSeek-R1, R1-Zero, distilled modelsA major reasoning-model release that studies RL-driven reasoning and distillation into smaller dense models.arXiv + GitHub + NatureReasoning researchers
Janus-Pro2025Multimodal understanding and generationJanus-ProScales and improves Janus with better training strategy, data, and model size.arXiv + Hugging Face/GitHubMultimodal product teams
DeepSeek-R1-05282025R1 updateDeepSeek-R1-0528Official R1 update with improved benchmarks, JSON output, function calling, and open weights.DeepSeek API docs + Hugging FaceAPI users, app developers
DeepSeek-V3.1 / V3.1-Terminus2025Hybrid thinking/non-thinking and agentsDeepSeek-V3.1Official release notes describe hybrid inference, faster thinking, and improved agent capabilities; Terminus addresses language consistency and agent feedback.DeepSeek API docs + Hugging FaceAgent builders
DeepSeek-V3.2-Exp2025Sparse attention experimentDeepSeek-V3.2-ExpIntroduces DeepSeek Sparse Attention as an experimental step toward more efficient long-context training and inference.DeepSeek API docs + GitHubLong-context researchers
DeepSeek-OCR2025Optical context compressionDeepSeek-OCRStudies visual-text compression for long-context processing using DeepEncoder and a DeepSeek3B-MoE decoder.arXiv + GitHubDocument AI, OCR, context compression
DeepSeekMath-V22025Self-verifiable math reasoningDeepSeekMath-V2Investigates LLM-based verification and proof generation for rigorous mathematical reasoning.arXivAdvanced math reasoning
DeepSeek-V3.22025Reasoning, DSA, agentsDeepSeek-V3.2 / V3.2-SpecialeAdds DSA, scalable RL, and agentic task synthesis; official sources describe it as reasoning-first and built for agents.arXiv + DeepSeek docs + Hugging FaceAgentic AI and reasoning teams
DeepSeek-V4 Technical Materials2026Million-token context, MoE, hybrid attentionDeepSeek-V4-Pro / V4-FlashOfficial V4 sources describe 1M context, V4-Pro at 1.6T total / 49B active parameters, V4-Flash at 284B total / 13B active, CSA/HCA hybrid attention, mHC, and Muon optimizer.DeepSeek docs + Hugging Face technical report/model cardLong-context AI, infra, agents

Best Papers to Start With

Start here if you are new to DeepSeek Research:

  1. DeepSeek LLM for the early model family and scaling philosophy.
  2. DeepSeekMoE for the sparse expert architecture.
  3. DeepSeek-V2 for MLA and efficient inference.
  4. DeepSeek-V3 for large-scale MoE training and FP8-oriented efficiency.
  5. DeepSeek-R1 for reinforcement-learning-based reasoning.
  6. DeepSeek-V3.2 for sparse attention, agentic training, and reasoning-first design.
  7. DeepSeek-V4 for the current million-token context direction.

Reader TypeStart WithThen ReadWhy This Order Works
BeginnerDeepSeek LLMDeepSeek-V2, DeepSeek-V3, DeepSeek-R1Moves from general LLM scaling to efficient MoE models and reasoning.
LLM architecture researcherDeepSeekMoEDeepSeek-V2, DeepSeek-V3, DeepSeek-V3.2, DeepSeek-V4Follows the architecture path from expert specialization to MLA, DSA, and V4 hybrid attention.
Reasoning/RL researcherDeepSeekMathDeepSeek-R1, DeepSeek-R1-0528, DeepSeek-V3.2, DeepSeekMath-V2Shows the progression from GRPO to large-scale RL, distillation, and verification-based reasoning.
Coding model engineerDeepSeek-CoderDeepSeek-Coder-V2, DeepSeek-R1, DeepSeek-V3.2, DeepSeek-V4Connects code pretraining, long context, reasoning, tool use, and agentic coding.
Math/theorem proving researcherDeepSeekMathDeepSeek-Prover, DeepSeek-Prover-V1.5, DeepSeekMath-V2Moves from informal math reasoning to Lean proof generation and self-verification.
Multimodal/OCR researcherDeepSeek-VLDeepSeek-VL2, Janus, Janus-Pro, DeepSeek-OCRCovers vision-language understanding, unified generation, and visual context compression.
Infrastructure teamDeepSeek-V2DeepSeek-V3, V3.2-Exp, V3.2, V4, DeepSeek GitHub infrastructure reposFocuses on model architecture, efficient attention, context length, kernels, and deployment.

DeepSeek Research Timeline

2023: DeepSeek begins releasing open model work, including early LLM and code-model repositories. The official DeepSeek LLM repository describes the 7B/67B model family and the 2T-token bilingual pretraining dataset.

January 2024: DeepSeek LLM and DeepSeekMoE appear on arXiv. DeepSeek LLM focuses on scaling open language models, while DeepSeekMoE focuses on expert specialization in mixture-of-experts language models.

January–February 2024: DeepSeek-Coder and DeepSeekMath are published. DeepSeek-Coder focuses on code intelligence, while DeepSeekMath introduces mathematical reasoning work and GRPO.

March 2024: DeepSeek-VL is released as a vision-language model for real-world multimodal understanding, including screenshots, PDFs, OCR, charts, and scientific documents.

May–June 2024: DeepSeek-V2 and DeepSeek-Coder-V2 extend the research line into efficient MoE inference and code-focused continued training.

August 2024: DeepSeek-Prover-V1.5 appears as an improvement over DeepSeek-Prover, using proof assistant feedback, reinforcement learning, and Monte Carlo tree search. It later appears as an ICLR 2025 conference paper.

December 2024: DeepSeek-V3 is released as a major MoE technical report with 671B total parameters, 37B activated per token, MLA, DeepSeekMoE, auxiliary-loss-free load balancing, multi-token prediction, and large-scale pretraining.

January 2025: DeepSeek-R1 is released, including R1-Zero, R1, and distilled dense models. The work centers on reinforcement learning for reasoning and distillation into smaller models.

January 2025: Janus-Pro is released as an improved Janus model for unified multimodal understanding and generation.

May 2025: DeepSeek-R1-0528 is released with official notes describing improved benchmark performance, reduced hallucinations, JSON output, function calling, and open weights.

September 2025: DeepSeek-V3.2-Exp is released as an experimental model built on V3.1-Terminus and introducing DeepSeek Sparse Attention for long-context efficiency research.

October–November 2025: DeepSeek-OCR and DeepSeekMath-V2 expand the research line into optical context compression and self-verifiable mathematical reasoning.

December 2025: DeepSeek-V3.2 and V3.2-Speciale are released. Official sources describe V3.2 as a reasoning-first model built for agents, with DSA, scalable RL, and a large-scale agentic task synthesis pipeline.

April 2026: DeepSeek-V4 Preview is released with V4-Pro and V4-Flash, official 1M context support, API availability, open weights, and technical materials on Hugging Face.


Key Technical Themes in DeepSeek Research

Mixture-of-Experts and DeepSeekMoE

Mixture-of-Experts, or MoE, is an architecture where only part of a large model is activated for each token. The goal is to increase total model capacity without paying the full compute cost of activating every parameter. DeepSeekMoE introduced a specialization-focused MoE design using fine-grained experts and shared experts. Later models such as DeepSeek-V2, DeepSeek-V3, V3.2, and V4 build heavily on sparse expert design.

Multi-head Latent Attention

Multi-head Latent Attention, or MLA, is one of DeepSeek’s central efficiency ideas. In plain English, MLA compresses the attention key-value cache into a smaller latent representation, which can reduce memory pressure during inference. DeepSeek-V2 introduced MLA together with DeepSeekMoE, and DeepSeek-V3 further validated that architecture at larger scale.

Efficient Training and FP8

DeepSeek-V3 is the key paper for efficient large-scale training. The report describes a 671B-parameter MoE model with 37B activated parameters per token and reports pretraining on 14.8T tokens. It also highlights efficiency-focused choices such as MLA, DeepSeekMoE, auxiliary-loss-free balancing, multi-token prediction, and stable large-scale training.

Reinforcement Learning and GRPO

GRPO appears in the DeepSeekMath paper as a reinforcement learning method for mathematical reasoning. DeepSeek-R1 later makes RL the center of the reasoning pipeline, with R1-Zero exploring reasoning behavior through RL before the more refined R1 pipeline adds cold-start data and multi-stage training. The Nature version of the R1 paper frames this as a demonstration that reasoning can be incentivized through reinforcement learning without human-labeled reasoning trajectories.

Reasoning Models and Distillation

DeepSeek-R1 is important not only because of its own reasoning capability, but because it shows how reasoning traces from a large model can be distilled into smaller dense models. The official R1 repository lists distilled checkpoints based on Qwen and Llama model families, giving researchers a way to study smaller reasoning models without running the full 671B-parameter MoE model.

Long-Context and Sparse Attention

DeepSeek’s long-context work evolves from MLA in V2 and V3 to DeepSeek Sparse Attention in V3.2-Exp and V3.2, then to V4’s million-token context technical materials. V3.2’s arXiv abstract identifies DSA as a key mechanism for reducing computational complexity while preserving long-context performance, while V4’s official model card describes a hybrid attention design that combines Compressed Sparse Attention and Heavily Compressed Attention.

Coding and Agentic Workflows

DeepSeek-Coder and DeepSeek-Coder-V2 provide the code-model foundation. DeepSeek-V3.1, V3.2, and V4 then move toward agentic use cases such as tool use, code agents, and long-horizon task execution. Official V3.2 sources describe “thinking in tool-use,” and V4 release notes emphasize agentic coding and integration with leading AI-agent workflows.

Mathematical Reasoning and Proof Generation

DeepSeekMath introduced a math-focused model and GRPO. DeepSeek-Prover and DeepSeek-Prover-V1.5 then extended this direction into Lean 4 theorem proving, synthetic formal data, proof assistant feedback, and tree search. DeepSeekMath-V2 continues the line by focusing on self-verifiable mathematical reasoning and LLM-based verification.

Multimodal, Vision-Language, and OCR Research

DeepSeek-VL and DeepSeek-VL2 focus on vision-language understanding. Janus, JanusFlow, and Janus-Pro address unified multimodal understanding and generation. DeepSeek-OCR takes a different route by studying whether visual representations can compress long text contexts through optical 2D mapping.


Technical Themes by Paper

Technical ThemeBest DeepSeek SourceWhat It ExplainsPractical Value
Scaling open LLMsDeepSeek LLMScaling laws, 7B/67B models, SFT/DPO chat modelsFoundation for understanding the early DeepSeek stack
Expert specializationDeepSeekMoEFine-grained expert segmentation and shared expertsEfficient model capacity expansion
KV-cache efficiencyDeepSeek-V2MLA and MoE for cheaper inferenceLong-context serving and lower memory pressure
FP8 and large MoE trainingDeepSeek-V3Stable training, MoE scale, multi-token predictionFrontier-scale training design
RL for reasoningDeepSeekMath, DeepSeek-R1GRPO, RL, R1-Zero, R1, distillationMath, coding, STEM, and chain-of-reasoning research
Tool use and agentsDeepSeek-V3.2Thinking in tool use and agentic task synthesisAgent workflows, code agents, tool-using models
Sparse attentionV3.2-Exp, V3.2DeepSeek Sparse AttentionLong-context efficiency
Million-token contextDeepSeek-V4CSA/HCA hybrid attention, 1M contextLarge-document, repository, and agent memory use cases
Formal proof generationDeepSeek-Prover, Prover-V1.5Lean 4, synthetic proof data, RL from proof assistant feedbackFormal math and theorem proving
Vision-language AIDeepSeek-VL, VL2, Janus-ProOCR, document understanding, multimodal generationMultimodal products
Optical compressionDeepSeek-OCRVision-text compression for long contextsDocument AI and context compression

DeepSeek Research Papers by Use Case

If You Want To…Read FirstThen ReadWhy
Understand DeepSeek reasoning modelsDeepSeekMathDeepSeek-R1, Nature R1 paper, DeepSeekMath-V2Shows GRPO, RL-based reasoning, and self-verification.
Build coding agentsDeepSeek-CoderDeepSeek-Coder-V2, DeepSeek-R1, V3.2, V4Moves from code pretraining to agentic reasoning and tool use.
Study efficient LLM trainingDeepSeekMoEDeepSeek-V2, DeepSeek-V3, V4Explains sparse experts, MLA, FP8-oriented training, and hybrid attention.
Study long contextDeepSeek-V2V3.2-Exp, V3.2, V4, DeepSeek-OCRCovers MLA, DSA, 1M context, and visual context compression.
Study math reasoningDeepSeekMathDeepSeek-R1, DeepSeekMath-V2Follows math RL from answer-based reasoning to verification.
Study theorem provingDeepSeek-ProverProver-V1.5, DeepSeekMath-V2Focuses on Lean, proof assistant feedback, and proof verification.
Build multimodal appsDeepSeek-VLDeepSeek-VL2, Janus-Pro, DeepSeek-OCRCovers document understanding, OCR, image understanding, and generation.
Run models locallyGitHub + Hugging Face model cardsV3, R1, V3.2, V4 model cardsModel cards explain weights, chat templates, supported runtimes, and license details.

Official Sources for DeepSeek Research Papers

Use these source categories when verifying DeepSeek Research papers:

1. DeepSeek official website and API docs
DeepSeek release notes and API documentation are the best source for current model availability, API model names, deprecations, context length, and release dates. The V4 release page, for example, lists V4-Pro and V4-Flash, official 1M context support, open weights, and the V4 technical report link.

2. DeepSeek GitHub repositories
GitHub repositories often include paper links, quick-start instructions, model downloads, licensing notes, and citations. They are especially useful for DeepSeek LLM, V2, V3, R1, Coder-V2, VL, VL2, Janus, and OCR.

3. DeepSeek verified Hugging Face organization
Hugging Face is important for official model cards, collections, weights, downloads, inference examples, and technical summaries. The verified organization lists collections such as DeepSeek-V4 and DeepSeek-OCR.

4. arXiv
arXiv is the main source for most DeepSeek technical papers, including DeepSeek LLM, DeepSeekMoE, DeepSeekMath, DeepSeek-V2, DeepSeek-Coder-V2, DeepSeek-V3, DeepSeek-R1, DeepSeek-V3.2, DeepSeek-OCR, and DeepSeekMath-V2.

5. Peer-reviewed or conference sources
Some DeepSeek-related research appears in peer-reviewed or conference venues. DeepSeek-R1 has a Nature article, DeepSeek-Prover-V1.5 appears as an ICLR 2025 conference paper, and JanusFlow appears in CVPR 2025 open access proceedings.

Before citing or using a DeepSeek model, verify:

  • Is the source from deepseek-ai on GitHub or Hugging Face?
  • Is there an arXiv paper or official technical report?
  • Is the release note on official DeepSeek API docs?
  • Does the model card list the license and usage restrictions?
  • Is the benchmark self-reported, third-party, or peer-reviewed?
  • Is the model a full release, preview, experiment, distilled model, or community quantization?
  • Does the source include the current chat template, context length, and deployment instructions?

Common Misconceptions About DeepSeek Research

DeepSeek Research vs. the DeepSeek Chatbot

DeepSeek Research is broader than the chatbot. The chatbot is an application interface for certain models, while the research includes papers, model weights, repositories, model cards, API releases, infrastructure libraries, and training/inference techniques.

Papers vs. Model Cards vs. GitHub Repositories

A paper explains the research method. A model card explains the released model, license, usage, limitations, and deployment guidance. A GitHub repository may include code, model links, examples, and citations. For DeepSeek, all three can be important.

Open Weight vs. Fully Open Source

A model can have public weights without every part of the training pipeline being open. Always check the license, code license, model license, data disclosure, and intended-use notes. DeepSeek often publishes weights and code, but not every release has the same level of openness or documentation.

Benchmarks vs. Real-World Performance

DeepSeek papers and model cards include many benchmark claims, but benchmark performance is not the same as production reliability. Treat official benchmark tables as vendor-reported unless a third-party or peer-reviewed source independently verifies them. This is especially important for comparisons against closed-source models.

DeepSeek-V4 Is Not Just “A Bigger V3”

DeepSeek-V4 official materials describe new architectural and optimization ideas, including hybrid CSA/HCA attention, mHC, Muon optimizer, 1M context, and two model sizes. It should be read as a new technical direction, not merely as a larger checkpoint.


How to Cite DeepSeek Research Papers

For academic work, cite the latest official arXiv version, conference version, or journal version. Do not fabricate BibTeX. Use the citation block from the official arXiv page, GitHub repository, Hugging Face model card, Nature article, ICLR page, or CVPR page.

General guidance:

  • For arXiv papers: cite the arXiv ID and version you used.
  • For GitHub repositories: cite the repository, commit hash if relevant, and access date.
  • For Hugging Face model cards: cite the model name, organization, revision if relevant, and access date.
  • For API release notes: cite the official DeepSeek API documentation page and access date.
  • For peer-reviewed work: cite the published Nature, ICLR, or CVPR version when available.

Example citation formats:

DeepSeek-AI. “DeepSeek-V3 Technical Report.” arXiv:2412.19437, 2024/2025.
DeepSeek-AI. “DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning.” arXiv:2501.12948, 2025.
DeepSeek-AI. “DeepSeek-V4-Pro.” Hugging Face model card, accessed May 10, 2026.
DeepSeek-AI. “DeepSeek V4 Preview Release.” DeepSeek API Docs, April 24, 2026.

FAQs About DeepSeek Research Papers

What is DeepSeek Research?

DeepSeek Research is the collection of AI papers, technical reports, model cards, repositories, and release notes published by DeepSeek-AI. It covers LLMs, MoE models, reasoning models, code models, math models, theorem proving, multimodal models, OCR, long-context systems, and AI infrastructure.

Where can I find DeepSeek Research papers?

The best sources are official DeepSeek GitHub repositories, the verified DeepSeek Hugging Face organization, arXiv, DeepSeek API release notes, and peer-reviewed venues such as Nature, ICLR, and CVPR when applicable.

What is the most important DeepSeek paper?

For architecture, start with DeepSeek-V2 and DeepSeek-V3. For reasoning, start with DeepSeekMath and DeepSeek-R1. For the latest long-context direction, read DeepSeek-V3.2 and the official DeepSeek-V4 technical materials.

Is DeepSeek-R1 a research paper?

Yes. DeepSeek-R1 has an arXiv paper, official GitHub repository, Hugging Face model cards, and a Nature publication discussing reinforcement learning for reasoning.

Is DeepSeek-V3 open source?

DeepSeek-V3 has public checkpoints and an official GitHub repository, and the arXiv report states that checkpoints are available. For exact usage permissions, check the current repository and model-card license before deployment.

What is GRPO in DeepSeek research?

GRPO stands for Group Relative Policy Optimization. It appears in the DeepSeekMath paper as a reinforcement learning method for improving mathematical reasoning and is part of the broader DeepSeek reasoning research path.

What is the difference between DeepSeek-V3 and DeepSeek-R1?

DeepSeek-V3 is primarily a large MoE base/chat model technical report focused on efficient architecture and training. DeepSeek-R1 is a reasoning-focused model family trained with reinforcement learning on top of DeepSeek-V3-Base, with R1-Zero, R1, and distilled models.

What is DeepSeek-V4?

DeepSeek-V4 is DeepSeek’s 2026 model series. Official sources list V4-Pro and V4-Flash, both supporting 1M context. The Hugging Face model card describes V4-Pro as a 1.6T-parameter MoE with 49B activated parameters and V4-Flash as a 284B-parameter MoE with 13B activated parameters.

Are DeepSeek papers peer-reviewed?

Many DeepSeek papers are on arXiv, which is not peer review by itself. Some associated work has appeared in peer-reviewed or conference venues, including the Nature article for DeepSeek-R1, ICLR 2025 for DeepSeek-Prover-V1.5, and CVPR 2025 for JanusFlow.

What should I read first?

Start with DeepSeek LLM, DeepSeekMoE, DeepSeek-V2, DeepSeek-V3, and DeepSeek-R1. Then choose a specialized path: Coder for code, Math/Prover for math, VL/Janus for multimodal, V3.2/V4 for long context and agents.

Can I run DeepSeek models locally?

Some DeepSeek models can be run locally, but the largest MoE models require substantial hardware or specialized inference stacks. Official Hugging Face model cards and GitHub repositories provide deployment guidance, including examples for Transformers, vLLM, SGLang, Docker, and quantized variants where available.

How often are DeepSeek papers updated?

DeepSeek releases move quickly. The official change log shows major updates across 2024, 2025, and 2026, including V3, R1, R1-0528, V3.1, V3.2-Exp, V3.2, and V4.


Conclusion

DeepSeek Research is best understood as a connected research program rather than a single paper or model. The core path starts with DeepSeek LLM, moves through DeepSeekMoE, DeepSeek-V2, and DeepSeek-V3, then branches into DeepSeek-R1 for reasoning, DeepSeek-Coder for programming, DeepSeekMath and DeepSeek-Prover for mathematics, DeepSeek-VL/Janus for multimodal systems, DeepSeek-OCR for visual context compression, and DeepSeek-V3.2/V4 for long-context and agentic AI.

For most readers, the best reading order is: DeepSeek LLM → DeepSeekMoE → DeepSeek-V2 → DeepSeek-V3 → DeepSeek-R1 → DeepSeek-V3.2 → DeepSeek-V4. After that, follow the specialized track that matches your work: coding, math, theorem proving, multimodal AI, OCR, or infrastructure.

Bookmark this guide and revisit it whenever DeepSeek publishes a new model, paper, model card, or technical report.