DeepSeek
startupDeepSeek is a dominant force in the open-source AI community, founded in 2023 by Liang Wenfeng, who also founded and leads the quantitative hedge fund High-Flyer (幻方量化). High-Flyer, one of China's largest quant funds managing over $8B in assets, provides DeepSeek's funding and compute infrastructure. The company reportedly operates a cluster of ~10,000 NVIDIA A100 GPUs acquired before US export restrictions took full effect.
DeepSeek triggered a global sensation in January 2025 — the "DeepSeek moment" — when DeepSeek-R1 demonstrated reasoning performance rivaling OpenAI's o1 at a fraction of the training cost. The release briefly wiped nearly $1 trillion from US tech stocks and forced a reassessment of assumptions about AI scaling costs worldwide.
The lab has pioneered several key innovations: Multi-head Latent Attention (MLA) for memory-efficient inference, the DeepSeekMoE architecture with fine-grained expert segmentation, FP8 mixed-precision training at scale, Multi-Token Prediction, and GRPO (Group Relative Policy Optimization) for reinforcement learning on reasoning tasks. DeepSeek is notable for its "open-science" approach, publishing detailed technical reports and open-sourcing low-level training infrastructure including DeepGEMM, FlashMLA, DeepEP, and 3FS (Fire-Flyer File System).
DeepSeek's research was published in Nature in 2025 and the company has become a central case study in debates over AI efficiency, US-China technology competition, and the viability of open-weight frontier models.
People
- Liang Wenfeng — Founder (formerly High-Flyer (Founder & CEO))
- Chong Ruan OpenReview — Co-founder (departed early 2025; joined DeepRoute.ai Jan 2026)
- Daya Guo OpenReview — Former Core Researcher (R1 core author; departed early 2026)
- Zhenda Xie OpenReview — Core Researcher (architecture)
- Wangding Zeng OpenReview — Core Researcher (MLA architecture)
- Qihao Zhu OpenReview — Core Researcher
News
- 2026-03-28 DeepSeek Before V4: Culture, Organization, and Liang Wenfeng's Unique Goals (English summary) — LatePost (晚点)
- 2026-03-24 DeepSeek's Latest Job Postings Highlight Pivot to Agentic AI — Bloomberg