Atlas / Reports / Detail
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Vision-Language Models
Connected researchers
Runxin Xu
DeepSeek
Researcher at DeepSeek whose public homepage describes work on DeepSeek R1, V1, V2, V3, Math, Coder, and mixture-of-experts systems.
Yuxuan Ren
DeepSeek
Researcher focused on multimodal generative models and reinforcement learning; currently at ByteDance Seed and previously at DeepSeek.
Yuqing Wang
DeepSeek
Research intern at DeepSeek and PhD student at Princeton University whose research interests include large language models and multimodal foundation models.
Dejian Yang
DeepSeek
DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.
Jiaming Guo
DeepSeek
PhD student at The Chinese University of Hong Kong focused on multimodal reasoning, optical character recognition, and document parsing; coauthor of DeepSeek-VL.