updated 1 public sources
Multimodal Large Language ModelsVision-Language ModelsLong Video UnderstandingVisual Reasoning

Current frame

Ph.D. student in Artificial Intelligence at Renmin University of China working on multimodal large language models.

Extended note

According to his public homepage, Yifan Du is a Ph.D. student in Artificial Intelligence at Renmin University of China (2022-2027 expected) after completing a B.Sc. in Statistics at Shandong University (2018-2022). His homepage describes research on multimodal large language models, especially visual instruction tuning, long video understanding, and complex visual reasoning, and lists internships at ByteDance Seed, Baichuan, and Meituan.