Chunyuan Li | Researcher in multimodal intelligence and large-scale vision-language training

updated 2 public sources

multimodal intelligencevision-language modelscomputer visiondeep generative models

Current frame

Researcher in multimodal intelligence and large-scale vision-language training

Extended note

According to his public homepage, Chunyuan Li focuses on multimodal intelligence with an emphasis on large-scale language and vision training. His homepage highlights contributions including LLaVA and earlier work such as GroundingDINO, GLIP, GLIGEN, Florence, and Oscar. The same page says he has held research roles at xAI, ByteDance, and Microsoft Research, Redmond, and that he earned a PhD in machine learning from Duke University under Lawrence Carin, where his doctoral work explored deep generative models. OpenReview also links his homepage and Google Scholar profile and lists Duke University PhD study from 2014 to 2018.