Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution
Vision-Language Models
Connected researchers
Yiheng Xu
Qwen
Yiheng Xu is a research scientist focused on multimodal AI, coding agents, and reasoning systems. His public profiles link him to Qwen research and later work at OpenAI, with publications spanning vision-language models and code generation.
Shuai Bai
Qwen
Senior algorithm expert at Alibaba Group working on large language models, multimodal large language models, and diffusion models.
Jiabo Ye
Qwen
Research scientist in Tongyi Lab whose public homepage and OpenReview profile describe work on large language models, multimodal learning, and visual grounding. His public profiles also list affiliations with Alibaba Group and East China Normal University.
Wei Ding
Qwen
Research scientist at Alibaba working on multimodal learning and generation; previously a postdoctoral researcher at Carnegie Mellon University.
Jun Tang
Qwen
Jun Tang works on multimodal foundation models, open-source language models, and agent systems. His personal site highlights work on Qwen and Qwen3-VL alongside related multimodal research.
Keqin Chen
Qwen
Researcher focused on large language models and multimodal learning, with public profiles linking Keqin Chen to Beihang University and to Qwen vision-language model work.
Zesen Cheng
Qwen
Qwen researcher and author on the Qwen2-VL and Qwen2.5-VL technical reports, with public profiles linking his work to multimodal and vision-language systems.
Xi Zhang
Qwen
Xi Zhang works on multimodal and vision-language model research. Public profiles connect him to Qwen2-VL and related open research projects.
Mingkun Yang
Qwen
Mingkun Yang works on multimodal large language models, embodied AI, and robotics. His public profile says he is a postdoc at Zhejiang University and a research scientist at Qwen.
Jianqiang Wan
Qwen
Research scientist in Alibaba DAMO Academy's Tongyi Lab working on multimodal learning, vision-language models, and embodied AI; author on the Qwen2-VL and Qwen2.5-VL technical reports.
Zhibo Yang
Qwen
Zhibo Yang works on multimodal and vision-language systems. Public profiles connect him to the Qwen2.5-VL technical report and to a personal GitHub account that links back to his own site.
Zheren Fu
Qwen
Tongyi Lab researcher working on large language models, vision-language models, and reinforcement learning; public profiles connect Zheren Fu to the Qwen2-VL technical report.
Tianbao Xie
Qwen
Research scientist on the Qwen team at Alibaba Group, focusing on foundation models and language agents. He received a PhD in computer science from the University of Illinois Urbana-Champaign.
Kai Dang
Qwen
Researcher on Alibaba's Qwen team focused on large language models and NLP, with public research profiles listing a Nankai University background.
Yuanzhi Zhu
Qwen
Yuanzhi Zhu is a Qwen researcher whose public work includes multimodal and audio-language models.
Peng Wang
Qwen
Researcher whose Google Scholar profile lists an affiliation with the Qwen team at Alibaba Group; coauthor of the Qwen and Qwen3 technical reports.
Wenbin Ge
Qwen
Research scientist in Tongyi Lab whose official profile highlights work on efficient reinforcement learning, generalization, inference-time scaling, and reasoning for large language models.
Shijie Wang
Qwen
Senior research scientist in Tongyi Lab whose official profile highlights post-training, AI for science, evaluation and alignment, multimodal reasoning, and large language model reasoning.
Haiyang Xu
Qwen
Independent researcher focused on multimodal learning, document intelligence, and efficient training; coauthor of Qwen2.5-VL and mPLUG-related vision-language systems.
Hang Zhang
Qwen
Researcher at Alibaba Group working on multimodal large language models; public profile and publication context connect Hang Zhang to the Qwen2-VL technical report.
Jialin Wang
Qwen
Research scientist in Tongyi Lab and contributor to Qwen2-VL, with public work on multimodal large language models.
Pengfei Wang
Qwen
Research scientist in Alibaba DAMO Academy's Tongyi Lab working on machine learning, computer vision, and multimodal large language models; author on the Qwen2-VL and Qwen2.5-VL technical reports.
Sibo Song
Qwen
Research scientist in Tongyi Lab and maintainer of Qwen-VL, with public work on vision-language models.
Xuejing Liu
Qwen
Xuejing Liu is a researcher whose public OpenReview profile lists the Qwen2-VL and Qwen2.5-VL technical reports.