Atlas / Fields / Detail
Audio Language Models
Researchers connected to this field in the public atlas.
Nan Duan
Alibaba Qwen
Nan Duan is head of foundation model post-training at Qwen and a vice president at Alibaba Group. His public profile highlights work on foundation models and natural language processing, after earlier research leadership at Microsoft Research.
Zhen Ye
Alibaba Qwen
Zhen Ye is a researcher in the Qwen team at Alibaba Cloud. His public profile notes a PhD in computer science from the University of Massachusetts Amherst and research interests in natural language understanding, generation, and reasoning.
Chao Zhang
Alibaba Qwen
Chao Zhang is an applied scientist in the Alibaba Foundation Model team. His public profile notes a PhD in computer science from the University of Illinois Urbana-Champaign and research interests in NLP, large language models, reasoning, and multimodal generation.
Zhengyuan Liu
Alibaba Qwen
Zhengyuan Liu is a research scientist at Alibaba Group and a PhD student at the National University of Singapore. His public profile highlights work in natural language processing, vision-language models, and grounding.
Jiaqi Wang
Alibaba Qwen
Jiaqi Wang works on machine learning, multimodal large language models, and AI for healthcare. Public profiles connect him to the Qwen2-Audio technical report.
Shen Gao
Alibaba Qwen
Shen Gao is a PhD student at Zhejiang University working on multimedia and large language models. His public profiles connect him to Qwen2-Audio and related multimodal systems including OmniParser.
Weiqiang Wang
Alibaba Qwen
Weiqiang Wang is a PhD student working on multimedia and multimodal AI. Public profiles connect him to the Qwen2-Audio technical report and related research.
Tianyu Liu
Moonshot AI / Alibaba Qwen
Principal scientist at Moonshot AI working on multimodal large models.
Yeyun Gong
Alibaba Qwen
Yeyun Gong is a researcher and engineering leader focused on multimodal large language models, grounding, and large-scale knowledge systems. His homepage lists selected work including Qwen2-Audio.
Jie Tang
OpenAI / Alibaba Qwen
OpenAI contributor credited on the GPT-4 Technical Report; previously a Dropbox engineer and a Ph.D. student at UC Berkeley focused on machine learning and robotics.
Hongning Wang
Alibaba Qwen
Associate professor at the University of Virginia and Qwen contributor whose research focuses on personalization and recommender systems, online advertising, and AI systems.
Yongqiang Wang
Alibaba Qwen
Research scientist at Alibaba working on speech processing, multimodal learning, natural language processing, and efficient human-computer interaction.
Xian-Sheng Hua
Alibaba Qwen
Xian-Sheng Hua is a computer vision and multimodal AI researcher known for work in visual recognition, multimedia understanding, and large AI systems. Public profiles tie him to Alibaba DAMO Academy and related academic service roles.
Jingren Zhou
Moonshot AI / Alibaba Qwen
Alibaba senior technology leader and researcher associated with Qwen. Public profiles list him with Alibaba Group, and official Alibaba Cloud coverage identifies him as a chief technology officer leading large-model work.
Xiaoyong Du
Alibaba Qwen
Xiaoyong Du works on multimodal large language models and language agents, with public profile text highlighting omni models, visual agents, and GUI agents. His homepage explicitly identifies him with Qwen.
Zhifeng Chen
Google Gemini / Z.ai
Distinguished software engineer at Google Brain focused on large-scale computer systems and machine learning applications.
An Yang
Alibaba Qwen
Alibaba researcher working on large language models and multimodal pretraining; public research profiles connect An Yang to Qwen-related work and earlier study at Peking University.
Yuanzhi Zhu
Alibaba Qwen
Yuanzhi Zhu is a Qwen researcher whose public work includes multimodal and audio-language models.
Shijie Wang
Alibaba Qwen
Senior research scientist in Tongyi Lab whose official profile highlights post-training, AI for science, evaluation and alignment, multimodal reasoning, and large language model reasoning.
Mingyang Shang
Alibaba Qwen
Research intern at Alibaba Group focused on multimodal understanding and generation, large multimodal models, and reinforcement learning; coauthor of Qwen2-Audio.
Qingyang Zhang
Alibaba Qwen
Second-year PhD student at Peking University focused on audio-language foundation models, trustworthy AI, and embodied AI; coauthor of Qwen2-Audio.
Yaqi Wang
Alibaba Qwen
Research scientist in Tongyi Lab and technical lead of Qwen2-Audio, with public work on audio-language models.
Yinghao Li
Alibaba Qwen
Machine learning engineer and researcher interested in large language models and multimodal audio-language systems; coauthor of Qwen2-Audio.
Yongqi Wang
Alibaba Qwen
Research scientist in Tongyi Lab whose public profile highlights work on speech processing, machine learning, and multimodal large language models.
Yushi Hu
Alibaba Qwen
Yushi Hu is a senior research engineer at Shanghai AI Laboratory and a founding member of OpenMMLab. Public arXiv records also list him as a coauthor of Qwen2-Audio.
Hongyin Luo
Alibaba Qwen
Researcher whose arXiv author results include Qwen-Audio and related audio-language modeling work.
Mengzhe Chen
Alibaba Qwen
Research assistant at CUHK-Shenzhen focused on multimodal learning, efficient adaptation, alignment, and reinforcement learning; coauthor of Qwen2-Audio.
Mingjie Li
Z.ai
Research scientist at Z.ai focused on multimodal large language models, speech interaction, and large language models. He received a bachelor's degree from Tsinghua University and a master's degree from Columbia University.
Na Cao
Z.ai
Research scientist at Z.ai focused on multimodal large language models, speech interaction, and large language models. Her work includes pre-training, post-training, and evaluation of multimodal and speech models.
Shuang Ma
Z.ai
Research scientist at Z.ai focused on multimodal large language models, speech interaction, and large language models. She works on pre-training, post-training, and evaluation of multimodal and speech models.
Yi Ma
Z.ai
Research scientist at Z.ai focused on multimodal large language models, speech interaction, and large language models. He received a bachelor's degree from Shanghai Jiao Tong University and a master's degree from Columbia University.
Yimin Wang
Z.ai
Research scientist at Z.ai focused on multimodal understanding and generation, large language models, and speech interaction. He received a bachelor's degree from Tsinghua University and a master's degree from Columbia University.
Yujie He
Z.ai
Research scientist at Z.ai focused on multimodal large language models, speech interaction, and large language models. His work includes pre-training, post-training, and evaluation of multimodal and speech models.
Zehan Wang
Z.ai
Research scientist at Z.ai focused on multimodal understanding and generation, large language models, and speech interaction. He received a bachelor's degree from Tsinghua University and a master's degree from the University of California, San Diego.
Zejun Ma
Alibaba Qwen
PhD student at The Chinese University of Hong Kong focused on speech language understanding, audio-language multimodal learning, and efficient model adaptation; coauthor of Qwen2-Audio.