Atlas / Fields / Detail
Speech and Audio Models
Researchers connected to this field in the public atlas.
Baosong Yang
Alibaba Qwen
Senior Algorithm Expert in Alibaba Tongyi Lab's Language Technology Lab and Qwen Team member whose work focuses on multilingual large language models and machine translation.
Xinyu Zhang
Alibaba Qwen
Research scientist at Tongyi Lab, Alibaba Group, working on multimodal large language models, machine reasoning, and efficient learning.
Junyang Lin
Alibaba Qwen
Junyang Lin (Justin Lin) is a researcher and open-source maintainer known for the Qwen family of models. His public profiles list interests in LLMs, AI agents, multimodal learning, long-horizon reasoning, world models, and reinforcement learning; multiple March 2026 news reports said he stepped down from the Qwen tech lead role.
Jin Xu
Alibaba Qwen
Jin Xu's homepage says he leads the audio group at Qwen Team, Alibaba, working on audio understanding, real-time multimodal interaction, speech synthesis, general audio synthesis, and audio-centered chat models. He previously completed a Ph.D. at IIIS, Tsinghua University and received a BSc in 2018 from Beijing University of Posts and Telecommunications.
Jingren Zhou
MiniMax / Moonshot AI
Jingren Zhou is Chief Technology Officer of Alibaba Cloud. Public speaker biographies describe him as a computer scientist and entrepreneur whose work includes large-scale AI and cloud systems.
Furu Wei
Microsoft
Furu Wei is a Distinguished Scientist and Chief Scientist of Microsoft Research Asia, listed on Microsoft Research and connected in LLMpeople to Microsoft technical reports including Kosmos, VALL-E, BitNet, and Multilingual E5.
Pei Zhang
Alibaba Qwen
Alibaba Qwen report author whose DBLP profile identifies an Alibaba Group affiliation and Qwen technical report authorship.
Jinyu Li
Microsoft
Jinyu Li is a report-backed author in the LLMpeople atlas, connected through 3 technical reports.
Long Zhou
Microsoft
Long Zhou is a report-backed author in the LLMpeople atlas, connected through 3 technical reports.
Sanyuan Chen
Microsoft
Sanyuan Chen is a report-backed author in the LLMpeople atlas, connected through 3 technical reports.
Sheng Zhao
Microsoft
Sheng Zhao is a report-backed author in the LLMpeople atlas, connected through 3 technical reports.
Shujie Liu
Microsoft
Shujie Liu is a report-backed author in the LLMpeople atlas, connected through 3 technical reports.
Yanqing Liu
Microsoft
Yanqing Liu is a report-backed author in the LLMpeople atlas, connected through 3 technical reports.
Chengyi Wang
Microsoft
Chengyi Wang is a report-backed author in the LLMpeople atlas, connected through 2 technical reports.
Hongkun Hao
Alibaba Qwen
Hongkun Hao is a report-backed author in the LLMpeople atlas, connected through 2 technical reports.
Huaming Wang
Microsoft
Huaming Wang is a report-backed author in the LLMpeople atlas, connected through 2 technical reports.
Lei He
Microsoft
Lei He is a report-backed author in the LLMpeople atlas, connected through 2 technical reports.
Manu Orsini
Kyutai
Researcher at Kyutai and coauthor of the Moshi: a speech-text foundation model for real-time dialogue.
Xiong Wang
Alibaba Qwen
Xiong Wang is a report-backed author in the LLMpeople atlas, connected through 2 technical reports.
Yu Wu
Microsoft
Yu Wu is a report-backed author in the LLMpeople atlas, connected through 2 technical reports.
Zhifang Guo
Alibaba Qwen
Zhifang Guo is a report-backed author in the LLMpeople atlas, connected through 2 technical reports.
Zhuo Chen
Microsoft
Zhuo Chen is a report-backed author in the LLMpeople atlas, connected through 2 technical reports.
Ziqiang Zhang
Microsoft
Ziqiang Zhang is a report-backed author in the LLMpeople atlas, connected through 2 technical reports.
Zishan Guo
Alibaba Qwen
Zishan Guo is a report-backed author in the LLMpeople atlas, connected through 2 technical reports.
Yongqi Wang
Alibaba Qwen
Research scientist in Tongyi Lab whose public profile highlights work on speech processing, machine learning, and multimodal large language models.
Alexandre Défossez
Kyutai
Alexandre Défossez is a report-backed author in the LLMpeople atlas, connected through Continuous Audio Language Models.
Axel Roebel
Kyutai
Axel Roebel is a report-backed author in the LLMpeople atlas, connected through Continuous Audio Language Models.
Bin Zhang
Alibaba Qwen
Bin Zhang is a report-backed author in the LLMpeople atlas, connected through Qwen3-TTS Technical Report.
Boyong Wu
Stepfun
Boyong Wu is a report-backed author in the LLMpeople atlas, connected through Step-Audio-EditX Technical Report.
Chao Yan
Stepfun
Chao Yan is a report-backed author in the LLMpeople atlas, connected through Step-Audio-EditX Technical Report.
Dake Guo
Alibaba Qwen
Dake Guo is a report-backed author in the LLMpeople atlas, connected through Qwen3-TTS Technical Report.
Daxin Jiang
Stepfun
Daxin Jiang is a report-backed author in the LLMpeople atlas, connected through Step-Audio-EditX Technical Report.
Fei Tian
Stepfun
Fei Tian is a report-backed author in the LLMpeople atlas, connected through Step-Audio-EditX Technical Report.
Gang Yu
Stepfun
Gang Yu is a report-backed author in the LLMpeople atlas, connected through Step-Audio-EditX Technical Report.
Guoqiang Hu
Stepfun
Guoqiang Hu is a report-backed author in the LLMpeople atlas, connected through Step-Audio-EditX Technical Report.
Hangrui Hu
Alibaba Qwen
Hangrui Hu is a report-backed author in the LLMpeople atlas, connected through Qwen3-TTS Technical Report.
Li Xie
Stepfun
Li Xie is a report-backed author in the LLMpeople atlas, connected through Step-Audio-EditX Technical Report.
Neil Zeghidour
Kyutai
Neil Zeghidour is a report-backed author in the LLMpeople atlas, connected through Continuous Audio Language Models.
Pengfei Tan
Stepfun
Pengfei Tan is a report-backed author in the LLMpeople atlas, connected through Step-Audio-EditX Technical Report.
Peng Yang
Stepfun
Peng Yang is a report-backed author in the LLMpeople atlas, connected through Step-Audio-EditX Technical Report.
Shuchang Zhou
Stepfun
Shuchang Zhou is a report-backed author in the LLMpeople atlas, connected through Step-Audio-EditX Technical Report.
Simon Rouard
Kyutai
Simon Rouard is a report-backed author in the LLMpeople atlas, connected through Continuous Audio Language Models.
Ting He
Alibaba Qwen
Ting He is a report-backed author in the LLMpeople atlas, connected through Qwen3-TTS Technical Report.
Xiangyu (Tony) Zhang
Stepfun
Xiangyu (Tony) Zhang is a report-backed author in the LLMpeople atlas, connected through Step-Audio-EditX Technical Report.
Xiangyu Zhang
Stepfun
Xiangyu Zhang is a report-backed author in the LLMpeople atlas, connected through Step-Audio-EditX Technical Report.
Xian Shi
Alibaba Qwen
Xian Shi is a report-backed author in the LLMpeople atlas, connected through Qwen3-ASR Technical Report.
Xinfa Zhu
Alibaba Qwen
Xinfa Zhu is a report-backed author in the LLMpeople atlas, connected through Qwen3-TTS Technical Report.
Xuerui Yang
Stepfun
Xuerui Yang is a report-backed author in the LLMpeople atlas, connected through Step-Audio-EditX Technical Report.
Xu Tan
Microsoft
Xu Tan is a report-backed author in the LLMpeople atlas, connected through VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers.
Yao Qian
Microsoft
Yao Qian is a report-backed author in the LLMpeople atlas, connected through VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers.
Yu Xi
Alibaba Qwen
Yu Xi is a report-backed author in the LLMpeople atlas, connected through Qwen3-ASR Technical Report.
Yuxin Zhang
Stepfun
Yuxin Zhang is a report-backed author in the LLMpeople atlas, connected through Step-Audio-EditX Technical Report.
Ziyue Jiang
Alibaba Qwen
Ziyue Jiang is a report-backed author in the LLMpeople atlas, connected through Qwen3-TTS Technical Report.