Atlas / Fields / Detail
Speech Language Models
Researchers connected to this field in the public atlas.
Clémentine Fourrier
Mistral AI
AI researcher focused on evaluating language models and agents, open NLP research, and historical linguistics. She led evaluation efforts at Hugging Face between 2023 and 2025 and helped build LightEval and the Open LLM Leaderboard.
Jean-Baptiste Alayrac
Mistral AI
Jean-Baptiste Alayrac is a researcher focused on multimodal learning, vision-language modeling, and video understanding.
Wei-Ning Hsu
Meta AI
Research scientist at Meta FAIR working on speech and audio foundation models. His research covers self-supervised learning, spoken language modeling, and multimodal audio-language systems.
Yossi Adi
Meta AI
Yossi Adi is a computer scientist at the Hebrew University of Jerusalem and a research scientist at Meta FAIR. His research focuses on speech, audio, and language modeling, including spoken language models and machine learning methods for speech applications.
Arthur Mensch
Mistral AI
Co-founder and CEO of Mistral AI and a researcher on efficient large language models and mixture-of-experts systems.
Felix Kreuk
Meta AI
Research scientist at Meta AI working on generative AI, multimodal learning, and speech and audio generation. His public homepage notes earlier research at Bar-Ilan University before joining Meta in Menlo Park.
Pierre Sennrich
Mistral AI
Pierre Sennrich is Chief Scientist at Mistral AI and a professor at the University of Zurich. His research centers on natural language processing and machine translation, and he has led widely cited work on subword methods and multilingual language technology.
Tu Anh Nguyen
Meta AI
Tu Anh Nguyen is a research scientist at Meta working on speech and audio generation. He is also a PhD candidate at Mila and the Universite de Montreal, advised by Yoshua Bengio and Abdelrahman Mohamed, with interests in audio language models, speech generation, and efficient inference.
Timothée Lacroix
Meta AI / Mistral AI
Timothee Lacroix is a machine learning researcher whose public work includes multilingual representation learning and open language models.
Mingxuan Wang
ByteDance Seed / Mistral AI
Mingxuan Wang is a researcher at ByteDance Seed. Public ByteDance Seed sources identify Wang Mingxuan as a Senior Researcher on the Doubao Seed Team, and official Seed publications list Mingxuan Wang as an author on reasoning and multimodal technical reports.
Emmanuel Dupoux
Meta AI
Research scientist and professor working across Meta, NYU, and EHESS on speech, language, and cognitive science. His work studies how humans and machines acquire language and how spoken and written models can be aligned.
John Canny
Mistral AI
Professor of Electrical Engineering and Computer Sciences at the University of California, Berkeley, known for work spanning artificial intelligence, machine learning, and related computing systems research.
Karan Sikka
Amazon
Senior applied scientist on the Amazon AGI team working on multimodal generative AI, speech recognition, and spoken language understanding, and a co-author of the Amazon Nova Sonic technical report.
Morgane Riviere
Meta AI
Research scientist working on natural language processing, with public work spanning speech and language modeling such as VoxPopuli, pGSLM, and SPIrit-LM.
Raghuraman Krishnamoorthi
Amazon
Applied scientist at Amazon AGI working on speech, spoken language translation, and multimodal generative AI, and a co-author of the Amazon Nova Sonic technical report.
Guillaume Lample
Meta AI / Mistral AI
Chief AI scientist at Mistral AI and co-founder of Kyutai. Previously worked on large language models and machine translation at Meta and earned a PhD in computer science at Sorbonne University and Inria Paris.
Jinyuan Jia
MiniMax
Researcher working on speech and multimodal language models, including MiniMax-Speech and related speech understanding work.
Louis Martin
Meta AI / Mistral AI
Louis Martin is a scientist at Meta AI and a PhD student at McGill University and Mila. His research spans natural language processing and machine learning.
Ming Ding
MiniMax
Lead of foundation models at MiniMax working on large language models, multimodal pretraining, and efficient training systems. He completed a PhD in computer science at Tsinghua University.
Qingyang Ge
MiniMax
Research scientist at MiniMax AI Research interested in large language models, machine learning, and computer vision. He earned a PhD from Shanghai Jiao Tong University and previously worked at Alibaba DAMO Academy on vision-language reasoning and video understanding.
Teven Le Scao
Mistral AI
Research scientist at Mistral AI and co-author of the Mistral 7B report.
Wenchao Zhou
MiniMax
Research scientist at MiniMax AI Research focused on large language models, machine learning, and recommendation systems. He received a PhD from Shanghai Jiao Tong University and previously worked at Alibaba DAMO Academy on machine learning and distributed databases.
Yusheng Zhao
MiniMax
Research scientist at MiniMax AI Research focused on reinforcement learning, reasoning, multimodal learning, large language models, and large-scale distributed systems. He received a PhD in machine learning from Carnegie Mellon University.
Dawei Feng
MiniMax
Co-founder and research scientist at MiniMax AI Research. He received a PhD from Tsinghua University and works on foundation models, reinforcement learning, and data systems, with publications at major machine learning and NLP venues.
Kushal Lakhotia
Meta AI
Research scientist at Meta whose OpenReview profile describes work on multilingual language and speech models, along with data and inference optimization.
Xiang Li
MiniMax
Research scientist at MiniMax AI Research with interests in large language models, machine learning, and data intelligence. He earned a PhD from Nanyang Technological University and previously worked at Alibaba DAMO Academy on data intelligence and large language model applications.
Abdelrahman Mohamed
Meta AI
Abdelrahman Mohamed is a professor at the University of Toronto and a Canada CIFAR AI Chair whose work spans speech, audio, and language modeling. His public profile highlights speech recognition, representation learning, and multimodal foundation models.
Alexei Baevski
Meta AI
Research scientist at Meta whose public work spans speech and audio-language modeling; arXiv author results include SPIrit-LM.
Ariel Noy
Meta AI
Research scientist at Meta working on spoken language technology and multimodal language models.
Pallavi Baljekar
Amazon
Pallavi Baljekar is an Applied Scientist at Amazon whose public profile focuses on machine learning and language technologies.
Pierre Colombo
Mistral AI
Research scientist at Mistral AI and associate researcher at Harvard working on natural language processing and multimodal machine learning.
Sean McLeish
Amazon
PhD student at University College London and the Alan Turing Institute whose research spans language and vision models, genomics, and reinforcement learning.
Yao Qian
Amazon
Senior principal scientist at Amazon whose public work focuses on speech, audio, and generative AI systems.