Atlas / Fields / Detail
Multimodal Language Models
Researchers connected to this field in the public atlas.
Anand Avati
Apple
Anand Avati is a PhD candidate at the Stanford AI Lab. His homepage says he is the technical lead of the AI-Enabled ACP project deployed at Stanford Hospital, has served as principal instructor of Stanford's CS229 course, and previously was a distinguished engineer in Red Hat's CTO office after joining through the acquisition of Gluster, where he was a founding engineer and lead architect of GlusterFS.
Andrew Tulloch
Apple
Andrew Tulloch is a researcher at Meta working on superintelligence and a co-founder of Thinking Machines Lab. His public homepage says he previously worked on machine learning systems at Meta, helped train GPT-4o, GPT-4.5, and o3 at OpenAI, and studied mathematics at the University of Sydney and the University of Cambridge.
Ji Lin
Apple
Ji Lin is a research scientist at OpenAI working on multimodal models, reasoning, and synthetic data. He previously completed his Ph.D. and M.Sc. in EECS at MIT under Song Han, earned a B.Eng. in Electronic Engineering from Tsinghua University, and has interned or worked at Adobe Research, OmniML, and NVIDIA Research.
Sanchit Gandhi
Apple
Sanchit Gandhi is a research scientist at Mistral AI. His public Hugging Face and conference speaker profiles say he previously worked on open-source speech technology at Hugging Face and at Apple, helped popularize OpenAI's Whisper ecosystem, and earned a master's degree at the University of Cambridge.
Sebastian Gehrmann
Apple
Sebastian Gehrmann is the head of Responsible AI in the office of the CTO at Bloomberg. His homepage says he previously led Bloomberg's NLP work, earlier researched evaluation of large language models at Google, and earned a PhD from Harvard University. It also highlights research interests in natural language generation, model evaluation, and interpretability.
Ariana Mirian
Apple
Ariana Mirian is a senior security researcher at Censys and a PhD student in computer science at UC Berkeley. Her public resume shows earlier AI/ML researcher and data scientist roles at Apple, as well as Stanford degrees in business administration and electrical engineering.
Haotian Zhang
Apple
Haotian Zhang is a research scientist on Apple AI/ML's Visual Intelligence team. His homepage says he works on embodied agents that understand the world from 2D and 3D image data as well as natural language, previously interned at Microsoft Research and Azure AI, completed a PhD in electrical and computer engineering at the University of Washington in 2022, and earlier earned master's degrees at Washington and a bachelor's degree at Shanghai Jiao Tong University.
Kartik Sreenivasan
Apple
Kartik Sreenivasan is a machine learning researcher whose homepage says he was a final-year PhD student in computer science at the University of Wisconsin-Madison advised by Dimitris Papailiopoulos. The same page says he studies optimization, machine learning, and large-scale distributed settings, earned a B.Tech from the National Institute of Technology Karnataka, worked at Adobe Systems for three years in Bengaluru, and was preparing to join Databricks as a research scientist on the after-training team.
Zhengfeng Lai
Apple
Zhengfeng Lai is an ML Research Scientist at Apple AI/ML. His self-authored CV lists a PhD in Electrical and Computer Engineering from the University of California, Davis, prior Apple internship work, and publications in multimodal and vision-language learning.
Ari Holtzman
Cohere
Assistant professor at the University of Chicago and head of the Conceptualization Lab, working on language generation, communication, and large language models.
Eric P. Xing
Apple
Eric P. Xing is president of Mohamed bin Zayed University of Artificial Intelligence and a professor in Carnegie Mellon University's Machine Learning Department, Language Technologies Institute, and Computer Science Department. His public homepage and biography say his work spans machine learning, statistical methodology, large-scale computational systems, large language models, world and agent models, and biology foundation models; he earned a bachelor's degree from Peking University and PhDs from Rutgers University and the University of Alberta.
Marzyeh Ghassemi
Apple
Marzyeh Ghassemi is the Germeshausen Career Development Professor and an associate professor in electrical engineering and computer science and the Institute for Medical Engineering and Science at MIT, and a CIFAR AI Chair at the Vector Institute. MIT says she joined MIT in July 2021 after serving as an assistant professor at the University of Toronto in computer science and medicine and as a Vector Institute faculty member holding a Canadian CIFAR AI Chair and Canada Research Chair. MIT also lists a PhD in computer science from MIT, an MSc in biomedical engineering from Oxford University, and bachelor's degrees in computer science and electrical engineering from New Mexico State University.
Sanjiv Kumar
Apple
Sanjiv Kumar is a Google Fellow and vice president at Google DeepMind. His homepage says he leads a machine learning team working on foundation models including LLMs and generative AI for Gemini, has also led research in deep retrieval and ranking, and earned a Ph.D. in computer science from Carnegie Mellon University in 2005.
Yacine Jernite
Apple
Yacine Jernite leads the ML and Society team at Hugging Face. His personal site says he works on ML systems governance at the intersection of regulatory and technical tools, with a focus on NLP models, data curation, documentation, and governance, and that he completed a Ph.D. in computer science at New York University under David Sontag.
Ke Ye
Apple
Ke Ye is a fifth-year PhD student in the Language Technologies Institute at Carnegie Mellon University. His homepage says his research spans natural language processing, machine learning, and artificial intelligence, including language models, speech and speech-language models, conversational AI, and reasoning, and notes that before Carnegie Mellon he worked at Apple Intelligence. His OpenReview profile lists Apple foundation-model work alongside earlier roles at Google, Roblox, and Capital One.
Aakanksha Chowdhery
Apple
Aakanksha Chowdhery is an adjunct professor at Stanford and a researcher at Reflection AI working on agentic LLMs and reinforcement learning for self-improving agents. Her homepage says she previously led work on the 540B PaLM model and Gemini at Google, earlier led interdisciplinary research initiatives at Microsoft Research and Princeton University, and completed a PhD in electrical engineering at Stanford University.
Kanishka Rao
Apple
Kanishka Rao is a researcher at Google. Public Google Research and OpenReview profiles connect Kanishka Rao to work in natural language processing, speech processing, robotics, and responsible AI, and list earlier student-research and internship roles at Carnegie Mellon University, Google Research, and Microsoft Research along with a master's degree in language technologies from Carnegie Mellon University.
Nandan Thakur
Apple
Nandan Thakur is a PhD student at the University of Waterloo working on information retrieval, vision-language models, inference optimization, and agentic systems. His public homepage says he is currently affiliated with Apple and Stanford and previously worked with researchers at Meta AI and NVIDIA.
Jean-Baptiste Tristan
Apple
Jean-Baptiste Tristan works at Anthropic on alignment. His public website says he previously worked at Amazon AWS on generative AI, at OpenAI, and at Meta, and that he holds bachelor's and master's degrees from Rice University in electrical engineering and computer science plus a PhD in machine learning from UC Berkeley.
Horace He
Apple
Horace He is a researcher at Meta working on PyTorch. His personal homepage says he works on machine learning and compilers, and his Google Scholar profile lists Cornell University with interests in machine learning, compilers, and algorithms.
Karan Singhal
Apple
Karan Singhal leads the Health AI team at OpenAI. His homepage says he works on LLMs for health and AI safety, with goals that include universalizing access to medical expertise, using health as a testbed for safety, and developing better plans for high-stakes AI deployment. The same page says his previous Google work included Med-PaLM and Med-PaLM 2.
Mingxing Tan
Apple
Mingxing Tan is a staff research scientist at Google DeepMind in the San Francisco Bay Area. His public homepage says he works on efficient models and reasoning, was the lead author of EfficientNet, EfficientDet, and MobileNetV3, and earned a Ph.D. from the University of Washington.
Da-Cheng Juan
Apple
Da-Cheng Juan is a software engineer at Google Research. His official Google Research profile says he has worked on large-scale semi-supervised learning with Expander and personalized recommendation for computational advertising, received a PhD from Carnegie Mellon University in 2014 before joining Google, and works on machine learning, convex optimization, and data mining.
Sven Gowal
Apple
Sven Gowal is a research scientist at Google DeepMind in Mountain View. His public Google Research profile describes work in robust machine learning, responsible AI, machine perception, and watermarking, and public profile text for that page notes previous postdoctoral work in statistical machine learning at UCL and a Ph.D. from EPFL.
Yao Lu
Google Gemini
Yao Lu is a research scientist at NVIDIA Research working on embodied AI, foundation models, and computer vision. His public homepage says he previously worked at Google DeepMind and Boston University and earned a Ph.D. from Carnegie Mellon University.
Yingbo Mao
Apple
Yingbo Mao is an assistant professor at the University of Hong Kong. His public homepage says he works on computer vision and multimodal large language models, completed his PhD at the Chinese University of Hong Kong in 2024 under Dahua Lin, and previously served as head of multimodal at Together AI and as a research scientist at UCLA.
Jiahui Yu
Google Gemini
Jiahui Yu is a Research Lead at OpenAI leading the Perception team. His homepage notes prior co-leadership on Gemini Multimodal at Google DeepMind and work on deep learning and high-performance computing.
Ruoming Pang
Apple
Public profiles and publications link Ruoming Pang to speech, language, and multimodal model research. Reuters reported on February 25, 2026 that OpenAI hired him from Meta.
Sharon Zhou
Apple
Sharon Zhou is founder and CEO of Lamini and is pursuing a PhD in computer science at Stanford University. Stanford speaker pages describe her as co-director of the Generative AI for Education Hub at Stanford HAI, advised by Fei-Fei Li and Jure Leskovec, and working on generative AI topics including large language models, computer vision, and graphics.
Marcelo O. Matena
Apple
Marcelo O. Matena is a research engineer at Apple. His public homepage says he focuses on machine learning, natural language processing, and software engineering, and that he holds a doctorate in natural language processing from the University of Edinburgh plus both master's and bachelor's degrees in computer science from UFMG.
Phillipp Schmid
Apple
Phillipp Schmid is a staff engineer in developer experience and developer relations at Google DeepMind. His public about page says he joined Google DeepMind in 2025 after serving as a technical lead at Hugging Face, where he led strategic collaborations with major cloud providers and focused on practical large language model deployment and RLHF.
Ninareh Mehrabi
Amazon
Ninareh Mehrabi says she works at Meta's Superintelligence Labs on red teaming and frontier risks, after earlier roles at Amazon AGI and Amazon Alexa AI. USC CAIS lists her as a PhD alum in computer science.
Yinfei Yang
Apple
Research scientist at Apple focused on natural language processing and machine learning.
Pingye Shi
Apple
Pingye Shi is an applied research scientist at Apple AI/ML in Cupertino. His public homepage says he completed a Ph.D. in computer science at Cornell and works on machine learning applications and machine learning systems.
Nan Du
Apple
Nan Du works on large language models, mixture-of-experts methods, few-shot learning, and natural language processing. Her public OpenReview profile lists Apple AIML as her current affiliation and Google Brain as an earlier role.
Haoxuan You
Apple
Research scientist on Apple Foundation Models whose work focuses on machine learning systems, multimodal foundation models, and AI agents.
Max Schwarzer
Apple
Max Schwarzer is a reinforcement learning researcher whose work focuses on scaling and sample-efficient RL. He completed a PhD at Mila, later interned in Apple's machine learning research group, and was an author on Apple's MM1 multimodal pre-training report.
Samia Touileb
Cohere
Associate Professor in Natural Language Processing at the University of Bergen whose work focuses on bias and fairness in NLP, information extraction, summarization, and under-resourced languages.
Danny Driess
Google Gemini
Danny Driess is a research scientist at Google DeepMind whose work focuses on general AI, robot learning, and multimodal foundation models.
Ehsan Amid
Apple
Ehsan Amid is a research scientist at Apple. His public homepage says his work focuses on machine learning for speech and language processing and deep representation learning, and his exact Google Scholar profile provides a supporting public publication record.
Nikolay Burbulis
Apple
Nikolay Burbulis is a staff research scientist on Apple Foundation Models. His public homepage says his interests include inference-time compute, reasoning, and code generation, and that he completed a Ph.D. in mathematics and computer science at EPFL.
Fei Xia
Google Gemini / Mistral AI
Senior Staff Research Scientist and Tech Lead Manager at Google DeepMind Robotics, focused on embodied agents and foundation models for robot decision-making.
Jonathan H. Clark
Apple
Jonathan H. Clark is a research scientist at Apple working on foundation models. His website says he previously worked at Google and focuses on large language model pretraining, evaluation, and data.
Tim Cooijmans
Apple
Tim Cooijmans is a research scientist at Apple. His personal site says he previously worked as a research scientist at Google DeepMind and as a postdoctoral researcher at Mila, and that his interests include machine learning, generative modeling, and reinforcement learning.
Awni Hannun
Apple
Researcher and engineer working on machine learning, software, and hardware systems.
Sachin Kumar
Apple
Sachin Kumar is a researcher at Apple and incoming assistant professor at UC San Diego. His work focuses on natural language processing, efficient and multilingual language models, and machine learning systems.
Arianna Bisazza
Cohere
Associate professor of natural language processing at the University of Groningen and research scientist at Cohere Labs, with work spanning machine translation, multilingual models, and multimodal language understanding.
Jitendra Malik
Apple
Jitendra Malik is a computer vision and machine learning researcher at UC Berkeley whose public homepage and Google Scholar profile highlight work on image understanding, robotics, and foundation models.
Jonas Geiping
Apple
Machine learning researcher at Apple Machine Learning Research working at the intersection of optimization, privacy, and security.
Munsina Sundaram
Cohere
Machine learning engineer and researcher based in the San Francisco Bay Area whose interests include multilingual and multimodal machine learning, responsible AI, and applications in healthcare and education.
Raza Habib
Cohere
Research scientist and engineer focused on multimodal and multilingual language models, with public work on translation, retrieval, and agent systems.
Teddy Karrer
Google Gemini
Teddy Karrer is a research scientist working on embodied AI, multimodal reasoning, and machine learning for interactive systems. His public profile highlights robotics, decision making, and intelligent agents.
Yusuke M. Asano
Google Gemini
Research scientist at Google whose work spans computer vision, multimodal learning, and large embodied models, including PaLM-E.
Yuyin Zhou
NVIDIA
Assistant Professor of Computer Science and Engineering at UC Santa Cruz working on multimodal learning, computer vision, and medical image analysis.
Zhaowen Wang
NVIDIA
Research manager at NVIDIA working on large-scale distributed pretraining, synthetic data, multimodal LLMs, and computer vision.
Shyamal Anadkat
NVIDIA
Shyamal Anadkat's personal site describes him as a former Applied AI practitioner at OpenAI and an AI advisor to startups, and frames the site as essays on AI, startups, strategy, and the future of work.
Floris Weers
Apple
Research scientist at Apple working on efficient and multilingual language modeling, speech and language systems, and large language models.
Xiang Kong
Apple
Xiang Kong's public homepage says he is a machine learning researcher at Apple and that he received his PhD from the School of Computer Science at Carnegie Mellon University.
Andy Zeng
Google Gemini
Andy Zeng is a Research Scientist at Google DeepMind. His public research interests include robot learning, computer vision, graphics, and personalized 3D content generation.
Vincent Ponzo
Amazon
Public sources identify Vincent Ponzo as a former Responsible AI Business Development Lead at Amazon AI and a 2025 Amazon technical report coauthor.
Brandon McKinzie
Apple
Senior research scientist at Apple working on large multimodal foundation models, with prior work on large language models at MosaicML.
Neil Houlsby
Apple
Neil Houlsby works on adaptation of large language models, transfer learning, parameter-efficient fine-tuning, and inference efficiency.
Scott Reed
Google Gemini
Research scientist at Google DeepMind working on language, vision, action, and robotics; previously on the Google Brain team and a co-creator of the first text-to-image GAN.
Weizhu Chen
NVIDIA
Technical Fellow and CVP, Microsoft GenAI. His official Microsoft Research profile says he leads a modeling team working on large-scale model training and human language technologies.
Rahul Gupta
Amazon
Amazon Science lists Rahul Gupta as Senior Manager, Applied Science at Amazon AGI, with publications in LLM safety, evaluation, and responsible AI.
Angela Fan
Apple
Research scientist at Apple working at the intersection of natural language processing, machine learning, and AI, with a focus on building more intelligent, robust, and reliable systems.
Fei Xia
Google Gemini
Research scientist at Google DeepMind working on robotics and embodied intelligence. His research spans robot learning, navigation, manipulation, and multimodal agents.
Nicolas Heess
Google Gemini
Nicolas Heess is a research scientist at Google DeepMind whose work focuses on machine learning, reinforcement learning, and robotics.
Peter Grasch
Apple
Research scientist at Apple focused on state-of-the-art machine learning and computer vision methods.
Zirui Wang
Apple
Senior researcher at Apple working on large models, multimodal learning, and speech processing, according to his personal site.
Abhinav Mohanty
Amazon
Public sources list Abhinav Mohanty as a coauthor on Amazon Nova safety evaluation work under the Frontier Model Safety Framework.
Aida Amini
Cohere
Researcher focused on grounded language understanding, question answering, semantic parsing, and natural language inference.
Ishan Misra
Apple
Ishan Misra is a Research Scientist at Apple whose work spans computer vision, multimodal learning, and large foundation models. He has contributed to Apple Intelligence foundation model research.
Johnny Mao
Google Gemini
Senior research scientist at Google DeepMind working on machine learning.
Marc G. Bellemare
Google Gemini
Principal research scientist at Google DeepMind and professor of computer science at McGill University.
Masoud Alizadeh
Apple
Research scientist at Apple specializing in multilingual and multimodal generative models.
Montserrat Gonzalez Arenas
Google Gemini
Montserrat Gonzalez Arenas is a research engineer at Google Research whose public work focuses on robot learning and mobile manipulation, including robotic table wiping, waste sorting, and RT-Trajectory for robot task generalization.
Paria Hafezi
Apple
Research scientist and engineer at Apple working on foundation models for speech and language, with interests in explainability and interpretability.
Thaddeus Culhane
NVIDIA
Research scientist at NVIDIA working on multimodal AI, especially language and vision models.
Yao Lu
DeepSeek / Google Gemini
Yao Lu is listed as an author of the Google technical report Gemini Robotics: Bringing AI into the Physical World.
Mark Lee
Apple
Co-author of MM1, which studies multimodal LLM pre-training.
Payal Motwani
Amazon
Public report authorship links Payal Motwani to Amazon Nova Responsible AI evaluations for Nova Premier and Nova 2.0 Lite.
Shiliang Pu
Baidu
Researcher at Baidu and coauthor of the ERNIE 4.5 Technical Report.
Shuaiqian Wang
Baidu
Researcher at Baidu and coauthor of the ERNIE 4.5 Technical Report.
Tom Gunter
Apple
Research scientist at Apple Intelligence working on computer vision, machine learning, and natural language processing.
Alex Beutel
Apple
Alex Beutel is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Ashish Ahuja
Apple
Ashish Ahuja is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Ayush Jain
Apple
Ayush Jain is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Bharat Ramadoss
Apple
Bharat Ramadoss is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Bret Kinsella
Apple
Bret Kinsella is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Cheng-Kang Hsieh
Apple
Cheng-Kang Hsieh is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Florian Schroff
Apple
Florian Schroff is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Gang Luo
Apple
Gang Luo is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Hamid Palangi
Apple
Hamid Palangi is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Harsha Nori
Apple
Harsha Nori is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Hong Xu
Apple
Hong Xu is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
HyoukJoong Lee
Apple
HyoukJoong Lee is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Jianfeng Gao
NVIDIA
Public report authorship links Jianfeng Gao to the Nemotron-4 15B Technical Report at NVIDIA.
Jiawei Dong
Baidu
Researcher at Baidu and coauthor of the ERNIE 4.5 Technical Report.
Jin Xu
Apple
Jin Xu is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Justin Tsai
Apple
Justin Tsai is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Ke Yang
Apple
Ke Yang is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Lukas Haas
Apple
Lukas Haas is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Mengyu Zhao
Apple
Mengyu Zhao is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Michael Riley
Apple
Michael Riley is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Michael Rush
Apple
Michael Rush is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Ming Yang
Baidu
Researcher at Baidu and coauthor of the ERNIE 4.5 Technical Report.
Nick Charron
Apple
Nick Charron is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Pankaj Kedia
Apple
Pankaj Kedia is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Puneet Pathak
Apple
Puneet Pathak is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Rahul Nair
Apple
Rahul Nair is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Rahul Vaidyanathan
Apple
Rahul Vaidyanathan is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Rong Rong
Apple
Rong Rong is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Saksham Singhal
Apple
Saksham Singhal is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Sam Dodge
Apple
Sam Dodge is an Apple AI/ML-affiliated researcher. The linked arXiv paper lists him as a coauthor of MM1 and shows his affiliation as Apple AI/ML in Cupertino, California.
Sharmila Bhattacharya
Apple
Sharmila Bhattacharya is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Shobhit Chauhan
Apple
Shobhit Chauhan is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Sofi Yao
Apple
Sofi Yao is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Sonal Gupta
Apple
Sonal Gupta is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Tianjian Lu
Apple
Tianjian Lu is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Tim Dettmers
Apple
Tim Dettmers is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Vasu Sharma
Apple
Vasu Sharma is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Weijie Su
Apple
Weijie Su is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Wenfeng Chen
Baidu
Researcher at Baidu and coauthor of the ERNIE 4.5 Technical Report.
Wojciech Zaremba
Apple
Wojciech Zaremba is listed as an author of the Apple technical reports Apple Intelligence Foundation Language Models and Apple Intelligence Foundation Language Models: Tech Report 2025.
Xiaomin Wang
Apple
Xiaomin Wang is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Xingqiao Liu
Apple
Xingqiao Liu is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Yanbo Liang
Apple
Yanbo Liang is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Yifeng Lu
Apple
Yifeng Lu is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Yingbo Zhou
Apple
Yingbo Zhou is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Yi Tay
Apple
Yi Tay is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Yunsen Xian
Baidu
Researcher at Baidu and coauthor of the ERNIE 4.5 Technical Report.
Yuriy Gusev
Apple
Yuriy Gusev is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Zhongyuan Wang
Baidu
Researcher at Baidu and coauthor of the ERNIE 4.5 Technical Report.
Bing Ren
Apple
Researcher working on on-device and foundation language models, including Apple Intelligence models.
Bowen Zhang
Apple
Research scientist at Apple working on large language models, vision-language models, and model scaling.
Dhruti Shah
Apple
Researcher working on machine learning, vision and language, computer vision, diffusion, and generative AI.
Jean-Philippe Fauconnier
Apple
Research scientist at Apple Foundation Models working on generative AI, large language models, and multimodal models.
Ming Lei
Apple
Apple researcher whose publications include the Apple Intelligence Foundation Language Models technical report.
Philipp Dufter
Apple
Research scientist at Apple Foundation Models with interests in natural language processing, structured generation, controllable generation, and algorithmic efficiency.
Raman Chopra
Apple
Apple researcher whose publications include the Apple Intelligence Foundation Language Models technical report.
Tengyun Huang
Apple
Apple researcher whose publications include the Apple Intelligence Foundation Language Models technical report.
Xianzhi Du
Apple
Research scientist at Apple working on language and vision-language modeling, AI agents, and post-training.
Yash Jernite
Apple
Apple researcher whose publications include the Apple Intelligence Foundation Language Models technical report.
Zhe Gan
Apple
Machine learning researcher at Apple working on large multimodal foundation models, video generation, and vision-language systems.
Alex Wang
Cohere
Public report authorship links Alex Wang to the Aya Vision: Advancing the Frontier of Multilingual Multimodality at Cohere.
Ali Farhadi
Cohere
Public report authorship links Ali Farhadi to the Aya Vision: Advancing the Frontier of Multilingual Multimodality at Cohere.
Amit Bhonkar
Apple
Amit Bhonkar is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Amit Singh
Apple
Amit Singh is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Aniket Kittur
Cohere
Public report authorship links Aniket Kittur to the Aya Vision: Advancing the Frontier of Multilingual Multimodality at Cohere.
Anna Goldie
Apple
Anna Goldie is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Berta Chulvi
Cohere
Public report authorship links Berta Chulvi to the Aya Vision: Advancing the Frontier of Multilingual Multimodality at Cohere.
Binhang Yuan
Apple
Binhang Yuan is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Bokun Wang
NVIDIA
Bokun Wang is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.
Bo Pang
Apple
Bo Pang is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Chris Hughes
Apple
Chris Hughes is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models: Tech Report 2025.
David Hallac
Apple
David Hallac is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Dongxin Li
Apple
Dongxin Li is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Eugene Yun
Apple
Eugene Yun is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Fuwen Tan
NVIDIA
Fuwen Tan is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.
George Dahl
Apple
George Dahl is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models: Tech Report 2025.
Ge Zhang
NVIDIA
Ge Zhang is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.
Haotian Zhang
NVIDIA
Haotian Zhang is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.
Haritha Nori
Apple
Haritha Nori is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Harshavardhan Kannan
Apple
Harshavardhan Kannan is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models: Tech Report 2025.
Himanshu Arora
Apple
Himanshu Arora is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models: Tech Report 2025.
Jamie Simon
Apple
Jamie Simon is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Jared Quincy Davis
Apple
Jared Quincy Davis is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Jiaming Wang
NVIDIA
Jiaming Wang is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.
Jian Zhang
Apple
Jian Zhang is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Jiaqi Zeng
NVIDIA
Jiaqi Zeng is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.
Jingchao Ge
NVIDIA
Jingchao Ge is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.
Jixuan Fan
Apple
Jixuan Fan is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models: Tech Report 2025.
Johnny Wei
Apple
Johnny Wei is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Junjie Wang
Apple
Junjie Wang is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Karan Singhal
Cohere
Public report authorship links Karan Singhal to the Aya Vision: Advancing the Frontier of Multilingual Multimodality at Cohere.
Mahdi Milani Fard
Apple
Mahdi Milani Fard is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Manan Tomar
Apple
Manan Tomar is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Mariam Morshed
Apple
Mariam Morshed is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Marilyne Berlemont
Apple
Marilyne Berlemont is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models: Tech Report 2025.
Marton Patwary
Apple
Marton Patwary is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Max Ku
Cohere
Public report authorship links Max Ku to the Aya Vision: Advancing the Frontier of Multilingual Multimodality at Cohere.
Mengwei Xu
Apple
Mengwei Xu is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Miao Liu
Apple
Miao Liu is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models: Tech Report 2025.
Mohak Bansal
Apple
Mohak Bansal is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Moishe Hasabnis
Apple
Moishe Hasabnis is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Nathan Cooper
Cohere
Public report authorship links Nathan Cooper to the Aya Vision: Advancing the Frontier of Multilingual Multimodality at Cohere.
Nikita Bhalla
Apple
Nikita Bhalla is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Nina Wenzel
Apple
Nina Wenzel is listed as an author of the Apple technical report MM1.5: Methods, Analysis and Insights from Multimodal LLM Fine-tuning.
Oluwatobi "Tobi" Oladipo
Apple
Oluwatobi "Tobi" Oladipo is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Pai Peng
NVIDIA
Pai Peng is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.
Phillipp Wiesner
Apple
Phillipp Wiesner is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Ping Xu
Apple
Ping Xu is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Pranav Yadlapalli
Apple
Pranav Yadlapalli is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Pritish Kamath
Apple
Pritish Kamath is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Pulin Gupta
Apple
Pulin Gupta is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Pyeongjae Cho
Apple
Pyeongjae Cho is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Qing Guo
NVIDIA
Qing Guo is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.
Ranjith Prasad
Apple
Ranjith Prasad is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Romal Thoppilan
Apple
Romal Thoppilan is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Sam Yen-Chi Chen
Apple
Sam Yen-Chi Chen is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Saurabh Tiwary
Apple
Saurabh Tiwary is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Sercan O. Arik
Apple
Sercan O. Arik is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Shang-Wen Li
Apple
Shang-Wen Li is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models: Tech Report 2025.
Sharad Bhat
Apple
Sharad Bhat is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Sheeel Jindal
Apple
Sheeel Jindal is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Shengyu Wang
NVIDIA
Shengyu Wang is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.
Shiqi Yu
Apple
Shiqi Yu is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Shuai Zhang
NVIDIA
Shuai Zhang is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.
Simran Arora
NVIDIA
Simran Arora is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.
Siyao Li
Apple
Siyao Li is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Sora Tokumine
Google Gemini
Sora Tokumine is listed as an author of the Google technical report PaLM-E: An Embodied Multimodal Language Model.
Sumit Kumar Jha
NVIDIA
Sumit Kumar Jha is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.
Swaroop Mishra
Apple
Swaroop Mishra is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Tianwei Zhang
Apple
Tianwei Zhang is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Tim Althoff
NVIDIA
Tim Althoff is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.
Tomas Pfister
Apple
Tomas Pfister is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Udit Gupta
Apple
Udit Gupta is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Vishal Monga
Apple
Vishal Monga is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Wenhao Li
Apple
Wenhao Li is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Wenshan Wang
Apple
Wenshan Wang is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Xavier Garcia
Apple
Xavier Garcia is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Xi Chen
Apple
Xi Chen is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Xingyou Song
Apple
Xingyou Song is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Xinyuan Li
Apple
Xinyuan Li is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Xizhou Zhu
NVIDIA
Xizhou Zhu is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.
Yadong Wang
NVIDIA
Yadong Wang is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.
Yang Zhao
NVIDIA
Yang Zhao is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.
Yann LeCun
Apple
Yann LeCun is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Yanping Huang
Apple
Yanping Huang is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Yeung-Leung Chow
NVIDIA
Yeung-Leung Chow is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.
Yichao Ma
NVIDIA
Yichao Ma is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.
Yichen Zhu
NVIDIA
Yichen Zhu is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.
Yiqi Han
Apple
Yiqi Han is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models: Tech Report 2025.
Yiwen Wang
Google Gemini
Yiwen Wang is listed as an author of the Google technical report PaLM-E: An Embodied Multimodal Language Model.
Yuan Du
NVIDIA
Yuan Du is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.
Yuchen Jin
NVIDIA
Yuchen Jin is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.
Yuchen Zhang
NVIDIA
Yuchen Zhang is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.
Zhe Lin
Google Gemini
Zhe Lin is listed as an author of the Google technical report PaLM-E: An Embodied Multimodal Language Model.
Zhilin Wu
Apple
Zhilin Wu is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Zizhao Zhang
Apple
Zizhao Zhang is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Zoubin Ghahramani
Apple
Zoubin Ghahramani is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.
Aditya Siddhant
Cohere
Member of Technical Staff at Cohere Labs working on multilingual and multimodal language technologies.
Afshin Dehghan
Apple
Research scientist at Apple focused on computer vision, multimodal learning, and robotics.
Aleksei Timofeev
Apple
Research scientist whose public OpenReview profile lists work on multimodal representation learning, speech synthesis, and personalized voice generation.
Alexander Toshev
Apple
Computer vision and machine learning scientist at Apple whose work includes multimodal understanding and robotics, following earlier leadership roles at Google.
Amin Jalali
Apple
Apple researcher whose publications include the Apple Intelligence Foundation Language Models technical report.
Andy Yao
NVIDIA
Research scientist at NVIDIA with public publications on multimodal language models and visual instruction tuning, including NVLM, VILA, and Video2Flow.
Anton Belyi
Apple
Research scientist at Apple and adjunct professor at MIPT working in computer vision, image processing, and machine learning.
Caiming Xiong
NVIDIA
Vice President of AI Research and General Manager of AI Platforms at NVIDIA.
Cengiz Oztireli
Google Gemini
Senior staff research scientist at Google DeepMind and affiliated lecturer at Cambridge working on computer vision, machine learning, and computer graphics.
Daria Buchsbaum
Google Gemini
Daria Buchsbaum is a PhD student at Georgia Tech and a Research Scientist Intern at Google DeepMind.
Forrest Huang
Apple
Research scientist at Apple Foundation Models working on efficient training and multimodal language models.
Futang Peng
Apple
Research scientist at Apple focusing on understanding and generating text and images.
Greg Yang
Apple
AI researcher and deep learning theorist whose public work includes tensor programs and maximal update parameterization. He coauthored Apple's 2025 Apple Intelligence foundation language models technical report.
Hong-You Chen
Apple
AI and machine learning engineer at Apple working on multimodal foundation models; previously worked at Snap and the University of Southern California.
Hongyu He
Apple
Research scientist at Apple focused on computer vision, machine learning, and multimodal understanding.
Jose A. Arenas
Google Gemini
Staff software engineer at Google focused on machine learning and systems.
Keen You
Apple
Research scientist at Apple specializing in post-training, reinforcement learning, and AI agents.
Louis Borry
Google Gemini
Louis Borry is a PhD student at Google DeepMind working on embodied language models and grounded language understanding.
Mikel Arza
Google Gemini
Research scientist at Google DeepMind focused on robotics and machine learning, especially reinforcement learning and language models.
Mingfei Gao
Apple
Researcher working on machine learning, optimization, and sequential data.
Qiaozi Gao
Google Gemini
Qiaozi Gao is a Stanford PhD student whose work spans vision and language, machine learning, and robotics, with research internships at Google and Google DeepMind.
Ran Tian
NVIDIA
Research scientist at NVIDIA working on multimodal language models and vision-language research, with public publications including NVLM, VILA, and Visual Role Play.
Rulin Shao
NVIDIA
PhD student at UCLA and research intern at NVIDIA, working on multimodal reasoning, vision-language models, and embodied AI.
Sam Wiseman
Apple
Sam Wiseman is an assistant professor of computer science at New York University whose research focuses on natural language processing and machine learning, including controllable generation, summarization, and learning from human feedback.
Seb Noury
Apple
Apple researcher whose publications include the Apple Intelligence Foundation Language Models technical report and work on the MLX framework.
Sergey Ioffe
Apple
Machine learning researcher whose work spans neural networks and statistics, and a co-author of Apple's Foundation Language Models report.
Soroosh Mariooryad
Apple
Senior research scientist at Apple with public publications spanning speech, audio, and language modeling, including work on speech language models and MemoryLLM.
Tanmay Shah
Apple
Apple researcher whose publications include the Apple Intelligence Foundation Language Models technical report.
Thomas Blankevoort
Google Gemini
Thomas Blankevoort is a Research Scientist at Google DeepMind whose work focuses on efficient neural networks and machine learning systems.
Tom Small
Apple
Researcher working on foundation language models and efficient inference, including Apple Intelligence models.