Multimodal Language Models | Field

Anand Avati is a PhD candidate at the Stanford AI Lab. His homepage says he is the technical lead of the AI-Enabled ACP project deployed at Stanford Hospital, has served as principal instructor of Stanford's CS229 course, and previously was a distinguished engineer in Red Hat's CTO office after joining through the acquisition of Gluster, where he was a founding engineer and lead architect of GlusterFS.

Andrew Tulloch is a researcher at Meta working on superintelligence and a co-founder of Thinking Machines Lab. His public homepage says he previously worked on machine learning systems at Meta, helped train GPT-4o, GPT-4.5, and o3 at OpenAI, and studied mathematics at the University of Sydney and the University of Cambridge.

Ji Lin is a research scientist at OpenAI working on multimodal models, reasoning, and synthetic data. He previously completed his Ph.D. and M.Sc. in EECS at MIT under Song Han, earned a B.Eng. in Electronic Engineering from Tsinghua University, and has interned or worked at Adobe Research, OmniML, and NVIDIA Research.

Sanchit Gandhi is a research scientist at Mistral AI. His public Hugging Face and conference speaker profiles say he previously worked on open-source speech technology at Hugging Face and at Apple, helped popularize OpenAI's Whisper ecosystem, and earned a master's degree at the University of Cambridge.

Sebastian Gehrmann is the head of Responsible AI in the office of the CTO at Bloomberg. His homepage says he previously led Bloomberg's NLP work, earlier researched evaluation of large language models at Google, and earned a PhD from Harvard University. It also highlights research interests in natural language generation, model evaluation, and interpretability.

Ariana Mirian is a senior security researcher at Censys and a PhD student in computer science at UC Berkeley. Her public resume shows earlier AI/ML researcher and data scientist roles at Apple, as well as Stanford degrees in business administration and electrical engineering.

Haotian Zhang is a research scientist on Apple AI/ML's Visual Intelligence team. His homepage says he works on embodied agents that understand the world from 2D and 3D image data as well as natural language, previously interned at Microsoft Research and Azure AI, completed a PhD in electrical and computer engineering at the University of Washington in 2022, and earlier earned master's degrees at Washington and a bachelor's degree at Shanghai Jiao Tong University.

Kartik Sreenivasan is a machine learning researcher whose homepage says he was a final-year PhD student in computer science at the University of Wisconsin-Madison advised by Dimitris Papailiopoulos. The same page says he studies optimization, machine learning, and large-scale distributed settings, earned a B.Tech from the National Institute of Technology Karnataka, worked at Adobe Systems for three years in Bengaluru, and was preparing to join Databricks as a research scientist on the after-training team.

Zhengfeng Lai is an ML Research Scientist at Apple AI/ML. His self-authored CV lists a PhD in Electrical and Computer Engineering from the University of California, Davis, prior Apple internship work, and publications in multimodal and vision-language learning.

Assistant professor at the University of Chicago and head of the Conceptualization Lab, working on language generation, communication, and large language models.

Eric P. Xing is president of Mohamed bin Zayed University of Artificial Intelligence and a professor in Carnegie Mellon University's Machine Learning Department, Language Technologies Institute, and Computer Science Department. His public homepage and biography say his work spans machine learning, statistical methodology, large-scale computational systems, large language models, world and agent models, and biology foundation models; he earned a bachelor's degree from Peking University and PhDs from Rutgers University and the University of Alberta.

Marzyeh Ghassemi is the Germeshausen Career Development Professor and an associate professor in electrical engineering and computer science and the Institute for Medical Engineering and Science at MIT, and a CIFAR AI Chair at the Vector Institute. MIT says she joined MIT in July 2021 after serving as an assistant professor at the University of Toronto in computer science and medicine and as a Vector Institute faculty member holding a Canadian CIFAR AI Chair and Canada Research Chair. MIT also lists a PhD in computer science from MIT, an MSc in biomedical engineering from Oxford University, and bachelor's degrees in computer science and electrical engineering from New Mexico State University.

Sanjiv Kumar is a Google Fellow and vice president at Google DeepMind. His homepage says he leads a machine learning team working on foundation models including LLMs and generative AI for Gemini, has also led research in deep retrieval and ranking, and earned a Ph.D. in computer science from Carnegie Mellon University in 2005.

Yacine Jernite leads the ML and Society team at Hugging Face. His personal site says he works on ML systems governance at the intersection of regulatory and technical tools, with a focus on NLP models, data curation, documentation, and governance, and that he completed a Ph.D. in computer science at New York University under David Sontag.

Ke Ye is a fifth-year PhD student in the Language Technologies Institute at Carnegie Mellon University. His homepage says his research spans natural language processing, machine learning, and artificial intelligence, including language models, speech and speech-language models, conversational AI, and reasoning, and notes that before Carnegie Mellon he worked at Apple Intelligence. His OpenReview profile lists Apple foundation-model work alongside earlier roles at Google, Roblox, and Capital One.

Aakanksha Chowdhery is an adjunct professor at Stanford and a researcher at Reflection AI working on agentic LLMs and reinforcement learning for self-improving agents. Her homepage says she previously led work on the 540B PaLM model and Gemini at Google, earlier led interdisciplinary research initiatives at Microsoft Research and Princeton University, and completed a PhD in electrical engineering at Stanford University.

Kanishka Rao is a researcher at Google. Public Google Research and OpenReview profiles connect Kanishka Rao to work in natural language processing, speech processing, robotics, and responsible AI, and list earlier student-research and internship roles at Carnegie Mellon University, Google Research, and Microsoft Research along with a master's degree in language technologies from Carnegie Mellon University.

Nandan Thakur is a PhD student at the University of Waterloo working on information retrieval, vision-language models, inference optimization, and agentic systems. His public homepage says he is currently affiliated with Apple and Stanford and previously worked with researchers at Meta AI and NVIDIA.

Jean-Baptiste Tristan works at Anthropic on alignment. His public website says he previously worked at Amazon AWS on generative AI, at OpenAI, and at Meta, and that he holds bachelor's and master's degrees from Rice University in electrical engineering and computer science plus a PhD in machine learning from UC Berkeley.

Horace He is a researcher at Meta working on PyTorch. His personal homepage says he works on machine learning and compilers, and his Google Scholar profile lists Cornell University with interests in machine learning, compilers, and algorithms.

Karan Singhal leads the Health AI team at OpenAI. His homepage says he works on LLMs for health and AI safety, with goals that include universalizing access to medical expertise, using health as a testbed for safety, and developing better plans for high-stakes AI deployment. The same page says his previous Google work included Med-PaLM and Med-PaLM 2.

Mingxing Tan is a staff research scientist at Google DeepMind in the San Francisco Bay Area. His public homepage says he works on efficient models and reasoning, was the lead author of EfficientNet, EfficientDet, and MobileNetV3, and earned a Ph.D. from the University of Washington.

Da-Cheng Juan is a software engineer at Google Research. His official Google Research profile says he has worked on large-scale semi-supervised learning with Expander and personalized recommendation for computational advertising, received a PhD from Carnegie Mellon University in 2014 before joining Google, and works on machine learning, convex optimization, and data mining.

Sven Gowal is a research scientist at Google DeepMind in Mountain View. His public Google Research profile describes work in robust machine learning, responsible AI, machine perception, and watermarking, and notes previous postdoctoral work in statistical machine learning at UCL and a Ph.D. from EPFL.

Yao Lu is a research scientist at NVIDIA Research working on embodied AI, foundation models, and computer vision. His public homepage says he previously worked at Google DeepMind and Boston University and earned a Ph.D. from Carnegie Mellon University.

Yingbo Mao is an assistant professor at the University of Hong Kong. His public homepage says he works on computer vision and multimodal large language models, completed his PhD at the Chinese University of Hong Kong in 2024 under Dahua Lin, and previously served as head of multimodal at Together AI and as a research scientist at UCLA.

Jiahui Yu is a Research Lead at OpenAI leading the Perception team. His homepage notes prior co-leadership on Gemini Multimodal at Google DeepMind and work on deep learning and high-performance computing.

Public profiles and publications link Ruoming Pang to speech, language, and multimodal model research. Reuters reported on February 25, 2026, that OpenAI hired him from Meta.

Sharon Zhou is founder and CEO of Lamini and is pursuing a PhD in computer science at Stanford University. Stanford speaker pages describe her as co-director of the Generative AI for Education Hub at Stanford HAI, advised by Fei-Fei Li and Jure Leskovec, and working on generative AI topics including large language models, computer vision, and graphics.

Marcelo O. Matena is a research engineer at Apple. His public homepage says he focuses on machine learning, natural language processing, and software engineering, and that he holds a doctorate in natural language processing from the University of Edinburgh plus both master's and bachelor's degrees in computer science from UFMG.

Philipp Schmid is a staff engineer in developer experience and developer relations at Google DeepMind. His public about page says he joined Google DeepMind in 2025 after serving as a technical lead at Hugging Face, where he led strategic collaborations with major cloud providers and focused on practical large language model deployment and RLHF.

Ninareh Mehrabi says she works at Meta's Superintelligence Labs on red teaming and frontier risks, after earlier roles at Amazon AGI and Amazon Alexa AI. USC CAIS lists her as a PhD alum in computer science.

Research scientist at Apple focused on natural language processing and machine learning.

Pingye Shi is an applied research scientist at Apple AI/ML in Cupertino. His public homepage says he completed a Ph.D. in computer science at Cornell and works on machine learning applications and machine learning systems.

Nan Du works on large language models, mixture-of-experts methods, few-shot learning, and natural language processing. Her public OpenReview profile lists Apple AIML as her current affiliation and Google Brain as an earlier role.

Research scientist on Apple Foundation Models whose work focuses on machine learning systems, multimodal foundation models, and AI agents.

Max Schwarzer is a reinforcement learning researcher whose work focuses on scaling and sample-efficient RL. He completed a PhD at Mila, later interned in Apple's machine learning research group, and was an author on Apple's MM1 multimodal pre-training report.

Associate Professor in Natural Language Processing at the University of Bergen whose work focuses on bias and fairness in NLP, information extraction, summarization, and under-resourced languages.

Danny Driess is a research scientist at Google DeepMind whose work focuses on general AI, robot learning, and multimodal foundation models.

Ehsan Amid is a research scientist at Apple. His public homepage says his work focuses on machine learning for speech and language processing and deep representation learning, and his Google Scholar profile provides a supporting publication record.

Nikolay Burbulis is a staff research scientist on Apple Foundation Models. His public homepage says his interests include inference-time compute, reasoning, and code generation, and that he completed a Ph.D. in mathematics and computer science at EPFL.

Senior Staff Research Scientist and Tech Lead Manager at Google DeepMind Robotics, focused on embodied agents and foundation models for robot decision-making.

Jonathan H. Clark is a research scientist at Apple working on foundation models. His website says he previously worked at Google and focuses on large language model pretraining, evaluation, and data.

Tim Cooijmans is a research scientist at Apple. His personal site says he previously worked as a research scientist at Google DeepMind and as a postdoctoral researcher at Mila, and that his interests include machine learning, generative modeling, and reinforcement learning.

Researcher and engineer working on machine learning, software, and hardware systems.

Sachin Kumar is a researcher at Apple and incoming assistant professor at UC San Diego. His work focuses on natural language processing, efficient and multilingual language models, and machine learning systems.

Associate professor of natural language processing at the University of Groningen and research scientist at Cohere Labs, with work spanning machine translation, multilingual models, and multimodal language understanding.

Jitendra Malik is a computer vision and machine learning researcher at UC Berkeley whose public homepage and Google Scholar profile highlight work on image understanding, robotics, and foundation models.

Machine learning researcher at Apple Machine Learning Research working at the intersection of optimization, privacy, and security.

Machine learning engineer and researcher based in the San Francisco Bay Area whose interests include multilingual and multimodal machine learning, responsible AI, and applications in healthcare and education.

Research scientist and engineer focused on multimodal and multilingual language models, with public work on translation, retrieval, and agent systems.

Teddy Karrer is a research scientist working on embodied AI, multimodal reasoning, and machine learning for interactive systems. His public profile highlights robotics, decision making, and intelligent agents.

Research scientist at Google whose work spans computer vision, multimodal learning, and large embodied models, including PaLM-E.

Assistant Professor of Computer Science and Engineering at UC Santa Cruz working on multimodal learning, computer vision, and medical image analysis.

Research manager at NVIDIA working on large-scale distributed pretraining, synthetic data, multimodal LLMs, and computer vision.

Shyamal Anadkat's personal site describes him as a former Applied AI practitioner at OpenAI and an AI advisor to startups, and frames the site as essays on AI, startups, strategy, and the future of work.

Research scientist at Apple working on efficient and multilingual language modeling, speech and language systems, and large language models.

Xiang Kong's public homepage says he is a machine learning researcher at Apple and that he received his PhD from the School of Computer Science at Carnegie Mellon University.

Andy Zeng is a Research Scientist at Google DeepMind. His public research interests include robot learning, computer vision, graphics, and personalized 3D content generation.

Public sources identify Vincent Ponzo as a former Responsible AI Business Development Lead at Amazon AI and a 2025 Amazon technical report coauthor.

Senior research scientist at Apple working on large multimodal foundation models, with prior work on large language models at MosaicML.

Neil Houlsby works on adaptation of large language models, transfer learning, parameter-efficient fine-tuning, and inference efficiency.

Research scientist at Google DeepMind working on language, vision, action, and robotics; previously on the Google Brain team and a co-creator of the first text-to-image GAN.

Technical Fellow and CVP, Microsoft GenAI. His official Microsoft Research profile says he leads a modeling team working on large-scale model training and human language technologies.

Amazon Science lists Rahul Gupta as Senior Manager, Applied Science at Amazon AGI, with publications in LLM safety, evaluation, and responsible AI.

Research scientist at Apple working at the intersection of natural language processing, machine learning, and AI, with a focus on building more intelligent, robust, and reliable systems.

Research scientist at Google DeepMind working on robotics and embodied intelligence. His research spans robot learning, navigation, manipulation, and multimodal agents.

Nicolas Heess is a research scientist at Google DeepMind whose work focuses on machine learning, reinforcement learning, and robotics.

Research scientist at Apple focused on state-of-the-art machine learning and computer vision methods.

Senior researcher at Apple working on large models, multimodal learning, and speech processing, according to his personal site.

Public sources list Abhinav Mohanty as a coauthor on Amazon Nova safety evaluation work under the Frontier Model Safety Framework.

Researcher focused on grounded language understanding, question answering, semantic parsing, and natural language inference.

Ishan Misra is a Research Scientist at Apple whose work spans computer vision, multimodal learning, and large foundation models. He has contributed to Apple Intelligence foundation model research.

Senior research scientist at Google DeepMind working on machine learning.

Principal research scientist at Google DeepMind and professor of computer science at McGill University.

Research scientist at Apple specializing in multilingual and multimodal generative models.

Montserrat Gonzalez Arenas is a research engineer at Google Research whose public work focuses on robot learning and mobile manipulation, including robotic table wiping, waste sorting, and RT-Trajectory for robot task generalization.

Research scientist and engineer at Apple working on foundation models for speech and language, with interests in explainability and interpretability.

Research scientist at NVIDIA working on multimodal AI, especially language and vision models.

Yao Lu is listed as an author of the Google technical report Gemini Robotics: Bringing AI into the Physical World.

Coauthor of MM1, which studies multimodal LLM pre-training.

Public report authorship links Payal Motwani to Amazon Nova Responsible AI evaluations for Nova Premier and Nova 2.0 Lite.

Researcher at Baidu and coauthor of the ERNIE 4.5 Technical Report.

Research scientist at Apple Intelligence working on computer vision, machine learning, and natural language processing.

Alex Beutel is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Ashish Ahuja is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Ayush Jain is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Bharat Ramadoss is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Bret Kinsella is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Cheng-Kang Hsieh is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Florian Schroff is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Gang Luo is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Hamid Palangi is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Harsha Nori is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Hong Xu is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

HyoukJoong Lee is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Public report authorship links Jianfeng Gao to the Nemotron-4 15B Technical Report at NVIDIA.

Researcher at Baidu and coauthor of the ERNIE 4.5 Technical Report.

Jin Xu is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Justin Tsai is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Ke Yang is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Lukas Haas is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Mengyu Zhao is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Michael Riley is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Michael Rush is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Researcher at Baidu and coauthor of the ERNIE 4.5 Technical Report.

Nick Charron is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Pankaj Kedia is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Puneet Pathak is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Rahul Nair is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Rahul Vaidyanathan is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Rong Rong is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Saksham Singhal is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Sam Dodge is an Apple AI/ML-affiliated researcher. The linked arXiv paper lists him as a coauthor of MM1 and shows his affiliation as Apple AI/ML in Cupertino, California.

Sharmila Bhattacharya is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Shobhit Chauhan is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Sofi Yao is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Sonal Gupta is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Tianjian Lu is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Tim Dettmers is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Vasu Sharma is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Weijie Su is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Researcher at Baidu and coauthor of the ERNIE 4.5 Technical Report.

Wojciech Zaremba is listed as an author of the Apple technical reports Apple Intelligence Foundation Language Models and Apple Intelligence Foundation Language Models: Tech Report 2025.

Xiaomin Wang is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Xingqiao Liu is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Yanbo Liang is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Yifeng Lu is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Yingbo Zhou is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Yi Tay is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Researcher at Baidu and coauthor of the ERNIE 4.5 Technical Report.

Yuriy Gusev is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Researcher at Baidu and coauthor of the ERNIE 4.5 Technical Report.

Researcher working on on-device and foundation language models, including Apple Intelligence models.

Research scientist at Apple working on large language models, vision-language models, and model scaling.

Researcher working on machine learning, vision and language, computer vision, diffusion, and generative AI.

Research scientist at Apple Foundation Models working on generative AI, large language models, and multimodal models.

Apple researcher whose publications include the Apple Intelligence Foundation Language Models technical report.

Research scientist at Apple Foundation Models with interests in natural language processing, structured generation, controllable generation, and algorithmic efficiency.

Apple researcher whose publications include the Apple Intelligence Foundation Language Models technical report.

Research scientist at Apple working on language and vision-language modeling, AI agents, and post-training.

Apple researcher whose publications include the Apple Intelligence Foundation Language Models technical report.

Machine learning researcher at Apple working on large multimodal foundation models, video generation, and vision-language systems.

Public report authorship links Alex Wang to the Cohere report Aya Vision: Advancing the Frontier of Multilingual Multimodality.

Public report authorship links Ali Farhadi to the Cohere report Aya Vision: Advancing the Frontier of Multilingual Multimodality.

Amit Bhonkar is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Amit Singh is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Public report authorship links Aniket Kittur to the Cohere report Aya Vision: Advancing the Frontier of Multilingual Multimodality.

Anna Goldie is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Public report authorship links Berta Chulvi to the Cohere report Aya Vision: Advancing the Frontier of Multilingual Multimodality.

Binhang Yuan is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Bokun Wang is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.

Bo Pang is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Chris Hughes is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models: Tech Report 2025.

David Hallac is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Dongxin Li is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Eugene Yun is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Fuwen Tan is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.

George Dahl is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models: Tech Report 2025.

Ge Zhang is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.

Haotian Zhang is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.

Haritha Nori is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Harshavardhan Kannan is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models: Tech Report 2025.

Himanshu Arora is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models: Tech Report 2025.

Jamie Simon is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Jared Quincy Davis is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Jiaming Wang is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.

Jian Zhang is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Jiaqi Zeng is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.

Jingchao Ge is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.

Jixuan Fan is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models: Tech Report 2025.

Johnny Wei is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Junjie Wang is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Public report authorship links Karan Singhal to the Cohere report Aya Vision: Advancing the Frontier of Multilingual Multimodality.

Mahdi Milani Fard is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Manan Tomar is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Mariam Morshed is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Marilyne Berlemont is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models: Tech Report 2025.

Marton Patwary is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Public report authorship links Max Ku to the Cohere report Aya Vision: Advancing the Frontier of Multilingual Multimodality.

Mengwei Xu is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Miao Liu is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models: Tech Report 2025.

Mohak Bansal is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Moishe Hasabnis is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Public report authorship links Nathan Cooper to the Cohere report Aya Vision: Advancing the Frontier of Multilingual Multimodality.

Nikita Bhalla is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Nina Wenzel is listed as an author of the Apple technical report MM1.5: Methods, Analysis and Insights from Multimodal LLM Fine-tuning.

Oluwatobi "Tobi" Oladipo is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Pai Peng is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.

Phillipp Wiesner is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Ping Xu is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Pranav Yadlapalli is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Pritish Kamath is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Pulin Gupta is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Pyeongjae Cho is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Qing Guo is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.

Ranjith Prasad is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Romal Thoppilan is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Sam Yen-Chi Chen is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Saurabh Tiwary is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Sercan O. Arik is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Shang-Wen Li is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models: Tech Report 2025.

Sharad Bhat is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Sheeel Jindal is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Shengyu Wang is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.

Shiqi Yu is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Shuai Zhang is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.

Simran Arora is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.

Siyao Li is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Sora Tokumine is listed as an author of the Google technical report PaLM-E: An Embodied Multimodal Language Model.

Sumit Kumar Jha is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.

Swaroop Mishra is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Tianwei Zhang is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Tim Althoff is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.

Tomas Pfister is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Udit Gupta is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Vishal Monga is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Wenhao Li is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Wenshan Wang is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Xavier Garcia is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Xi Chen is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Xingyou Song is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Xinyuan Li is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Xizhou Zhu is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.

Yadong Wang is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.

Yang Zhao is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.

Yann LeCun is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Yanping Huang is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Yeung-Leung Chow is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.

Yichao Ma is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.

Yichen Zhu is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.

Yiqi Han is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models: Tech Report 2025.

Yiwen Wang is listed as an author of the Google technical report PaLM-E: An Embodied Multimodal Language Model.

Yuan Du is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.

Yuchen Jin is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.

Yuchen Zhang is listed as an author of the NVIDIA technical report NVLM: Open Frontier-Class Multimodal LLMs.

Zhe Lin is listed as an author of the Google technical report PaLM-E: An Embodied Multimodal Language Model.

Zhilin Wu is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Zizhao Zhang is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Zoubin Ghahramani is listed as an author of the Apple technical report Apple Intelligence Foundation Language Models.

Member of Technical Staff at Cohere Labs working on multilingual and multimodal language technologies.

Research scientist at Apple focused on computer vision, multimodal learning, and robotics.

Research scientist whose public OpenReview profile lists work on multimodal representation learning, speech synthesis, and personalized voice generation.

Computer vision and machine learning scientist at Apple whose work includes multimodal understanding and robotics, following earlier leadership roles at Google.

Apple researcher whose publications include the Apple Intelligence Foundation Language Models technical report.

Research scientist at NVIDIA with public publications on multimodal language models and visual instruction tuning, including NVLM, VILA, and Video2Flow.

Research scientist at Apple and adjunct professor at MIPT working in computer vision, image processing, and machine learning.

Vice President of AI Research and General Manager of AI Platforms at NVIDIA.

Senior staff research scientist at Google DeepMind and affiliated lecturer at Cambridge working on computer vision, machine learning, and computer graphics.

Daria Buchsbaum is a PhD student at Georgia Tech and a Research Scientist Intern at Google DeepMind.

Research scientist at Apple Foundation Models working on efficient training and multimodal language models.

Research scientist at Apple focusing on understanding and generating text and images.

AI researcher and deep learning theorist whose public work includes tensor programs and maximal update parameterization. He coauthored the Apple technical report Apple Intelligence Foundation Language Models: Tech Report 2025.

AI and machine learning engineer at Apple working on multimodal foundation models; previously worked at Snap and the University of Southern California.

Research scientist at Apple focused on computer vision, machine learning, and multimodal understanding.

Staff software engineer at Google focused on machine learning and systems.

Research scientist at Apple specializing in post-training, reinforcement learning, and AI agents.

Louis Borry is a PhD student at Google DeepMind working on embodied language models and grounded language understanding.

Research scientist at Google DeepMind focused on robotics and machine learning, especially reinforcement learning and language models.

Researcher working on machine learning, optimization, and sequential data.

Qiaozi Gao is a Stanford PhD student whose work spans vision and language, machine learning, and robotics, with research internships at Google and Google DeepMind.

Research scientist at NVIDIA working on multimodal language models and vision-language research, with public publications including NVLM, VILA, and Visual Role Play.

PhD student at UCLA and research intern at NVIDIA, working on multimodal reasoning, vision-language models, and embodied AI.

Sam Wiseman is an assistant professor of computer science at New York University whose research focuses on natural language processing and machine learning, including controllable generation, summarization, and learning from human feedback.

Apple researcher whose publications include the Apple Intelligence Foundation Language Models technical report and work on the MLX framework.

Machine learning researcher whose work spans neural networks and statistics, and a co-author of the Apple Intelligence Foundation Language Models technical report.

Senior research scientist at Apple with public publications spanning speech, audio, and language modeling, including work on speech language models and MemoryLLM.

Apple researcher whose publications include the Apple Intelligence Foundation Language Models technical report.

Thomas Blankevoort is a Research Scientist at Google DeepMind whose work focuses on efficient neural networks and machine learning systems.

Researcher working on foundation language models and efficient inference, including Apple Intelligence models.

Anand Avati

Andrew Tulloch

Ji Lin

Sanchit Gandhi

Sebastian Gehrmann

Ariana Mirian

Haotian Zhang

Kartik Sreenivasan

Zhengfeng Lai

Ari Holtzman

Eric P. Xing

Marzyeh Ghassemi

Sanjiv Kumar

Yacine Jernite

Ke Ye

Aakanksha Chowdhery

Kanishka Rao

Nandan Thakur

Jean-Baptiste Tristan

Horace He

Karan Singhal

Mingxing Tan

Da-Cheng Juan

Sven Gowal

Yao Lu

Yingbo Mao

Jiahui Yu

Ruoming Pang

Sharon Zhou

Marcelo O. Matena

Phillipp Schmid

Ninareh Mehrabi

Yinfei Yang

Pingye Shi

Nan Du

Haoxuan You

Max Schwarzer

Samia Touileb

Danny Driess

Ehsan Amid

Nikolay Burbulis

Fei Xia

Jonathan H. Clark

Tim Cooijmans

Awni Hannun

Sachin Kumar

Arianna Bisazza

Jitendra Malik

Jonas Geiping

Munsina Sundaram

Raza Habib

Teddy Karrer

Yusuke M. Asano

Yuyin Zhou

Zhaowen Wang

Shyamal Anadkat

Floris Weers

Xiang Kong

Andy Zeng

Vincent Ponzo

Brandon McKinzie

Neil Houlsby

Scott Reed

Weizhu Chen

Rahul Gupta

Angela Fan

Fei Xia

Nicolas Heess

Peter Grasch

Zirui Wang

Abhinav Mohanty

Aida Amini

Ishan Misra

Johnny Mao

Marc G. Bellemare

Masoud Alizadeh

Montserrat Gonzalez Arenas

Paria Hafezi

Thaddeus Culhane

Yao Lu