Atlas / Fields / Detail
Multimodal Large Language Models
Researchers connected to this field in the public atlas.
Radu Soricut
Google Gemini
Radu Soricut is a Distinguished Scientist at Google DeepMind working on natural language processing and machine learning, with earlier Google Research and Google Translate work.
Tulsee Doshi
Google Gemini
Tulsee Doshi is a Senior Director of Product Management at Google DeepMind and currently leads product for Gemini Model. She previously served as Head of Product for Responsible AI at Google and holds both an M.S. and a Ph.D. in Symbolic Systems from Stanford.
Jifeng Dai
DeepSeek / MiniMax
Jifeng Dai is a tenured associate professor in the Department of Electronic Engineering at Tsinghua University. His homepage says his current research focuses on agentic AI and continual learning, and lists prior roles at Shanghai AI Lab, SenseTime Research, and Microsoft Research Asia.
Ksenia Konyushkova
Google Gemini
Ksenia Konyushkova is a research scientist at Google DeepMind in London working on computer vision, embodied AI, and reinforcement learning. Her personal homepage describes earlier roles at Google Research in Zurich and postdoctoral work after EPFL.
Binyuan Hui
DeepSeek / MiniMax
Staff research scientist at Alibaba's Qwen Team and initiator of OpenDevin, focused on foundation models, reasoning models, coding agents, and computer-use agents.
Louis Martin
Meta AI / Mistral AI
Research scientist at Meta AI working on natural language processing and AI safety. His homepage says he completed a PhD at Facebook AI Research and Inria focused on text simplification and accessibility.
Hao Yang
DeepSeek / Moonshot AI
Hao Yang works on multimodal data infrastructure at Moonshot.ai. He previously worked at ByteDance ICVG and Microsoft Research Asia, and received BS and PhD degrees from Tsinghua University.
David Dohan
Google Gemini / OpenAI
David Dohan is a computer scientist at OpenAI studying scalable alignment of language models and generally intelligent reasoning systems. His personal site also notes prior work at Google Brain on foundation model programs, code generation, protein engineering, and scientific reasoning.
Vahid Noroozi
Google Gemini / NVIDIA
Vahid Noroozi is an applied research scientist at NVIDIA. His NVIDIA author profile says his work focuses on deep learning for speech and natural language processing and that he received a PhD in computer science from the University of Illinois Chicago. His homepage says he previously worked on post-training large language models at Google DeepMind after earlier multimedia and neuroscience research at TU Delft and the Max Planck Institute for Biological Cybernetics.
Kevin Robinson
Google Gemini
Kevin Robinson is a research engineer at Google Research working on evaluations of language models and NLP systems. His Google Research profile says he previously worked as a special education teacher, a software engineer building visualization and analytics systems, and a researcher in K12 computer science education.
Amjad Almahairi
Meta AI / Mistral AI
Amjad Almahairi is a researcher at Anyscale. His OpenReview profile lists work spanning LLMs, VLLMs, generative models, and deep learning, with earlier roles at Facebook and Element AI.
Sebastian Gehrmann
Google Gemini / Mistral AI
Sebastian Gehrmann leads Responsible AI in the office of the CTO at Bloomberg and works on natural language generation, model evaluation, and interpretability.
Rogerio Feris
Mistral AI
Principal scientist and senior manager at IBM Research's MIT-IBM Watson AI Lab. His public homepage emphasizes computer vision, multimodal AI, and augmenting large language models with memory for enterprise use.
Azade Nova
Google Gemini
Staff Research Scientist at Google DeepMind. Public Google profiles describe earlier work at Google Brain and Microsoft Research and research spanning machine learning, graph mining, and unstructured data analytics.
Jiahui Yu
Google Gemini
Jiahui Yu is a Research Lead at OpenAI leading the Perception team. His homepage notes prior co-leadership on Gemini Multimodal at Google DeepMind and work on deep learning and high-performance computing.
Bhuwan Dhingra
Google Gemini
Bhuwan Dhingra is an associate professor of computer science at Duke University and is also affiliated with Google DeepMind. His public Duke and lab profiles say he leads the AI for Language Technologies lab, co-directs Pratt at TUNL, is a member of Duke AI Health, works on natural language processing, multimodal learning, and trustworthy AI, and received a PhD in computer science from Carnegie Mellon University in 2019.
Asterios Katsamanis
Meta AI
ATHENA Research Center's profile describes Athanasios (Nassos) Katsamanis as a principal researcher there since 2019, focusing on multimodal speech processing, multimodal human-computer interaction, and human behavior analysis.
Lechao Xiao
Mistral AI
Lechao Xiao's OpenReview profile lists him as a researcher at Google DeepMind. His homepage says his current focus is scaling-centric machine learning and lists interests in deep learning theory, generalization, optimization, training dynamics, kernels, and Gaussian processes.
Huazuo Gao
DeepSeek
Researcher at DeepSeek AI working on decision-making and post-training for large language models.
Corey Lynch
Google Gemini
Corey Lynch is a research scientist at Google DeepMind working on embodied AI and robotics. He previously cofounded Ikonos.
Yuchen Ge
Google Gemini
Yuchen Ge is a research scientist at Google DeepMind whose work focuses on vision-language models and multimodal machine learning.
Angela Fan
Meta AI / Mistral AI
Recent public bios describe Angela Fan as a researcher at Meta working on large language models, machine translation, multilingual generation, and story generation.
Yoon Kim
Mistral AI
Research scientist at Mistral AI working on natural language processing and large language models; previously an assistant professor at MIT.
Fei Xia
Google Gemini / Mistral AI
Senior Staff Research Scientist and Tech Lead Manager at Google DeepMind Robotics, focused on embodied agents and foundation models for robot decision-making.
Luyao Yuan
Meta AI
Luyao Yuan is a research scientist at FAIR at Meta. Her homepage says her research aims to build AI systems that can see, learn, reason, and interact like humans, and that she completed a PhD in EECS at MIT advised by Antonio Torralba after earlier research with Song Han at MIT and Jiajun Wu at Stanford.
Szymon Migacz
Mistral AI
Szymon Migacz is a researcher at NVIDIA. His OpenReview profile lists NVIDIA as his affiliation since 2015, identifies deep learning as his expertise, and records University of Warsaw degrees in computer science.
Pablo Sprechmann
Google Gemini
Pablo Sprechmann is a research scientist at Google DeepMind whose work spans representation learning, reinforcement learning, and machine learning for football tactics. Before DeepMind, he was a postdoctoral researcher at New York University working with Yann LeCun. He previously completed doctoral research under Guillermo Sapiro.
Matthias Minderer
Google Gemini
Research Scientist at Google DeepMind in London working on large multimodal models, evaluation, agents, and computer vision; he completed a PhD at the University of Tuebingen and MPI for Intelligent Systems.
Armand Joulin
Google Gemini
Public research profiles show Armand Joulin as an author on work in natural language processing, information retrieval, and computer vision.
Chenguang Zhu
Meta AI
Research scientist at Meta AI focused on vision-language models, large language models, and agents; public work includes the multimodal foundation model Chameleon.
Xiangkun Wang
DeepSeek
Research intern at DeepSeek and undergraduate student at Tsinghua University focusing on multimodal large language models, agents, and embodied AI.
Xiaoze Liu
DeepSeek
Research intern at DeepSeek and PhD student at Carnegie Mellon University interested in machine learning, agents, language, vision, robotics, and healthcare.
Xinyu Li
DeepSeek
Research intern at DeepSeek and undergraduate student at Tsinghua University working on vision-language models, inference-time scaling, and reinforcement learning.
Zezhou Wang
DeepSeek
Research intern at DeepSeek and master's student at Tsinghua University working on large language models, reinforcement learning, and multimodal understanding and generation.
Zhihuan Liu
DeepSeek
Research intern at DeepSeek and PhD student at Shanghai Jiao Tong University working on large language models, reasoning, agents, and reinforcement learning.
Jascha Sohl-Dickstein
Mistral AI
Jascha Sohl-Dickstein is a member of the technical staff at Anthropic. His public site highlights work on diffusion models, overparameterized neural networks, learned optimizers, and large language models, and notes prior roles at Google Brain and Google DeepMind.
Jaehoon Lee
Google Gemini
Jaehoon Lee is a researcher at Google DeepMind. His work covers practical and foundational aspects of large language models, together with deep learning theory and reinforcement learning.
Jonathan Tompson
Google Gemini
Jonathan Tompson is a research scientist working on robotics, perception, and embodied AI. His public profile highlights work on computer vision, simulation, reinforcement learning, and robot intelligence.
Lewis Houghton
Google Gemini
Software engineer at Google DeepMind working on model architecture and engineering for general-purpose language models.
Vivek Natarajan
Google Gemini
Research scientist at Google DeepMind working on multimodal medical AI and personalized health applications.
Jie Zhou
DeepSeek / MiniMax
Public report authorship links Jie Zhou to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.
Armand Joulin
Meta AI
Armand Joulin is listed as an author of the Meta AI technical report Llama 2: Open Foundation and Fine-Tuned Chat Models.
Jason Wei
Google Gemini / OpenAI
Public report authorship links Jason Wei to the Gemma 3n Technical Report at Google.
Jinghong Yuan
DeepSeek
PhD student at UC San Diego researching reasoning, planning, and multimodal foundation models; publication context connects Jinghong Yuan to Janus-Pro.
Nicholas Crane
Meta AI
Research scientist at Meta working on computer vision and multimodal foundation models with an emphasis on robustness, trustworthiness, and alignment.
Jiaxuan Fan
DeepSeek
Jiaxuan Fan is a machine learning researcher at DeepSeek. Her interests include data-centric AI, model efficiency, and multimodal learning.
Sébastien Bubeck
Meta AI
Sébastien Bubeck's public homepage says he works on AI at OpenAI, after earlier work on convex optimization, online algorithms, and adversarial robustness.
Julian Schrittwieser
Google Gemini
Julian Schrittwieser is a Google DeepMind researcher known for reinforcement learning and game-playing systems.
Mike Lewis
Meta AI
Mike Lewis is a natural language processing researcher whose public work includes multimodal language modeling and large-scale pretraining.
Karen Simonyan
Google Gemini
Karén Simonyan is Chief Scientist at Microsoft AI. Public Microsoft sources describe him as a co-founder and former Chief Scientist of Inflection and credit him on recent Microsoft AI model work.
Kanishka Rao
Google Gemini
Kanishka Rao is listed in the author list for the Google DeepMind report 'Gemini Robotics: Bringing AI into the Physical World.'
Alaaeldin El-Nouby
Meta AI
Alaaeldin El-Nouby is a machine learning researcher whose public work includes multimodal and vision-language models.
Christopher Pal
Meta AI
Christopher Pal is a professor and AI researcher whose public work spans deep learning, multimodal learning, and large language models.
Michael Uthus
Google Gemini
Michael Uthus works on frontier model safety and evaluation at Google DeepMind.
Piotr Padlewski
Google Gemini
Piotr Padlewski is a researcher working on efficient language and multimodal models, with publications including Gemma 3n and EdgeMark.
Sami Stigzelius
Google Gemini
Machine learning researcher at Google DeepMind focused on multimodal foundation models and post-training.
Srujana Merugu
Meta AI
Research scientist at Meta AI focused on multimodal and embodied AI, with interests in computer vision, deep learning, and decision making.
Xiaodong Zhang
Mistral AI
Research scientist at Mistral AI whose homepage highlights work on large language models, agents, multimodal understanding, and scaling.
Khaled Saeed
Meta AI
Khaled Saeed is a Research Scientist at Meta working on efficient multimodal reasoning and AI systems.
Saswato R. Das
Mistral AI
Saswato R. Das is a postdoctoral researcher at Mistral AI working on computer vision and multimodal foundation models.
Orhan Firat
Google Gemini
Research scientist at Google Research whose public work spans multilingual and large-scale language modeling; arXiv author results include the PaLM paper.
Sebastian Borgeaud
Google Gemini
Research scientist at Google DeepMind in London working on agentic reasoning, efficient inference, and large-scale post-training, with a background in high-dimensional statistics and theory.
Vincent Vanhoucke
Google Gemini
Senior Staff Research Scientist at Google DeepMind and CTO of the Gemini app, with work spanning speech, language, vision, and large-scale AI systems.
Jiaxuan Li
Google Gemini
Jiaxuan Li is listed as an author of the Google technical report Gemma 3n Technical Report.
Mingxing Zhang
Google Gemini
Public report authorship links Mingxing Zhang to the Gemma 3n Technical Report at Google.
Mingze Li
Alibaba Qwen / Meta AI
Mingze Li is listed as an author of the Qwen technical report Qwen3 Technical Report.
Sebastian Goodman
Google Gemini
Public report authorship links Sebastian Goodman to the Gemma 3n Technical Report at Google.
Shang Yang
DeepSeek / MiniMax
Public report authorship links Shang Yang to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.
Su Wang
Google Gemini
Public report authorship links Su Wang to the Gemma 3n Technical Report at Google.
Adam Casson
Google Gemini
Research scientist at Google DeepMind working on large language models in London. His public site lists interests in efficient inference, evaluation, multi-agent systems, and interpretability, and notes earlier work on code intelligence at Graphcore.
Andy Brohan
Google Gemini
Research scientist at Google DeepMind working on general robotic intelligence, robot learning, and real-world datasets for improved robot dexterity and understanding.
Dylan Cope
Google Gemini
Research engineer at Google DeepMind focused on large language models, long-context systems, and efficient inference; previously worked on speech and generative models.
Faisal Azhar
Meta AI
Faisal Azhar is a PhD candidate in computer science at Stanford University. His work focuses on multimodal systems that unify text, image, and speech, together with efficient training and inference for large-scale machine learning.
Fang Xia
Google Gemini
Fang Xia is a Research Scientist at Google DeepMind working on bringing AI into the physical world through robotics and embodied intelligence.
Myungjae Ahn
Google Gemini
Myungjae Ahn is a postdoctoral researcher at Google DeepMind whose work focuses on multimodal AI, including language, speech, vision, and robotics.
Peter Florence
Google Gemini
Research scientist at Google DeepMind and co-founder of Waypoint, working on robot learning.
Rishabh Kabra
Google Gemini
Rishabh Kabra is a research scientist at Google DeepMind. His public homepage highlights work on machine learning systems and large-scale language model research.
Xiangyu Yue
Google Gemini
Research scientist at Google DeepMind working on multimodal large language models and efficient language modeling.
Albert Webson
Google Gemini
Public report authorship links Albert Webson to the Gemma 3n Technical Report at Google.
Andrew Webb
Google Gemini
Public report authorship links Andrew Webb to the Gemma 3n Technical Report at Google.
Ankur Handa
Google Gemini
Public report authorship links Ankur Handa to the Gemma 3n Technical Report at Google.
Arezoo Rajabi
Google Gemini
Public report authorship links Arezoo Rajabi to the Gemma 3n Technical Report at Google.
Bruno Lefaudeux
Meta AI
Bruno Lefaudeux is listed as an author of the Meta AI technical report Chameleon: Mixed-Modal Early-Fusion Foundation Models.
Caroline Pantofaru
Google Gemini
Public report authorship links Caroline Pantofaru to the Gemma 3n Technical Report at Google.
David Hong
Google Gemini
Public report authorship links David Hong to the Gemma 3n Technical Report at Google.
Denis Kocisky
Mistral AI
Denis Kocisky is listed as an author of the Mistral AI technical report Pixtral 12B.
Duc Pham
Mistral AI
Duc Pham is listed as an author of the Mistral AI technical report Pixtral 12B.
Elad Segal
Google Gemini
Public report authorship links Elad Segal to the Gemma 3n Technical Report at Google.
Eric Chu
Google Gemini
Public report authorship links Eric Chu to the Gemma 3n Technical Report at Google.
Fei Xia
Google Gemini
Public report authorship links Fei Xia to the Gemma 3n Technical Report at Google.
Hongxia Yang
DeepSeek
Hongxia Yang is listed as an author of the DeepSeek technical report Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling.
Huiyu Wang
Google Gemini
Public report authorship links Huiyu Wang to the Gemma 3n Technical Report at Google.
Jaime Carbonell
Mistral AI
Jaime Carbonell is listed as an author of the Mistral AI technical report Pixtral 12B.
Jeffrey Ding
Google Gemini
Public report authorship links Jeffrey Ding to the Gemma 3n Technical Report at Google.
Jing Yu Koh
Google Gemini
Public report authorship links Jing Yu Koh to the Gemma 3n Technical Report at Google.
Jon Lamprecht
Mistral AI
Jon Lamprecht is listed as an author of the Mistral AI technical report Pixtral 12B.
Jules Ponce
Meta AI
Jules Ponce is listed as an author of the Meta AI technical report Chameleon: Mixed-Modal Early-Fusion Foundation Models.
Jun-Hyuk Ahn
Mistral AI
Jun-Hyuk Ahn is listed as an author of the Mistral AI technical report Pixtral 12B.
Kevin Albrecht
Google Gemini
Public report authorship links Kevin Albrecht to the Gemma 3n Technical Report at Google.
Laurent Mouchere
Google Gemini
Public report authorship links Laurent Mouchere to the Gemma 3n Technical Report at Google.
Limin Zhu
Google Gemini
Public report authorship links Limin Zhu to the Gemma 3n Technical Report at Google.
Livio Baldini Soares
Google Gemini
Public report authorship links Livio Baldini Soares to the Gemma 3n Technical Report at Google.
Maciej Abramczyk
Google Gemini
Public report authorship links Maciej Abramczyk to the Gemma 3n Technical Report at Google.
Marcus Hutter
Google Gemini
Public report authorship links Marcus Hutter to the Gemma 3n Technical Report at Google.
Michal Matena
Google Gemini
Public report authorship links Michal Matena to the Gemma 3n Technical Report at Google.
Mingyang Chen
Meta AI
Mingyang Chen is listed as an author of the Meta AI technical report Chameleon: Mixed-Modal Early-Fusion Foundation Models.
Mohammad Sadegh Sharifi
Google Gemini
Public report authorship links Mohammad Sadegh Sharifi to the Gemma 3n Technical Report at Google.
Noor Alabdulmohsin
Google Gemini
Public report authorship links Noor Alabdulmohsin to the Gemma 3n Technical Report at Google.
Oliver Groth
Google Gemini
Public report authorship links Oliver Groth to the Gemma 3n Technical Report at Google.
Olivia Watkins
Google Gemini
Public report authorship links Olivia Watkins to the Gemma 3n Technical Report at Google.
Oscar Klimovskikh
Google Gemini
Public report authorship links Oscar Klimovskikh to the Gemma 3n Technical Report at Google.
Paul A. Crook
Mistral AI
Paul A. Crook is listed as an author of the Mistral AI technical report Pixtral 12B.
Philip Torr
Google Gemini
Public report authorship links Philip Torr to the Gemma 3n Technical Report at Google.
Pooja Rao
Google Gemini
Public report authorship links Pooja Rao to the Gemma 3n Technical Report at Google.
Po-Sen Huang
Google Gemini
Public report authorship links Po-Sen Huang to the Gemma 3n Technical Report at Google.
Qiaochu Chen
Google Gemini
Public report authorship links Qiaochu Chen to the Gemma 3n Technical Report at Google.
Qimin Chen
Google Gemini
Public report authorship links Qimin Chen to the Gemma 3n Technical Report at Google.
Roman Ring
Google Gemini
Public report authorship links Roman Ring to the Gemma 3n Technical Report at Google.
Sai Praneeth Karimireddy
Google Gemini
Public report authorship links Sai Praneeth Karimireddy to the Gemma 3n Technical Report at Google.
Samy Bengio
Google Gemini
Public report authorship links Samy Bengio to the Gemma 3n Technical Report at Google.
Shakti Sharma
Google Gemini
Public report authorship links Shakti Sharma to the Gemma 3n Technical Report at Google.
Sid Mittal
Google Gemini
Public report authorship links Sid Mittal to the Gemma 3n Technical Report at Google.
Stephanie Houde
Google Gemini
Public report authorship links Stephanie Houde to the Gemma 3n Technical Report at Google.
Stephan Rabanser
Google Gemini
Public report authorship links Stephan Rabanser to the Gemma 3n Technical Report at Google.
Sunita Chandrasekaran
Google Gemini
Public report authorship links Sunita Chandrasekaran to the Gemma 3n Technical Report at Google.
Surabhi Swaroop
Google Gemini
Public report authorship links Surabhi Swaroop to the Gemma 3n Technical Report at Google.
Vikas Sindhwani
Google Gemini
Public report authorship links Vikas Sindhwani to the Gemma 3n Technical Report at Google.
Vinitha Jeyakumar
Google Gemini
Public report authorship links Vinitha Jeyakumar to the Gemma 3n Technical Report at Google.
Weixuan Wang
Google Gemini
Public report authorship links Weixuan Wang to the Gemma 3n Technical Report at Google.
Wenxin Zou
Google Gemini
Public report authorship links Wenxin Zou to the Gemma 3n Technical Report at Google.
Wesley H. Tiong
Meta AI
Wesley H. Tiong is listed as an author of the Meta AI technical report Chameleon: Mixed-Modal Early-Fusion Foundation Models.
Xin Wang
Mistral AI
Xin Wang is listed as an author of the Mistral AI technical report Pixtral 12B.
Yinghui Xu
Google Gemini
Public report authorship links Yinghui Xu to the Gemma 3n Technical Report at Google.
Yuchen Yang
Meta AI
Yuchen Yang is listed as an author of the Meta AI technical report Chameleon: Mixed-Modal Early-Fusion Foundation Models.
Abhijit Guha Roy
Google Gemini
Google researcher whose publications include the Gemma 3n technical report.
Alberto Mario Cadeddu
Meta AI
Senior AI research scientist at Meta and affiliate researcher at MIT working on computer vision and machine learning.
Aleks Hartholz
Google Gemini
Google researcher whose publications include the Gemma 3n, Gemma 3, and Gemma 2 technical reports.
Anelia Angelova
Google Gemini
Anelia Angelova works on robotics, computer vision, and machine learning, and her public bio notes more than four years at Google DeepMind before becoming VP of AI at Humane.
Carl Vondrick
Google Gemini
Professor of computer science at Columbia University whose public research focuses on computer vision, video understanding, robotics, and machine learning.
Elizabeth Cole
Google Gemini
Researcher at Google DeepMind with public publications on language modeling, multimodal systems, and speech generation, including Gemma 3n, CT5, and ELLA.
Fei-Fei Li
Meta AI
Computer scientist known for work in computer vision, machine learning, and human-centered AI.
Frederik Ebert
Google Gemini
Google researcher whose publications include the Gemma 3n technical report.
Geneviève Dorkenwald
Meta AI
Research scientist at FAIR working on multimodal systems.
Graham Neubig
Mistral AI
Computer scientist at Carnegie Mellon University whose work spans machine learning, natural language processing, and human language technologies. His public homepage lists recent work including Pixtral and collaborations with Mistral AI.
Kira Radinsky
Google Gemini
Researcher working on multimodal and large language models, including Gemma 3n.
Luke M. Zettlemoyer
Meta AI
Professor in computer science and engineering at the University of Washington, scientist at the Allen Institute for Artificial Intelligence, and co-director of the UW NLP group.
Madhu Krishna
Meta AI
Research scientist at Meta working on multimodal reasoning, vision-language models, multimodal generation, and compression. His homepage highlights a background spanning machine learning, computer vision, and NLP.
Nathan Schuh
Google Gemini
Research scientist at Google DeepMind focused on scaling frontier models and advancing the Gemma family of open models.
Nat Levine
Google Gemini
Research scientist at Google DeepMind interested in reasoning and multimodal understanding in machine learning and AI systems.
Pankaj Doshi
Google Gemini
Research scientist at Google DeepMind whose work spans sequential decision making, multiagent systems, and responsible AI.
Saurav Belkhale
Google Gemini
Saurav Belkhale is a researcher at Google DeepMind working on dexterous robot manipulation at the intersection of control, computer vision, and machine learning.
Sergio de Cesare
Google Gemini
Researcher working on multimodal foundation models, including Gemma 3n.
Tao Ge
Google Gemini
Research scientist at Google DeepMind working on large language models, machine translation, and natural language processing.
Tianhe Yu
Meta AI
Research scientist at Meta working on embodied AI, robotics, and reinforcement learning.
Udit Sodhi
Meta AI
Research scientist at Meta whose public work covers embodied AI, language agents, and multimodal systems; his arXiv author results include the Chameleon multimodal model paper.
Urvashi Khandelwal
Mistral AI
Senior research scientist at Mistral AI working on domain knowledge, factuality, efficiency, and personalization in large language models.
Young-Min Kim
Google Gemini
Staff research scientist at Google DeepMind in Mountain View working on multimodal language model pretraining and post-training.
Zhitao Ying
Google Gemini
Zhitao Ying is a Research Scientist at Google DeepMind.