LLMpeople

Chameleon: Mixed-Modal Early-Fusion Foundation Models

A Meta AI report in the Multimodal Large Language Models field, with 20 connected researchers in the LLMpeople atlas.

Meta AI · Undated · 20 researchers
Field: Multimodal Large Language Models
Organization: Meta AI
arXiv: 2405.09818

Canonical link

https://arxiv.org/abs/2405.09818

Connected researchers

Researcher · 1 report

Asterios Katsamanis

Meta AI

Researcher at Apple working on speech, audio, and multimodal machine learning; previously a senior research scientist at SRI International and also worked at Google.

Meta AI
Researcher · 5 reports

Hao Yang

DeepSeek / Meta AI

Researcher at Moonshot AI working on multimodal large language models; previously a key member of Alibaba's Qwen team and author of work including Kimi-VL, DeepSeek-VL, and Qwen technical reports.

DeepSeek · Meta AI · Moonshot AI
Researcher · 2 reports

Mingze Li

Meta AI / Alibaba Qwen

Researcher at Alibaba Group exploring the math and science of large language models; incoming assistant professor at Nanyang Technological University.

Meta AI · Alibaba Qwen
Researcher · 1 report

Chenguang Zhu

Meta AI

Research scientist at Meta AI focused on vision-language models, large language models, and agents; public work includes the multimodal foundation model Chameleon.

Meta AI
Researcher · 4 reports

Armand Joulin

Meta AI

Armand Joulin is a researcher and the cofounder and chief scientist of Mistral AI. Public arXiv records also list him as an author of LLaMA: Open and Efficient Foundation Language Models.

Meta AI
Researcher · 1 report

Nicholas Crane

Meta AI

Research scientist at Meta working on computer vision and multimodal foundation models with an emphasis on robustness, trustworthiness, and alignment.

Meta AI
Researcher · 2 reports

Mike Lewis

Meta AI

Mike Lewis is a natural language processing researcher whose public work includes multimodal language modeling and large-scale pretraining.

Meta AI
Researcher · 1 report

Alaaeldin El-Nouby

Meta AI

Alaaeldin El-Nouby is a machine learning researcher whose public work includes multimodal and vision-language models.

Meta AI
Researcher · 1 report

Christopher Pal

Meta AI

Christopher Pal is a professor and AI researcher whose public work spans deep learning, multimodal learning, and large language models.

Meta AI
Researcher · 1 report

Srujana Merugu

Meta AI

Research scientist at Meta AI focused on multimodal and embodied AI, with interests in computer vision, deep learning, and decision making.

Meta AI
Researcher · 1 report

Khaled Saeed

Meta AI

Khaled Saeed is a Research Scientist at Meta working on efficient multimodal reasoning and AI systems.

Meta AI
Researcher · 2 reports

Faisal Azhar

Meta AI

Faisal Azhar is a PhD candidate in computer science at Stanford University. His work focuses on multimodal systems that unify text, image, and speech, together with efficient training and inference for large-scale machine learning.

Meta AI
Researcher · 1 report

Alberto Mario Cadeddu

Meta AI

Senior AI research scientist at Meta and affiliate researcher at MIT working on computer vision and machine learning.

Meta AI
Researcher · 1 report

Fei-Fei Li

Meta AI

Computer scientist known for work in computer vision, machine learning, and human-centered AI.

Meta AI
Researcher · 1 report

Geneviève Dorkenwald

Meta AI

Research scientist at FAIR working on multimodal systems.

Meta AI
Researcher · 1 report

Luke M. Zettlemoyer

Meta AI

Professor in computer science and engineering at the University of Washington, scientist at the Allen Institute for Artificial Intelligence, and co-director of the UW NLP group.

Meta AI
Researcher · 1 report

Madhu Krishna

Meta AI

Research scientist at Meta working on multimodal reasoning, vision-language models, multimodal generation, and compression. His homepage highlights a background spanning machine learning, computer vision, and NLP.

Meta AI
Researcher · 1 report

Sébastien Bubeck

Meta AI

Vice president of GenAI at Microsoft AI and a long-time machine learning researcher known for work on the foundations of reinforcement learning and bandits.

Meta AI
Researcher · 1 report

Tianhe Yu

Meta AI

Research scientist at Meta working on embodied AI, robotics, and reinforcement learning.

Meta AI
Researcher · 1 report

Udit Sodhi

Meta AI

Research scientist at Meta whose public work covers embodied AI, language agents, and multimodal systems; his arXiv author results include the Chameleon multimodal model paper.

Meta AI

LLMpeople is a public atlas for discovering frontier AI researchers with context, provenance, and respect.
