MM1.5: Methods, Analysis and Insights from Multimodal LLM Fine-tuning

Multimodal Language Models report from Apple with 23 connected researchers in the LLMpeople atlas.

Apple · 2024-09-30 · 23 researchers
Field: Multimodal Language Models
Organization: Apple
arXiv: 2409.20566
Canonical link: https://arxiv.org/abs/2409.20566

Connected researchers

Researcher 2 reports

Zhe Gan

Apple

Machine learning researcher at Apple working on large multimodal foundation models, video generation, and vision-language systems.

Researcher 2 reports

Jean-Philippe Fauconnier

Apple

Research scientist at Apple Foundation Models working on generative AI, large language models, and multimodal models.

Researcher 2 reports

Sam Dodge

Apple

Sam Dodge is an Apple AI/ML-affiliated researcher. The linked arXiv paper lists him as a coauthor of MM1 and shows his affiliation as Apple AI/ML in Cupertino, California.

Researcher 2 reports

Philipp Dufter

Apple

Research scientist at Apple Foundation Models with interests in natural language processing, structured generation, controllable generation, and algorithmic efficiency.

Researcher 2 reports

Bowen Zhang

Apple

Research scientist at Apple working on large language models, vision-language models, and model scaling.

Researcher 2 reports

Dhruti Shah

Apple

Researcher working on machine learning, vision and language, computer vision, diffusion, and generative AI.

Researcher 2 reports

Xianzhi Du

Apple

Research scientist at Apple working on language and vision-language modeling, AI agents, and post-training.

Researcher 2 reports

Haotian Zhang

Apple

Haotian Zhang is a research scientist on Apple AI/ML's Visual Intelligence team. According to his homepage, he works on embodied agents that understand the world from 2D and 3D image data as well as natural language. He previously interned at Microsoft Research and Azure AI, completed a PhD in electrical and computer engineering at the University of Washington in 2022, and earlier earned master's degrees at the University of Washington and a bachelor's degree at Shanghai Jiao Tong University.

Researcher 2 reports

Zirui Wang

Apple

Senior researcher at Apple working on large models, multimodal learning, and speech processing, according to his personal site.

Researcher 2 reports

Peter Grasch

Apple

Research scientist at Apple focused on state-of-the-art machine learning and computer vision methods.

Researcher 2 reports

Yinfei Yang

Apple

Research scientist at Apple focused on natural language processing and machine learning.

Researcher 1 report

Mingfei Gao

Apple

Researcher working on machine learning, optimization, and sequential data.

Researcher 1 report

Nina Wenzel

Apple

Nina Wenzel is listed as an author of the Apple technical report MM1.5: Methods, Analysis and Insights from Multimodal LLM Fine-tuning.

Researcher 1 report

Forrest Huang

Apple

Research scientist at Apple Foundation Models working on efficient training and multimodal language models.

Researcher 1 report

Keen You

Apple

Research scientist at Apple specializing in post-training, reinforcement learning, and AI agents.

Researcher 1 report

Aleksei Timofeev

Apple

Research scientist whose public OpenReview profile lists work on multimodal representation learning, speech synthesis, and personalized voice generation.

Researcher 1 report

Hong-You Chen

Apple

AI and machine learning engineer at Apple working on multimodal foundation models; previously worked at Snap and the University of Southern California.

Researcher 1 report

Zhengfeng Lai

Apple

Zhengfeng Lai is an ML Research Scientist at Apple AI/ML. His self-authored CV lists a PhD in Electrical and Computer Engineering from the University of California, Davis, prior Apple internship work, and publications in multimodal and vision-language learning.

Researcher 1 report

Haoxuan You

Apple

Research scientist on Apple Foundation Models whose work focuses on machine learning systems, multimodal foundation models, and AI agents.

Researcher 1 report

Afshin Dehghan

Apple

Research scientist at Apple focused on computer vision, multimodal learning, and robotics.
