Atlas / Reports / Detail
MM1.5: Methods, Analysis and Insights from Multimodal LLM Fine-tuning
Multimodal Language Models report from Apple with 23 connected researchers in the LLMpeople atlas.
Connected researchers
Zhe Gan
Apple
Machine learning researcher at Apple working on large multimodal foundation models, video generation, and vision-language systems.
Jean-Philippe Fauconnier
Apple
Research scientist at Apple Foundation Models working on generative AI, large language models, and multimodal models.
Sam Dodge
Apple
Sam Dodge is an Apple AI/ML-affiliated researcher. The linked arXiv paper lists him as a coauthor of MM1 and shows his affiliation as Apple AI/ML in Cupertino, California.
Philipp Dufter
Apple
Research scientist at Apple Foundation Models with interests in natural language processing, structured generation, controllable generation, and algorithmic efficiency.
Bowen Zhang
Apple
Research scientist at Apple working on large language models, vision-language models, and model scaling.
Dhruti Shah
Apple
Researcher working on machine learning, vision and language, computer vision, diffusion, and generative AI.
Xianzhi Du
Apple
Research scientist at Apple working on language and vision-language modeling, AI agents, and post-training.
Haotian Zhang
Apple
Haotian Zhang is a research scientist on Apple AI/ML's Visual Intelligence team. His homepage says he works on embodied agents that understand the world from 2D and 3D image data as well as natural language, previously interned at Microsoft Research and Azure AI, completed a PhD in electrical and computer engineering at the University of Washington in 2022, and earlier earned master's degrees at Washington and a bachelor's degree at Shanghai Jiao Tong University.
Zirui Wang
Apple
Senior researcher at Apple working on large models, multimodal learning, and speech processing, according to his personal site.
Peter Grasch
Apple
Research scientist at Apple focused on state-of-the-art machine learning and computer vision methods.
Yinfei Yang
Apple
Research scientist at Apple focused on natural language processing and machine learning.
Mingfei Gao
Apple
Researcher working on machine learning, optimization, and sequential data.
Nina Wenzel
Apple
Nina Wenzel is listed as an author of the Apple technical report MM1.5: Methods, Analysis and Insights from Multimodal LLM Fine-tuning.
Forrest Huang
Apple
Research scientist at Apple Foundation Models working on efficient training and multimodal language models.
Keen You
Apple
Research scientist at Apple specializing in post-training, reinforcement learning, and AI agents.
Aleksei Timofeev
Apple
Research scientist whose public OpenReview profile lists work on multimodal representation learning, speech synthesis, and personalized voice generation.
Hong-You Chen
Apple
AI and machine learning engineer at Apple working on multimodal foundation models; previously worked at Snap and the University of Southern California.
Zhengfeng Lai
Apple
Zhengfeng Lai is an ML Research Scientist at Apple AI/ML. His self-authored CV lists a PhD in Electrical and Computer Engineering from the University of California, Davis, prior Apple internship work, and publications in multimodal and vision-language learning.
Haoxuan You
Apple
Research scientist on Apple Foundation Models whose work focuses on machine learning systems, multimodal foundation models, and AI agents.
Afshin Dehghan
Apple
Research scientist at Apple focused on computer vision, multimodal learning, and robotics.