PaliGemma: A versatile 3B VLM for transfer

Vision-Language Models report from Google Gemini with 14 connected researchers in the LLMpeople atlas.

Google Gemini · 2024-07-10 · 14 researchers
Field: Vision-Language Models
Organization: Google Gemini
arXiv: 2407.07726

Canonical link

https://arxiv.org/abs/2407.07726

Connected researchers

Researcher 3 reports

Radu Soricut

Google Gemini

Research scientist focused on machine learning and natural language understanding, with work spanning machine translation, semantic parsing, and large-scale language modeling.

Google Gemini
Researcher 3 reports

Jiahui Yu

Google Gemini

Jiahui Yu is a research scientist at Google DeepMind working on multimodal learning and large language models.

Google Gemini
Researcher 2 reports

Nikolay Savinov

Google Gemini

Research scientist at Google DeepMind on the Gemini team, working on multimodal AI.

Google Gemini
Researcher 1 reports

Lucas Beyer

Google Gemini

Lucas Beyer is a research scientist at Google DeepMind. His public homepage highlights work across representation learning, multimodal models, and large-scale machine learning systems.

Google Gemini
Researcher 1 reports

Koray Kavukcuoglu

Google Gemini

Chief Technology Officer at Google DeepMind, with work spanning machine learning and reinforcement learning.

Google Gemini
Researcher 4 reports

Yonghui Wu

Google researcher focused on machine translation, natural language processing, and machine learning.

Google researcher whose public profile says he joined Google in September 2008 and has been with the Google Brain team since January 2015, with interests spanning information retrieval, learning to rank, machine learning, machine translation, and natural language processing.

ByteDance Seed / Google Gemini
Researcher 2 reports

Matthieu Devin

Google Gemini

Research scientist at Google DeepMind based in Paris, focused on deep learning and computer vision.

Google Gemini
Researcher 1 reports

Xiaohua Zhai

Google Gemini

Xiaohua Zhai is a researcher on the Google Research team in Zurich whose work focuses on large multimodal models and efficient deep learning.

Google Gemini
Researcher 1 reports

Xiuye Gu

Google Gemini

Xiuye Gu is a researcher whose public work focuses on vision-language modeling and machine learning systems.

Google Gemini
Researcher 1 reports

Maxwell Collins

Google Gemini

Maxwell Collins is a Research Scientist at Google DeepMind.

Google Gemini
Researcher 1 reports

Nan Ding

Google Gemini

Researcher at Google Research whose public work includes multimodal and vision-language modeling, with arXiv publications tied to PaliGemma and related transfer work.

Google Gemini
Researcher 1 reports

William Kolesnikov

Google Gemini

Staff software engineer at Google DeepMind working on post-training, alignment, multimodal models, and data filtering. He previously worked on hardware and software co-design for machine learning.

Google Gemini
Researcher 2 reports

Siyuan Li

Google Gemini / NVIDIA

Profile still being enriched.

Google Gemini / NVIDIA
Researcher 1 reports

Xinyi Chen

Google Gemini

Profile still being enriched.

Google Gemini

LLMpeople is a public atlas for discovering frontier AI researchers with context, provenance, and respect.