NVLM: Open Frontier-Class Multimodal LLMs
A report on multimodal language models from NVIDIA, with 10 connected researchers in the LLMpeople atlas.
Connected researchers
Yuyin Zhou
UC Santa Cruz
Assistant Professor of Computer Science and Engineering at UC Santa Cruz working on multimodal learning, computer vision, and medical image analysis.
Zhaowen Wang
NVIDIA
Research manager at NVIDIA working on large-scale distributed pretraining, synthetic data, multimodal LLMs, and computer vision.
Jianfeng Gao
Microsoft Research
Researcher at Microsoft Research working on natural language processing and multimodal foundation models, with work spanning dialogue systems, retrieval, reasoning, and vision-language models.
Thaddeus Culhane
NVIDIA
Research scientist at NVIDIA working on multimodal AI, especially language and vision models.
Andy Yao
NVIDIA
Research scientist at NVIDIA with public publications on multimodal language models and visual instruction tuning, including NVLM, VILA, and Video2Flow.
Caiming Xiong
Salesforce
Vice President of AI Research and General Manager of AI Platforms at Salesforce.
Ran Tian
NVIDIA
Research scientist at NVIDIA working on multimodal language models and vision-language research, with public publications including NVLM, VILA, and Visual Role Play.
Rulin Shao
NVIDIA
PhD student at the University of Washington and research intern at NVIDIA, working on multimodal reasoning, vision-language models, and embodied AI.
Shyamal Anadkat
NVIDIA
Research scientist at NVIDIA working on AI agents, multimodal systems, and robotics, including the NVLM project.
Weizhu Chen
Microsoft
Distinguished scientist and managing director at Microsoft Research working on natural language processing and large language models.