LLMpeople
Home People Organizations Reports Fields Schools
Public Atlas People first, reports as evidence, organizations as context.

Atlas / Reports / Detail

Nemotron-4 15B Technical Report

Large Language Models report from NVIDIA with 19 connected researchers in the LLMpeople atlas.

NVIDIA2024-02-2619 researchers
Field
Large Language Models
Organization
NVIDIA
arXiv
2402.16819

Canonical link

https://arxiv.org/abs/2402.16819

Connected researchers

Hanlin Tang portrait
Researcher 3 reports

Hanlin Tang

Cohere / NVIDIA

Hanlin Tang is a researcher at Cohere. His public page says his work focuses on foundation models, large language model post-training, reinforcement learning, and vision-language or language-model agents, and that he previously held research internships at NVIDIA and the Vector Institute.

CohereNVIDIA
Sanjiv Kumar portrait
Researcher 4 reports

Sanjiv Kumar

Google Gemini / NVIDIA

Sanjiv Kumar is a Google Fellow and vice president at Google Research. His public homepage says he leads teams working on large machine learning foundation models and generative AI, has spent more than 25 years building machine learning systems and products, and received a PhD in computer science from Carnegie Mellon University in 2005.

Google GeminiNVIDIA
United States
Prasanna Parthasarathi portrait
Researcher 2 reports

Prasanna Parthasarathi

NVIDIA

Prasanna Parthasarathi is a research scientist at Huawei Noah's Ark Lab in Montreal. His public speaker and lab-profile pages say he collaborates with Mila and McGill University, works on natural language processing, dialogue systems, and social simulation, and completed a PhD at McGill University in 2022 under Joelle Pineau.

NVIDIA
Boris Ginsburg portrait
Researcher 2 reports

Boris Ginsburg

NVIDIA

Boris Ginsburg is a principal engineer and research scientist at NVIDIA whose work focuses on efficient machine learning and deep learning for speech recognition, language processing, and computer vision.

NVIDIA
Dilek Hakkani-Tur portrait
Researcher 3 reports

Dilek Hakkani-Tur

NVIDIA

Dilek Hakkani-Tur is a Professor of Computer Science at the University of Illinois Urbana-Champaign and an Amazon Scholar at Amazon Health Science. Her UIUC faculty profile says her research interests include conversational AI, natural language and speech processing, spoken dialogue systems, and machine learning for language processing.

NVIDIA
Bryan Catanzaro portrait
Researcher 7 reports

Bryan Catanzaro

NVIDIA

Vice President of Applied Deep Learning Research at NVIDIA, leading work on conversational AI, generative AI, and accelerated deep learning software.

NVIDIA
Saurav Muralidharan portrait
Researcher 1 reports

Saurav Muralidharan

NVIDIA

Public report authorship links Saurav Muralidharan to the Nemotron-4 15B Technical Report at NVIDIA.

NVIDIA
Prathyusha Kamesetty portrait
Researcher 1 reports

Prathyusha Kamesetty

NVIDIA

Public report authorship links Prathyusha Kamesetty to the Nemotron-4 15B Technical Report at NVIDIA.

NVIDIA
Pramod Kumbhare portrait
Researcher 1 reports

Pramod Kumbhare

NVIDIA

Member of technical staff at NVIDIA Research focused on language models, deep learning, and efficient training systems.

NVIDIA
Pradeep Dasigi portrait
Researcher 1 reports

Pradeep Dasigi

NVIDIA

Research scientist on the AllenNLP team at the Allen Institute for AI, where his homepage highlights work on open language models such as OLMo and Tulu and a focus on post-training language models.

NVIDIA
1 likes
Carlos E. Jimenez portrait
Researcher 1 reports

Carlos E. Jimenez

NVIDIA

Research scientist at NVIDIA with publications in machine learning and embodied AI.

NVIDIA
Sang Michael Xie portrait
Researcher 1 reports

Sang Michael Xie

NVIDIA

Researcher at OpenAI focused on data-centric methods for foundation models, including synthetic data and reinforcement learning. Previously a research scientist at Meta GenAI; earned BS, MS, and PhD degrees in computer science at Stanford.

NVIDIA
Ali Payani portrait
Researcher 3 reports

Ali Payani

NVIDIA

Public report authorship links Ali Payani to the OLMoE: Open Mixture-of-Experts Language Models at Ai2.

NVIDIA
Rajarshi Das portrait
Researcher 1 reports

Rajarshi Das

NVIDIA

Research scientist at NVIDIA.

NVIDIA
Mohit Bansal portrait
Researcher 1 reports

Mohit Bansal

NVIDIA

Public report authorship links Mohit Bansal to the Nemotron-4 15B Technical Report at NVIDIA.

NVIDIA
Dragomir Radev portrait
Researcher 1 reports

Dragomir Radev

NVIDIA

Dragomir Radev is an Eminent Professor of natural language processing at MBZUAI. His research spans NLP, information retrieval, question answering, and summarization, and he is also a coauthor of the Nemotron-4 15B technical report.

NVIDIA
Yejin Choi portrait
Researcher 1 reports

Yejin Choi

NVIDIA

Public report authorship links Yejin Choi to the Nemotron-4 15B Technical Report at NVIDIA.

NVIDIA
Michael Flaherty portrait
Researcher 1 reports

Michael Flaherty

NVIDIA

Public report authorship links Michael Flaherty to the Nemotron-4 15B Technical Report at NVIDIA.

NVIDIA
Jianfeng Gao portrait
Researcher 2 reports

Jianfeng Gao

NVIDIA

Public report authorship links Jianfeng Gao to the Nemotron-4 15B Technical Report at NVIDIA.

NVIDIA

LLMpeople is a public atlas for discovering frontier AI researchers with context, provenance, and respect.

Privacy ยท Terms