LLMpeople


Tulu 2: Demystifying the Effectiveness of RLHF and Reinforcement Learning with Human Feedback

Large Language Models report from Ai2 with 26 connected researchers in the LLMpeople atlas.

Ai2 · 2023-11-17 · 26 researchers
Field: Large Language Models
Organization: Ai2
arXiv: 2311.10702

Canonical link: https://arxiv.org/abs/2311.10702

Connected researchers

Researcher 3 reports

Yuling Gu

Ai2

Yuling Gu is a PhD student at the NYU Center for Data Science studying large language models, machine reasoning, and robust evaluation. She was previously a predoctoral researcher at the Allen Institute for AI, where she contributed to OLMo, OLMo 2, OLMo 3, TULU 3, OLMoE, and OLMES.

Researcher 4 reports

Nathan Lambert

Ai2

Machine learning scientist at Ai2 working on reinforcement learning, language models, and online social systems.

United States
Researcher 2 reports

Jiacheng Liu

Ai2

Jiacheng Liu is a researcher at Ai2 whose work focuses on improving the capabilities and understanding of language models. His public homepage says he is currently a PhD student at New York University and has previously spent time at Princeton and Google Research.

Researcher 3 reports

Jacob Morrison

Ai2

Jacob Morrison is a researcher whose work spans language model post-training, alignment, and evaluation. His public research page highlights projects including Tulu 2, Tulu 3, OLMo 2, and RewardBench.

Researcher 7 reports

Noah A. Smith

Ai2

Noah A. Smith is a computer scientist and professor at the University of Washington, where he serves as Vice Provost for Artificial Intelligence and co-directs the OLMo open language modeling effort with Ai2. His research focuses on natural language processing, machine learning, and evaluation methodology.

United States
Researcher 4 reports

Yizhong Wang

Ai2

Yizhong Wang is a research scientist at the Allen Institute for AI and incoming assistant professor at the University of Washington whose work focuses on language models, agents, reasoning, and open-source AI.

United States
Researcher 3 reports

Jena D. Hwang

Ai2

Research scientist at the Allen Institute for AI (Ai2) whose work focuses on natural language understanding and commonsense reasoning.

Researcher 1 report

Hanjie Chen

Ai2

Hanjie Chen is listed as an author of the Ai2 technical report Tulu 2: Demystifying the Effectiveness of RLHF and Reinforcement Learning with Human Feedback.

Researcher 1 report

Nima Rajani

Ai2

Nima Rajani is a research scientist at Ai2 whose work focuses on trustworthy, interpretable, and verifiable AI systems.

Researcher 2 reports

Nicholas Ruas

Ai2

Machine learning engineer at Ai2 whose public work focuses on open language models, post-training, and evaluation.

Researcher 1 report

Ming Yin

Ai2

Ming Yin is listed as an author of the Ai2 technical report Tulu 2: Demystifying the Effectiveness of RLHF and Reinforcement Learning with Human Feedback.

Researcher 1 report

Chandra Bhagavatula

Ai2

Research scientist at Ai2 focused on natural language processing, commonsense reasoning, long-form generation, narrative intelligence, and text-based games.

Researcher 1 report

Tyler Scialom

Ai2

Research scientist at Ai2 working on personalized language models, instruction tuning, and reinforcement learning from human feedback.

Researcher 1 report

Ziyi Yang

Ai2

Ziyi Yang is listed as an author of the Ai2 technical report Tulu 2: Demystifying the Effectiveness of RLHF and Reinforcement Learning with Human Feedback.

Researcher 1 report

Mustafa Hajij

Ai2

Mustafa Hajij is a research scientist at Ai2 and an adjunct professor in the Department of Computer Science at the University of Southern Maine. His research spans graph machine learning, geometric learning, and applied mathematics.

Researcher 1 report

Yizhu Jiao

Ai2

Yizhu Jiao is listed as an author of the Ai2 technical report Tulu 2: Demystifying the Effectiveness of RLHF and Reinforcement Learning with Human Feedback.

Researcher 1 report

Bill Yuchen Lin

Ai2

Researcher working on language models, agents, and retrieval-augmented generation; currently at xAI and incoming assistant professor at the University of Washington, previously a research scientist at the Allen Institute for AI.

Researcher 1 report

Aryo Pradipta Gema

Ai2

Research engineer at Ai2 focused on post-training and data for open language models.

Researcher 1 report

Julian Martin Eisenschlos

Ai2

Julian Martin Eisenschlos is a Research Scientist at Ai2. His work focuses on natural language processing, language models, and instruction tuning, including contributions to the Tulu 2 project.

Researcher 1 report

Jeremy Dwivedi-Yu

Ai2

Jeremy Dwivedi-Yu is listed as an author of the Ai2 technical report Tulu 2: Demystifying the Effectiveness of RLHF and Reinforcement Learning with Human Feedback.

Researcher 2 reports

Alexandre Ramé

Ai2

Alexandre Ramé is a research scientist at Google DeepMind and an adjunct professor at Ecole Polytechnique. His homepage says he previously held research roles at NYU and SCAI / Sorbonne Université, completed a PhD in machine learning at Ecole Polytechnique and ENS Paris-Saclay, and works on post-training and alignment for Gemma LLMs.

Researcher 1 report

Jesujoba Alabi

Ai2

Researcher in natural language processing, low-resource languages, machine translation, and responsible AI; publicly listed as a PhD candidate at UC Santa Barbara and a co-author of Tulu 2.

Researcher 1 report

Noel Nabeshima

Ai2

Noel Nabeshima is listed as an author of the Ai2 technical report Tulu 2: Demystifying the Effectiveness of RLHF and Reinforcement Learning with Human Feedback.

Researcher 1 report

Tony Gracious

Ai2

Tony Gracious completed his PhD in the Department of Computer Science and Automation at IISc Bangalore. His work includes representation learning, temporal point processes, and higher-order interaction forecasting, and he later joined Dolby's Advanced Technology Group in Bangalore.

India

