Atlas / Reports / Detail
Tulu 2: Demystifying the Effectiveness of RLHF and Reinforcement Learning with Human Feedback
Large Language Models
Connected researchers
Yizhong Wang
Ai2
Yizhong Wang is a research scientist at the Allen Institute for AI and incoming assistant professor at the University of Washington whose work focuses on language models, agents, reasoning, and open-source AI.
Yuling Gu
Ai2
Yuling Gu is a PhD student at the NYU Center for Data Science studying large language models, machine reasoning, and robust evaluation. She was previously a predoctoral researcher at the Allen Institute for AI, where she contributed to OLMo, OLMo 2, OLMo 3, TULU 3, OLMoE, and OLMES.
Bill Yuchen Lin
Ai2
Researcher working on language models, agents, and retrieval-augmented generation; currently at xAI and incoming assistant professor at the University of Washington, previously a research scientist at the Allen Institute for AI.
Noah A. Smith
Ai2
Noah A. Smith is a computer scientist and professor at the University of Washington, where he serves as Vice Provost for Artificial Intelligence and co-directs the OLMo open language modeling effort with Ai2. His research focuses on natural language processing, machine learning, and evaluation methodology.
Chandra Bhagavatula
Ai2
Research scientist at Ai2 focused on natural language processing, commonsense reasoning, long-form generation, narrative intelligence, and text-based games.
Nima Rajani
Ai2
Nima Rajani is a research scientist at Ai2 whose work focuses on trustworthy, interpretable, and verifiable AI systems.
Jena D. Hwang
Ai2
Research scientist at the Allen Institute for AI (Ai2) whose work focuses on natural language understanding and commonsense reasoning.
Jesujoba Alabi
Ai2
Researcher in natural language processing, low-resource languages, machine translation, and responsible AI; publicly listed as a PhD candidate at UC Santa Barbara and a co-author of Tulu 2.
Julian Martin Eisenschlos
Ai2
Julian Martin Eisenschlos is a Research Scientist at Ai2. His work focuses on natural language processing, language models, and instruction tuning, including contributions to the Tulu 2 project.
Tyler Scialom
Ai2
Research scientist at Ai2 working on personalized language models, instruction tuning, and reinforcement learning from human feedback.
Tony Gracious
Ai2
Tony Gracious completed his PhD in the Department of Computer Science and Automation at IISc Bangalore. His work includes representation learning, temporal point processes, and higher-order interaction forecasting, and he later joined Dolby's Advanced Technology Group in Bangalore.
Nathan Lambert
Ai2
Machine learning scientist at Ai2 working on reinforcement learning, language models, and online social systems.
Jacob Morrison
Ai2
Jacob Morrison is a researcher whose work spans language model post-training, alignment, and evaluation. His public research page highlights projects including Tulu 2, Tulu 3, OLMo 2, and RewardBench.
Nicholas Ruas
Ai2
Machine learning engineer at Ai2 whose public work focuses on open language models, post-training, and evaluation.
Aryo Pradipta Gema
Ai2
Research engineer at Ai2 focused on post-training and data for open language models.
Kevin Gu
Ai2
Research scientist at Ai2 working on large language models, machine learning with human feedback, and related topics; previously at Stanford and MIT.
Mustafa Hajij
Ai2
Mustafa Hajij is a research scientist at Ai2 and an adjunct professor in the Department of Computer Science at the University of Southern Maine. His research spans graph machine learning, geometric learning, and applied mathematics.
Alexandre Ramé
Ai2
Profile still being enriched.
Jiacheng Liu
Ai2
Profile still being enriched.
Hanjie Chen
Ai2
Profile still being enriched.
Jeremy Dwivedi-Yu
Ai2
Profile still being enriched.
Maxwell Roberts
Ai2
Profile still being enriched.
Ming Yin
Ai2
Profile still being enriched.
Noel Nabeshima
Ai2
Profile still being enriched.