LLMpeople
Home People Organizations Reports Fields Schools
Public Atlas People first, reports as evidence, organizations as context.

Atlas / Reports / Detail

Tulu 2: Demystifying the Effectiveness of RLHF and Reinforcement Learning with Human Feedback

Large Language Models report from Ai2 with 17 connected researchers in the LLMpeople atlas.

Ai22023-11-1717 researchers
Field
Large Language Models
Organization
Ai2
arXiv
2311.10702

Canonical link

https://arxiv.org/abs/2311.10702

Connected researchers

Yizhong Wang portrait
Researcher 4 reports

Yizhong Wang

Ai2

Yizhong Wang is a research scientist at the Allen Institute for AI and incoming assistant professor at the University of Washington whose work focuses on language models, agents, reasoning, and open-source AI.

Ai2
Yuling Gu portrait
Researcher 3 reports

Yuling Gu

Ai2

Yuling Gu is a PhD student at the NYU Center for Data Science studying large language models, machine reasoning, and robust evaluation. She was previously a predoctoral researcher at the Allen Institute for AI, where she contributed to OLMo, OLMo 2, OLMo 3, TULU 3, OLMoE, and OLMES.

Ai2
Bill Yuchen Lin portrait
Researcher 1 reports

Bill Yuchen Lin

Ai2

Researcher working on language models, agents, and retrieval-augmented generation; currently at xAI and incoming assistant professor at the University of Washington, previously a research scientist at the Allen Institute for AI.

Ai2
Noah A. Smith portrait
Researcher 7 reports

Noah A. Smith

Ai2

Noah A. Smith is a computer scientist and professor at the University of Washington, where he serves as Vice Provost for Artificial Intelligence and co-directs the OLMo open language modeling effort with Ai2. His research focuses on natural language processing, machine learning, and evaluation methodology.

Ai2
United States
Chandra Bhagavatula portrait
Researcher 1 reports

Chandra Bhagavatula

Ai2

Research scientist at Ai2 focused on natural language processing, commonsense reasoning, long-form generation, narrative intelligence, and text-based games.

Ai2
Nima Rajani portrait
Researcher 1 reports

Nima Rajani

Ai2

Nima Rajani is a research scientist at Ai2 whose work focuses on trustworthy, interpretable, and verifiable AI systems.

Ai2
Jena D. Hwang portrait
Researcher 3 reports

Jena D. Hwang

Ai2

Research scientist at the Allen Institute for AI (Ai2) whose work focuses on natural language understanding and commonsense reasoning.

Ai2
Jesujoba Alabi portrait
Researcher 1 reports

Jesujoba Alabi

Ai2

Researcher in natural language processing, low-resource languages, machine translation, and responsible AI; publicly listed as a PhD candidate at UC Santa Barbara and a co-author of Tulu 2.

Ai2
Julian Martin Eisenschlos portrait
Researcher 1 reports

Julian Martin Eisenschlos

Ai2

Julian Martin Eisenschlos is a Research Scientist at Ai2. His work focuses on natural language processing, language models, and instruction tuning, including contributions to the Tulu 2 project.

Ai2
Tyler Scialom portrait
Researcher 1 reports

Tyler Scialom

Ai2

Research scientist at Ai2 working on personalized language models, instruction tuning, and reinforcement learning from human feedback.

Ai2
Tony Gracious portrait
Researcher 1 reports

Tony Gracious

Ai2

Tony Gracious completed his PhD in the Department of Computer Science and Automation at IISc Bangalore. His work includes representation learning, temporal point processes, and higher-order interaction forecasting, and he later joined Dolby's Advanced Technology Group in Bangalore.

Ai2
India
Nathan Lambert portrait
Researcher 4 reports

Nathan Lambert

Ai2

Machine learning scientist at Ai2 working on reinforcement learning, language models, and online social systems.

Ai2
Jacob Morrison portrait
Researcher 3 reports

Jacob Morrison

Ai2

Jacob Morrison is a researcher whose work spans language model post-training, alignment, and evaluation. His public research page highlights projects including Tulu 2, Tulu 3, OLMo 2, and RewardBench.

Ai2
Nicholas Ruas portrait
Researcher 2 reports

Nicholas Ruas

Ai2

Machine learning engineer at Ai2 whose public work focuses on open language models, post-training, and evaluation.

Ai2
Aryo Pradipta Gema portrait
Researcher 1 reports

Aryo Pradipta Gema

Ai2

Research engineer at Ai2 focused on post-training and data for open language models.

Ai2
Kevin Gu portrait
Researcher 1 reports

Kevin Gu

Ai2

Research scientist at Ai2 working on large language models, machine learning with human feedback, and related topics; previously at Stanford and MIT.

Ai2
Mustafa Hajij portrait
Researcher 1 reports

Mustafa Hajij

Ai2

Mustafa Hajij is a research scientist at Ai2 and an adjunct professor in the Department of Computer Science at the University of Southern Maine. His research spans graph machine learning, geometric learning, and applied mathematics.

Ai2

LLMpeople is a public atlas for discovering frontier AI researchers with context, provenance, and respect.

Privacy ยท Terms