Training language models to follow instructions with human feedback
Alignment report from OpenAI with 11 connected researchers in the LLMpeople atlas.
Connected researchers
Sandhini Agarwal
OpenAI
Sandhini Agarwal is a researcher at OpenAI. Her OpenReview profile lists that role as running from 2020 to the present, preceded by undergraduate study at Stanford University (2015–2019).
Diogo Almeida
OpenAI
Diogo Almeida is an AI researcher and a co-author of the InstructGPT paper (arXiv:2203.02155).
Pamela Mishkin
OpenAI
Research scientist at OpenAI focused on multimodal models.
Long Ouyang
OpenAI
Researcher at OpenAI who led InstructGPT and GPT-4 post-training. He previously co-founded Merlyn Mind and was an engineering director at Quora.
John Schulman
OpenAI
Research scientist at OpenAI working on reinforcement learning and robotics, with a PhD from UC Berkeley.
Katarina Slama
OpenAI
Research scientist at the UK AI Security Institute and former OpenAI member of technical staff who worked on model behavior and post-training research. Previously conducted computational neuroscience research at UC Berkeley.
Carroll Wainwright
OpenAI
Founder and AI Advisor at Metaculus and a named contributor to OpenAI's GPT-4 Technical Report.
Jeff Wu
OpenAI
Researcher at OpenAI working on language model training and evaluation, and co-author of the GPT-4 Technical Report.
Chong Zhang
OpenAI
Member of Technical Staff at OpenAI working on machine learning, reinforcement learning, natural language processing, and large language models.
Xu Jiang
OpenAI
Xu Jiang is a research scientist at OpenAI focused on large reasoning models and multimodal models. Before joining OpenAI, he worked on recommendation systems and search.
Alex Ray
OpenAI
Engineer who joined OpenAI after building radio encryption systems for small satellites at Planet Labs and working on aerial robots at Airware. He contributed to OpenAI's robotics and reinforcement learning work, including safe exploration research.