updated 3 public sources
report_authorreinforcement learningroboticsLLM post-training

Current frame

Peking University PhD student working on reinforcement learning, robotics, and LLM post-training.

Extended note

Public profiles identify Haobin Jiang as a PhD student at Peking University since 2020, following undergraduate study there from 2016 to 2020. His OpenReview profile lists reinforcement learning as an expertise area, and his linked GitHub profile describes work spanning RL, robotics, and LLM post-training.