updated 3 public sources
reinforcement learning, large language models, reasoning models

Current frame

ByteDance researcher focused on reinforcement learning for large language models.

Extended note

Public sources identify Ruofei Zhu as a ByteDance researcher with expertise in reinforcement learning. DBLP attributes multiple 2025 papers on LLMs and reasoning to this profile, including DAPO, VAPO, and the Seed1.5-Thinking report. An OpenReview profile under the same name lists a confirmed @bytedance.com email and an MS in the Program for Software Engineering at Peking University (2017-2020).