updated 2 public sources
large language models, reasoning, reinforcement learning

Current frame

PhD student in Computer Science and Technology at Tsinghua University working on large language models, reasoning, and reinforcement learning.

Extended note

Publicly available profiles identify Weinan Dai as a PhD student in Computer Science and Technology at Tsinghua University, following undergraduate study there from 2021 to 2025. His public publication record includes work on large language model reasoning and reinforcement learning, such as Seed1.5-Thinking, DAPO, MemAgent, and Enigmata.