Current frame
Researcher focused on large language model alignment, RLHF, and reasoning.
Atlas / People / Detail
Public OpenReview and personal homepage/blog for Wei Shen show work on reinforcement learning from human feedback, adaptive chain-of-thought control, long-context behavior, and LLM data-scaling analysis.
Profile status: updated
Researcher focused on large language model alignment, RLHF, and reasoning.
Wei Shen is publicly associated with large language model research through an OpenReview profile and a personal LLM blog. Publicly listed work includes research on RLHF policy filtration, adaptive reasoning-step control, long-sequence behavior under RoPE, and data-scaling effects in preference learning, indicating a focus on alignment, reasoning, and training methodology for LLMs.