updated 2 public sources
large language models, RLHF, reasoning, alignment

Current frame

Researcher focused on large language model alignment, RLHF, and reasoning.

Extended note

Wei Shen is publicly associated with large language model research through an OpenReview profile and a personal LLM blog. Publicly listed work covers RLHF policy filtration, adaptive control of reasoning steps, long-sequence behavior under rotary position embeddings (RoPE), and data-scaling effects in preference learning. Together, these topics suggest a focus on alignment, reasoning, and training methodology for LLMs.