updated 2 public sources
large language modelsreinforcement learningreasoning models

Current frame

Researcher at ByteDance; studied computer science at the University of Science and Technology of China.