updated 3 public sources
large language models, preference modeling, reward modeling, code intelligence, document parsing

Current frame

PhD student at Fudan University working on LLM alignment, preference modeling, and code intelligence.