Researcher in efficient large-model training and mixture-of-experts methods.
Public OpenReview information identifies Sijun Zhang as a researcher formerly at ByteDance and later at WeChat AI, Tencent, with listed publications on mixture-of-experts methods and transformer training stability.