Current frame
AI researcher at ByteDance focused on large language model architectures.
Atlas / People / Detail
Defa Zhu is an AI researcher whose public homepage says his work focuses on large language models and stronger model architectures.
Profile status: updated
AI researcher at ByteDance focused on large language model architectures.
Defa Zhu is an AI researcher at ByteDance whose official homepage says his work focuses on large language models and stronger LLM architectures. OpenReview lists his education history as an MS student at the University of Chinese Academy of Sciences from 2017 to 2020 and an undergraduate student in mathematics at Northeastern University from 2013 to 2017. His homepage highlights work including Hyper-Connections, Frac-Connections, Over-Tokenized Transformer, and Expert Race.