large language models

Current frame

AI researcher at ByteDance focused on large language model architectures.

Extended note

Defa Zhu is an AI researcher at ByteDance. According to his official homepage, his work focuses on large language models and stronger LLM architectures. His OpenReview profile lists him as an MS student at the University of Chinese Academy of Sciences from 2017 to 2020 and as an undergraduate student in mathematics at Northeastern University from 2013 to 2017. His homepage highlights work including Hyper-Connections, Frac-Connections, Over-Tokenized Transformer, and Expert Race.