mixture of experts, transformer training

Current frame

Researcher in efficient large-model training and mixture-of-experts methods.