layer6ai-labs / T-Fixup

Code for the ICML'20 paper "Improving Transformer Optimization Through Better Initialization"
89Updated 3 years ago

Alternatives and similar repositories for T-Fixup:

Users that are interested in T-Fixup are comparing it to the libraries listed below