layer6ai-labs / T-FixupView on GitHub
Code for the ICML'20 paper "Improving Transformer Optimization Through Better Initialization"
89Feb 1, 2021Updated 5 years ago

Alternatives and similar repositories for T-Fixup

Users that are interested in T-Fixup are comparing it to the libraries listed below

Sorting:

Are these results useful?