tnq177 / transformers_without_tears

Transformers without Tears: Improving the Normalization of Self-Attention
130Updated 5 months ago

Related projects

Alternatives and complementary repositories for transformers_without_tears