jaketae / param-share-transformer

PyTorch implementation of Lessons on Parameter Sharing across Layers in Transformers
25Updated 3 years ago

Related projects

Alternatives and complementary repositories for param-share-transformer