guolinke / TUPE

Transformer with Untied Positional Encoding (TUPE). Code for the paper "Rethinking Positional Encoding in Language Pre-training". Improves existing models such as BERT.
250 · Updated 3 years ago

Related projects

Alternatives and complementary repositories for TUPE