pkuzengqi / SkyformerLinks
Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method (NeurIPS 2021)
☆63Updated 3 years ago
Alternatives and similar repositories for Skyformer
Users that are interested in Skyformer are comparing it to the libraries listed below
Sorting:
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012☆49Updated 3 years ago
- FairSeq repo with Apollo optimizer☆114Updated 2 years ago
- Implementation of Memformer, a Memory-augmented Transformer, in Pytorch