lucidrains / Mega-pytorch

Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena
204Updated last year

Alternatives and similar repositories for Mega-pytorch:

Users that are interested in Mega-pytorch are comparing it to the libraries listed below