lucidrains / Mega-pytorch

Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena
203 · Updated last year

Alternatives and similar repositories for Mega-pytorch

Users who are interested in Mega-pytorch are comparing it to the libraries listed below.
