mega002 / ff-layers

The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Levy. EMNLP, 2021.
89Updated 3 years ago

Alternatives and similar repositories for ff-layers:

Users that are interested in ff-layers are comparing it to the libraries listed below