lucidrains / memory-compressed-attention

Implementation of Memory-Compressed Attention, from the paper "Generating Wikipedia By Summarizing Long Sequences"
70Updated last year

Alternatives and similar repositories for memory-compressed-attention:

Users that are interested in memory-compressed-attention are comparing it to the libraries listed below