lucidrains / memory-compressed-attention

Implementation of Memory-Compressed Attention, from the paper "Generating Wikipedia By Summarizing Long Sequences"
71 · Updated last year

Related projects

Alternatives and complementary repositories for memory-compressed-attention