lucidrains / memory-efficient-attention-pytorch

Implementation of memory-efficient multi-head attention, as proposed in the paper "Self-attention Does Not Need O(n²) Memory"
370 · Updated last year

Alternatives and similar repositories for memory-efficient-attention-pytorch:

Users who are interested in memory-efficient-attention-pytorch are comparing it to the libraries listed below.