jlamprou / Infini-Attention

PyTorch implementation of "Efficient Infinite Context Transformers with Infini-attention", plus a QwenMoE implementation, a training script, and 1M-context passkey retrieval
82 · Updated last year

Alternatives and similar repositories for Infini-Attention

Users who are interested in Infini-Attention compare it to the libraries listed below.
