jlamprou / Infini-Attention

Efficient Infinite Context Transformers with Infini-attention: PyTorch implementation + QwenMoE implementation + training script + 1M-context passkey retrieval
66 · Updated 6 months ago

Related projects

Alternatives and complementary repositories for Infini-Attention