OpenNLPLab / lightning-attention

Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models
260 · Updated this week

Alternatives and similar repositories for lightning-attention:

Users who are interested in lightning-attention are comparing it to the libraries listed below.