NVIDIA / Star-Attention

Efficient LLM Inference over Long Sequences
β˜†357Updated this week

Alternatives and similar repositories for Star-Attention:

Users that are interested in Star-Attention are comparing it to the libraries listed below