NVIDIA / Star-Attention

Efficient LLM Inference over Long Sequences
362 · Updated 3 weeks ago

Alternatives and similar repositories for Star-Attention:

Users who are interested in Star-Attention are comparing it to the libraries listed below: