Zyphra / tree_attentionLinks

Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters
126Updated 5 months ago

Alternatives and similar repositories for tree_attention

Users that are interested in tree_attention are comparing it to the libraries listed below

Sorting: