Zyphra / tree_attention
Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters
133 | Dec 3, 2024 | Updated last year

Alternatives and similar repositories for tree_attention

Users who are interested in tree_attention are comparing it to the libraries listed below.

