Zyphra / tree_attention

Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters
104 · Updated last month

Related projects

Alternatives and complementary repositories for tree_attention