☆19 · Dec 24, 2024 · Updated last year
Alternatives and similar repositories for flash_tree_attn
Users that are interested in flash_tree_attn are comparing it to the libraries listed below.
- triton ver of gqa flash attn, based on the tutorial ☆12 · Aug 4, 2024 · Updated last year
- [ACL 2026 (Main)] LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification ☆79 · Jul 14, 2025 · Updated 8 months ago
- [ACL 2025 main] FR-Spec: Frequency-Ranked Speculative Sampling