LINs-lab / DeFT

[ICLR 2025] DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference
19Updated last week

Alternatives and similar repositories for DeFT:

Users that are interested in DeFT are comparing it to the libraries listed below