LINs-lab / DeFTLinks

[ICLR 2025] DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference
28Updated 3 weeks ago

Alternatives and similar repositories for DeFT

Users that are interested in DeFT are comparing it to the libraries listed below

Sorting: