LINs-lab / DeFT

[ICLR 2025] DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference
34 · Updated 2 months ago

Alternatives and similar repositories for DeFT

Users who are interested in DeFT compare it to the libraries listed below.
