LINs-lab / DeFTView on GitHub
[ICLR 2025] DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference
51Jun 17, 2025Updated 10 months ago

Alternatives and similar repositories for DeFT

Users that are interested in DeFT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?