Infini-AI-Lab / MagicDec
[ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding
145 · Dec 4, 2024 · Updated last year

Alternatives and similar repositories for MagicDec

Users that are interested in MagicDec are comparing it to the libraries listed below.
