Infini-AI-Lab / MagicDec

Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding
78 · Updated this week

Related projects

Alternatives and complementary repositories for MagicDec