Infini-AI-Lab / TriForce

[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
230 · Updated 2 months ago

Related projects

Alternatives and complementary repositories for TriForce