smart-lty / ParallelSpeculativeDecoding

The official code for the paper "Parallel Speculative Decoding with Adaptive Draft Length."
23 · Updated 2 months ago

Related projects

Alternatives and complementary repositories for ParallelSpeculativeDecoding