hmarkc / parallel-prompt-decoding

Efficient LLM Inference Acceleration using Prompting
43Updated 3 weeks ago

Related projects

Alternatives and complementary repositories for parallel-prompt-decoding