siyan-zhao / prepacking

The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models"
59Updated 4 months ago

Alternatives and similar repositories for prepacking:

Users that are interested in prepacking are comparing it to the libraries listed below