zhengzangw / Sequence-Scheduling

PyTorch implementation of paper "Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline".
81Updated last year

Alternatives and similar repositories for Sequence-Scheduling:

Users that are interested in Sequence-Scheduling are comparing it to the libraries listed below