llm-db / FineInfer

Deferred Continuous Batching in Resource-Efficient Large Language Model Serving (EuroMLSys 2024)
15 · Updated 11 months ago

Alternatives and similar repositories for FineInfer

Users who are interested in FineInfer are comparing it to the libraries listed below.
