llm-db / FineInfer

Deferred Continuous Batching in Resource-Efficient Large Language Model Serving (EuroMLSys 2024)
11Updated 5 months ago

Related projects

Alternatives and complementary repositories for FineInfer