Thesys-lab / Helix-ASPLOS25
Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow"
☆40Updated 5 months ago
Alternatives and similar repositories for Helix-ASPLOS25:
Users that are interested in Helix-ASPLOS25 are comparing it to the libraries listed below
- Artifacts for our ASPLOS'23 paper ElasticFlow☆51Updated 11 months ago
- ☆20Updated 11 months ago
- Compiler for Dynamic Neural Networks☆46Updated last year
- ☆16Updated 11 months ago
- ☆36Updated 6 months ago
- ☆49Updated 2 years ago
- Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction | A tiny BERT model can tell you the verbosity of an …☆33Updated 11 months ago