cakeng / ASPENLinks
This is the proof-of-concept CPU implementation of ASPEN used for the NeurIPS'23 paper ASPEN: Breaking Operator Barriers for Efficient Parallelization of Deep Neural Networks.
☆11Updated last year
Alternatives and similar repositories for ASPEN
Users that are interested in ASPEN are comparing it to the libraries listed below
Sorting:
- ☆25Updated 2 years ago
- Official Repo for "SplitQuant / LLM-PQ: Resource-Efficient LLM Offline Serving on Heterogeneous GPUs via Phase-Aware Model Partition and …☆34Updated this week
- TiledLower is a Dataflow Analysis and Codegen Framework written in Rust.☆14Updated 9 months ago
- Official repository for the paper DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines☆20Updated last year
- Artifacts of EVT ASPLOS'24☆26Updated last year
- Supplemental materials for The ASPLOS 2025 / EuroSys 2025 Contest on Intra-Operator Parallelism for Distributed Deep Learning☆22Updated 3 months ago
- SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs☆57Updated 5 months ago
- LLM Inference analyzer for different hardware platforms