usc-isi / PipeEdgeLinks
PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices
☆37Updated last year
Alternatives and similar repositories for PipeEdge
Users that are interested in PipeEdge are comparing it to the libraries listed below
Sorting:
- This is a list of awesome edgeAI inference related papers.☆97Updated last year
- ☆41Updated 5 years ago
- a deep learning-driven scheduler for elastic training in deep learning clusters☆31Updated 4 years ago
- ☆102Updated last year
- InFi is a library for building input filters for resource-efficient inference.☆41Updated 2 years ago
- ☆211Updated last year
- Official Repo for "SplitQuant / LLM-PQ: Resource-Efficient LLM Offline Serving on Heterogeneous GPUs via Phase-Aware Model Partition and …☆35Updated 3 months ago
- PyTorch implementation of the paper: Decomposing Vision Transformers for Collaborative Inference in Edge Devices☆17Updated last year
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆35Updated 2 years ago
- A Portable C Library for Distributed CNN Inference on IoT Edge Clusters☆87Updated 5 years ago
- ☆23Updated 3 years ago
- ☆51Updated 3 years ago
- [IJCAI2023] An automated parallel training system that combines the advantages from both data and model parallelism. If you have any inte…☆52Updated 2 years ago
- MobiSys#114☆22Updated 2 years ago
- HeliosArtifact☆22Updated 3 years ago
- Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling☆12Updated last year
- ☆15Updated last year
- ☆78Updated 2 years ago
- ☆13Updated 5 years ago
- Ok-Topk is a scheme for distributed training with sparse gradients. Ok-Topk integrates a novel sparse allreduce algorithm (less than 6k c…☆27Updated 3 years ago
- AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)☆92Updated 2 years ago
- Simple PyTorch graph capturing.☆21Updated 2 years ago
- A curated list of awesome projects and papers for AI on Mobile/IoT/Edge devices. Everything is continuously updating. Welcome contributio…☆44Updated 2 years ago
- Artifacts for our ASPLOS'23 paper ElasticFlow☆55Updated last year
- Code for "Solving Large-Scale Granular Resource Allocation Problems Efficiently with POP", which appeared at SOSP 2021☆28Updated 3 years ago
- ☆38Updated 5 months ago
- GRACE - GRAdient ComprEssion for distributed deep learning☆139Updated last year
- Autodidactic Neurosurgeon Collaborative Deep Inference for Mobile Edge Intelligence via Online Learning☆42Updated 4 years ago
- ☆24Updated 2 years ago
- ☆44Updated last year