usc-isi / PipeEdge
PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices
☆36 Updated last year
Alternatives and similar repositories for PipeEdge
Users who are interested in PipeEdge are comparing it to the repositories listed below.
- This is a list of awesome edgeAI inference related papers. ☆97 Updated last year
- ☆40 Updated 4 years ago
- ☆205 Updated last year
- ☆100 Updated last year
- a deep learning-driven scheduler for elastic training in deep learning clusters ☆30 Updated 4 years ago
- InFi is a library for building input filters for resource-efficient inference. ☆38 Updated last year
- Official Repo for "LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization" ☆34 Updated last month
- ☆77 Updated 2 years ago
- A Portable C Library for Distributed CNN Inference on IoT Edge Clusters ☆83 Updated 5 years ago
- ☆13 Updated 5 years ago
- MobiSys#114 ☆21 Updated last year
- PyTorch implementation of the paper: Decomposing Vision Transformers for Collaborative Inference in Edge Devices ☆13 Updated last year
- We present a set of all-reduce compatible gradient compression algorithms which significantly reduce the communication overhead while mai… ☆10 Updated 3 years ago
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup ☆35 Updated 2 years ago
- Autodidactic Neurosurgeon: Collaborative Deep Inference for Mobile Edge Intelligence via Online Learning ☆40 Updated 3 years ago
- Ok-Topk is a scheme for distributed training with sparse gradients. Ok-Topk integrates a novel sparse allreduce algorithm (less than 6k c… ☆26 Updated 2 years ago
- LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale ☆127 Updated 3 weeks ago
- ☆37 Updated last month
- ☆9 Updated last year
- Create tiny ML systems for on-device learning. ☆20 Updated 4 years ago
- ☆51 Updated 2 years ago
- Source code for the paper: "A Latency-Predictable Multi-Dimensional Optimization Framework for DNN-driven Autonomous Systems" ☆22 Updated 4 years ago
- A curated list of awesome projects and papers for AI on Mobile/IoT/Edge devices. Everything is continuously updating. Welcome contributio… ☆40 Updated 2 years ago
- Deep Compressive Offloading: Speeding Up Neural Network Inference by Trading Edge Computation for Network Latency ☆28 Updated 4 years ago
- ☆15 Updated 11 months ago
- RLScheduler: An Automated HPC Batch Job Scheduler Using Reinforcement Learning [SC'20] ☆61 Updated 2 years ago
- LLM Inference analyzer for different hardware platforms ☆82 Updated last month
- Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow" ☆59 Updated 8 months ago
- Source code and datasets for Ekya, a system for continuous learning on the edge. ☆107 Updated 3 years ago
- A Deep Learning Cluster Scheduler ☆39 Updated 4 years ago