usc-isi / PipeEdge
PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices
☆33 Updated last year
Alternatives and similar repositories for PipeEdge
Users who are interested in PipeEdge are comparing it to the repositories listed below.
- This is a list of awesome edge-AI inference-related papers. ☆96 Updated last year
- Official Repo for "LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization" ☆32 Updated last year
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup ☆34 Updated 2 years ago
- A deep learning-driven scheduler for elastic training in deep learning clusters ☆29 Updated 4 years ago
- Cloud-edge collaboration - collaborative inference 📚 Dynamic adaptive DNN surgery for inference acceleration on the edge ☆40 Updated last year
- ☆40 Updated 4 years ago
- [TMC'22] SplitPlace: AI Augmented Splitting and Placement of Large-Scale Neural Networks in Mobile Edge Environments ☆17 Updated 2 years ago
- ☆99 Updated last year
- PyTorch implementation of the paper: Decomposing Vision Transformers for Collaborative Inference in Edge Devices ☆12 Updated 10 months ago
- ☆16 Updated last year
- ☆13 Updated 5 years ago
- Autodidactic Neurosurgeon: Collaborative Deep Inference for Mobile Edge Intelligence via Online Learning ☆40 Updated 3 years ago
- ☆14 Updated 9 months ago
- InFi is a library for building input filters for resource-efficient inference. ☆38 Updated last year
- Deep Compressive Offloading: Speeding Up Neural Network Inference by Trading Edge Computation for Network Latency ☆27 Updated 4 years ago
- ☆77 Updated 2 years ago
- A curated list of early exiting (LLM, CV, NLP, etc) ☆53 Updated 9 months ago
- ☆22 Updated 2 years ago
- MobiSys#114 ☆21 Updated last year
- Source code for Jellyfish, a soft real-time inference serving system ☆12 Updated 2 years ago
- ☆9 Updated last year
- ☆20 Updated 3 years ago
- ☆14 Updated last year
- A PyTorch implementation of the experiments in the paper: Neurosurgeon: Collaborative Intelligence Between the Cloud and Mobile Edge. ☆13 Updated 2 years ago
- PyTorch implementation of the paper: Multi-Agent Collaborative Inference via DNN Decoupling: Intermediate Feature Compression and Edge Le… ☆40 Updated last year
- Cloud-edge collaboration - collaborative inference 📚 Neurosurgeon: Collaborative Intelligence Between the Cloud and Mobile Edge ☆81 Updated last year
- A Portable C Library for Distributed CNN Inference on IoT Edge Clusters ☆82 Updated 5 years ago
- Create tiny ML systems for on-device learning. ☆20 Updated 3 years ago
- ☆50 Updated 2 years ago
- [ICML 2024] Serving LLMs on heterogeneous decentralized clusters. ☆25 Updated last year