usc-isi / PipeEdge
PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices
☆37 · Updated last year
Alternatives and similar repositories for PipeEdge
Users interested in PipeEdge are comparing it to the repositories listed below.
- ☆213 · Updated last year
- A list of awesome edge AI inference-related papers. ☆98 · Updated 2 years ago
- A deep-learning-driven scheduler for elastic training in deep learning clusters. ☆31 · Updated 4 years ago
- ☆102 · Updated last year
- ☆13 · Updated 5 years ago
- Official repo for "SplitQuant / LLM-PQ: Resource-Efficient LLM Offline Serving on Heterogeneous GPUs via Phase-Aware Model Partition and …" ☆36 · Updated 4 months ago
- ☆41 · Updated 5 years ago
- A Portable C Library for Distributed CNN Inference on IoT Edge Clusters. ☆88 · Updated 5 years ago
- InFi is a library for building input filters for resource-efficient inference. ☆41 · Updated 2 years ago
- MobiSys#114. ☆22 · Updated 2 years ago
- ☆78 · Updated 2 years ago
- A curated list of awesome projects and papers for AI on Mobile/IoT/Edge devices. Everything is continuously updating. Welcome contributio… ☆44 · Updated 2 years ago
- Autodidactic Neurosurgeon: Collaborative Deep Inference for Mobile Edge Intelligence via Online Learning. ☆42 · Updated 4 years ago
- PyTorch implementation of the paper "Decomposing Vision Transformers for Collaborative Inference in Edge Devices". ☆17 · Updated last year
- ☆26 · Updated last year
- [DAC 2024] EDGE-LLM: Enabling Efficient Large Language Model Adaptation on Edge Devices via Layerwise Unified Compression and Adaptive La… ☆74 · Updated last year
- Deep Compressive Offloading: Speeding Up Neural Network Inference by Trading Edge Computation for Network Latency. ☆28 · Updated 4 years ago
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup. ☆35 · Updated 3 years ago
- Cloud-edge collaborative inference 📚 Dynamic adaptive DNN surgery for inference acceleration on the edge. ☆44 · Updated 2 years ago
- Simple PyTorch graph capturing. ☆21 · Updated 2 years ago
- [IEEE Access] "Head Network Distillation: Splitting Distilled Deep Neural Networks for Resource-constrained Edge Computing Systems" and [… ☆36 · Updated 2 years ago
- iGniter, an interference-aware GPU resource provisioning framework for achieving predictable performance of DNN inference in the cloud. ☆39 · Updated last year
- [ICML 2024] Serving LLMs on heterogeneous decentralized clusters. ☆34 · Updated last year
- LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale. ☆170 · Updated 5 months ago
- A curated list of research in Systems for Edge Intelligence and Computing (Edge MLSys), including frameworks, tools, repositories, etc. Paper… ☆32 · Updated 4 years ago
- GRACE: GRAdient ComprEssion for distributed deep learning. ☆139 · Updated last year
- Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling. ☆12 · Updated last year
- LLM serving cluster simulator. ☆132 · Updated last year
- Proof-of-concept CPU implementation of ASPEN, used for the NeurIPS'23 paper "ASPEN: Breaking Operator Barriers for Efficient Pa…" ☆13 · Updated last year
- Source code and datasets for Ekya, a system for continuous learning on the edge. ☆112 · Updated 3 years ago