usc-isi / PipeEdge
PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices
☆35 · Updated last year
Alternatives and similar repositories for PipeEdge
Users who are interested in PipeEdge are comparing it to the libraries listed below.
- This is a list of awesome edgeAI inference related papers. ☆96 · Updated last year
- a deep learning-driven scheduler for elastic training in deep learning clusters ☆30 · Updated 4 years ago
- ☆100 · Updated last year
- ☆203 · Updated last year
- ☆40 · Updated 4 years ago
- MobiSys#114 ☆21 · Updated last year
- InFi is a library for building input filters for resource-efficient inference. ☆38 · Updated last year
- Official Repo for "LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization" ☆34 · Updated 2 weeks ago
- ☆77 · Updated 2 years ago
- Simple PyTorch graph capturing. ☆20 · Updated 2 years ago
- Lucid: A Non-Intrusive, Scalable and Interpretable Scheduler for Deep Learning Training Jobs ☆55 · Updated 2 years ago
- ☆14 · Updated 11 months ago
- ☆13 · Updated 5 years ago
- A curated list of research in System for Edge Intelligence and Computing(Edge MLSys), including Frameworks, Tools, Repository, etc. Paper… ☆30 · Updated 3 years ago
- A Portable C Library for Distributed CNN Inference on IoT Edge Clusters ☆82 · Updated 5 years ago
- iGniter, an interference-aware GPU resource provisioning framework for achieving predictable performance of DNN inference in the cloud. ☆38 · Updated last year
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup ☆35 · Updated 2 years ago
- AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23) ☆82 · Updated 2 years ago
- ☆9 · Updated last year
- zTT: Learning-based DVFS with Zero Thermal Throttling for Mobile Devices [MobiSys'21] - Artifact Evaluation ☆25 · Updated 4 years ago
- LLM serving cluster simulator ☆107 · Updated last year
- Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling ☆13 · Updated last year
- Cloud-edge collaboration: collaborative inference. 📚 Dynamic adaptive DNN surgery for inference acceleration on the edge ☆41 · Updated last year
- A curated list of early exiting (LLM, CV, NLP, etc) ☆56 · Updated 10 months ago
- A curated list of awesome projects and papers for AI on Mobile/IoT/Edge devices. Everything is continuously updating. Welcome contributio… ☆38 · Updated 2 years ago
- [ICML 2024] Serving LLMs on heterogeneous decentralized clusters. ☆26 · Updated last year
- Curated collection of papers in MoE model inference ☆210 · Updated 4 months ago
- A Deep Learning Cluster Scheduler ☆39 · Updated 4 years ago
- ☆22 · Updated 2 years ago
- Source code and datasets for Ekya, a system for continuous learning on the edge. ☆106 · Updated 3 years ago