jaywonchung / ShadowTutor
(ICPP '20) ShadowTutor: Distributed Partial Distillation for Mobile Video DNN Inference
☆12Updated 4 years ago
Alternatives and similar repositories for ShadowTutor:
Users that are interested in ShadowTutor are comparing it to the libraries listed below
- Network Contention-Aware Cluster Scheduling with Reinforcement Learning (IEEE ICPADS 2023)☆15Updated 5 months ago
- ☆21Updated last year
- Metis: Learning to Schedule Long-Running Applications in Shared Container Clusters with at Scale☆18Updated 4 years ago
- ☆14Updated 3 years ago
- Multi-Instance-GPU profiling tool☆57Updated last year
- ☆24Updated last year
- Model-less Inference Serving☆85Updated last year
- ☆46Updated 2 months ago
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆34Updated 2 years ago
- Code for "Heterogenity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020☆126Updated 7 months ago
- Experimental deep learning framework written in Rust☆14Updated 2 years ago
- ☆37Updated 3 years ago
- This is a list of awesome edgeAI inference related papers.☆95Updated last year
- Helios Traces from SenseTime☆53Updated 2 years ago
- Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access (ACM EuroSys '23)☆57Updated 11 months ago
- ☆49Updated 2 years ago
- ☆45Updated 2 years ago
- ☆39Updated 4 years ago
- Tiresias is a GPU cluster manager for distributed deep learning training.☆152Updated 4 years ago
- Elastic Execution of a DNN Model Between Client and Server☆17Updated 5 years ago
- a deep learning-driven scheduler for elastic training in deep learning clusters☆29Updated 4 years ago
- PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications☆127Updated 2 years ago
- ☆40Updated 8 months ago
- Source code and datasets for Ekya, a system for continuous learning on the edge.☆105Updated 3 years ago
- Source code for the paper: "A Latency-Predictable Multi-Dimensional Optimization Framework forDNN-driven Autonomous Systems"☆22Updated 4 years ago
- Auto-Split: A General Framework of Collaborative Edge-Cloud AI☆12Updated 3 years ago
- ☆53Updated 4 years ago
- FastFlow is a system that automatically detects CPU bottlenecks in deep learning training pipelines and resolves the bottlenecks with dat…☆27Updated last year
- Code for "Solving Large-Scale Granular Resource Allocation Problems Efficiently with POP", which appeared at SOSP 2021☆25Updated 3 years ago
- BATCH: Adaptive Batching for Efficient MachineLearning Serving on Serverless Platforms☆9Updated 3 years ago