yuanmu97 / InFiLinks
InFi is a library for building input filters for resource-efficient inference.
☆38Updated last year
Alternatives and similar repositories for InFi
Users that are interested in InFi are comparing it to the libraries listed below
Sorting:
- PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices☆35Updated last year
- ☆13Updated 5 years ago
- This is a list of awesome edgeAI inference related papers.☆95Updated last year
- ☆16Updated last year
- Source code and datasets for Ekya, a system for continuous learning on the edge.☆106Updated 3 years ago
- a deep learning-driven scheduler for elastic training in deep learning clusters☆30Updated 4 years ago
- ☆13Updated last year
- ☆45Updated 2 years ago
- Partial implementation of paper "DEEP GRADIENT COMPRESSION: REDUCING THE COMMUNICATION BANDWIDTH FOR DISTRIBUTED TRAINING"☆31Updated 4 years ago
- INFOCOM 2024: Online Resource Allocation for Edge Intelligence with Colocated Model Retraining and Inference☆18Updated 8 months ago
- ☆56Updated 3 years ago
- Autodidactic Neurosurgeon Collaborative Deep Inference for Mobile Edge Intelligence via Online Learning☆40Updated 3 years ago
- ☆28Updated 2 years ago
- [ICLR 2018] Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training☆222Updated 11 months ago
- ☆202Updated last year
- A curated list of research in System for Edge Intelligence and Computing(Edge MLSys), including Frameworks, Tools, Repository, etc. Paper…☆30Updated 3 years ago
- Server-driven Video Streaming for Deep Learning Inference☆92Updated 2 years ago
- Adaptive Model Streaming for real-time video inference on edge devices☆41Updated 3 years ago
- ☆45Updated 3 years ago
- Deep Compressive Offloading: Speeding Up Neural Network Inference by Trading Edge Computation for Network Latency☆28Updated 4 years ago
- Create tiny ML systems for on-device learning.☆20Updated 3 years ago
- ☆22Updated 2 years ago
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆35Updated 2 years ago
- ☆99Updated last year
- vector quantization for stochastic gradient descent.☆35Updated 5 years ago
- A Distributed Camera System for Inference Scheduling and Continuous Learning in Video Analytics☆17Updated 2 years ago
- PyTorch implementation of the paper: Decomposing Vision Transformers for Collaborative Inference in Edge Devices☆12Updated 11 months ago
- ☆9Updated last year
- FilterForward: Scaling Video Analytics on Constrained Edge Nodes☆28Updated 5 years ago
- Code for the paper: "BottleNet++: An End-to-End Approach for Feature Compression in Device-Edge Co-Inference Systems"☆50Updated 3 years ago