yuanmu97 / InFiLinks
InFi is a library for building input filters for resource-efficient inference.
☆41Updated 2 years ago
Alternatives and similar repositories for InFi
Users that are interested in InFi are comparing it to the libraries listed below
Sorting:
- ☆212Updated 2 years ago
- Source code and datasets for Ekya, a system for continuous learning on the edge.☆112Updated 3 years ago
- PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices☆37Updated 2 years ago
- This is a list of awesome edgeAI inference related papers.☆98Updated 2 years ago
- a deep learning-driven scheduler for elastic training in deep learning clusters☆31Updated 5 years ago
- ☆78Updated 2 years ago
- ☆102Updated 2 years ago
- ☆57Updated 4 years ago
- Autodidactic Neurosurgeon Collaborative Deep Inference for Mobile Edge Intelligence via Online Learning☆42Updated 4 years ago
- A Portable C Library for Distributed CNN Inference on IoT Edge Clusters☆88Updated 5 years ago
- A curated list of awesome projects and papers for AI on Mobile/IoT/Edge devices. Everything is continuously updating. Welcome contributio…☆47Updated this week
- ☆48Updated 2 years ago
- [ICLR 2018] Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training☆226Updated last year
- A curated list of research in System for Edge Intelligence and Computing(Edge MLSys), including Frameworks, Tools, Repository, etc. Paper…☆32Updated 4 years ago
- Adaptive Model Streaming for real-time video inference on edge devices☆41Updated 4 years ago
- GRACE - GRAdient ComprEssion for distributed deep learning☆139Updated last year
- Source code for the paper: "A Latency-Predictable Multi-Dimensional Optimization Framework forDNN-driven Autonomous Systems"☆22Updated 5 years ago
- ☆13Updated 6 years ago
- FilterForward: Scaling Video Analytics on Constrained Edge Nodes☆28Updated 6 years ago
- Server-driven Video Streaming for Deep Learning Inference☆93Updated 3 years ago
- Partial implementation of paper "DEEP GRADIENT COMPRESSION: REDUCING THE COMMUNICATION BANDWIDTH FOR DISTRIBUTED TRAINING"☆31Updated 5 years ago
- PyTorch implementation of the paper: Decomposing Vision Transformers for Collaborative Inference in Edge Devices☆18Updated last year
- ☆16Updated 2 years ago
- Code for "Solving Large-Scale Granular Resource Allocation Problems Efficiently with POP", which appeared at SOSP 2021☆28Updated 4 years ago
- INFOCOM 2024: Online Resource Allocation for Edge Intelligence with Colocated Model Retraining and Inference☆34Updated last year
- Source code for Jellyfish, a soft real-time inference serving system☆15Updated 3 years ago
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆35Updated 3 years ago
- MobiSys#114☆23Updated 2 years ago
- ☆23Updated 4 years ago
- ☆15Updated 2 years ago