yuanmu97 / InFi
InFi is a library for building input filters for resource-efficient inference.
☆37Updated last year
Alternatives and similar repositories for InFi:
Users that are interested in InFi are comparing it to the libraries listed below
- PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices☆31Updated last year
- ☆99Updated last year
- This is a list of awesome edgeAI inference related papers.☆96Updated last year
- ☆13Updated 5 years ago
- Source code and datasets for Ekya, a system for continuous learning on the edge.☆105Updated 3 years ago
- LotteryFL: Empower Edge Intelligence with Personalized and Communication-Efficient Federated Learning (2021 IEEE/ACM Symposium on Edge Co…☆42Updated 2 years ago
- [ICLR 2018] Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training☆221Updated 9 months ago
- ☆201Updated last year
- ☆22Updated last year
- A curated list of research in System for Edge Intelligence and Computing(Edge MLSys), including Frameworks, Tools, Repository, etc. Paper…☆29Updated 3 years ago
- Autodidactic Neurosurgeon Collaborative Deep Inference for Mobile Edge Intelligence via Online Learning☆42Updated 3 years ago
- A curated list of early exiting (LLM, CV, NLP, etc)☆49Updated 8 months ago
- Deep Compressive Offloading: Speeding Up Neural Network Inference by Trading Edge Computation for Network Latency☆27Updated 4 years ago
- Partial implementation of paper "DEEP GRADIENT COMPRESSION: REDUCING THE COMMUNICATION BANDWIDTH FOR DISTRIBUTED TRAINING"☆31Updated 4 years ago
- Pytorch-based early exit network inspired by branchynet☆31Updated 3 weeks ago
- ☆77Updated last year
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆34Updated 2 years ago
- Create tiny ML systems for on-device learning.☆20Updated 3 years ago
- ☆16Updated last year
- DNN_Partition辅助工具,用于对pytorch模型进行简单的性能分析以及支持模型切分☆11Updated 3 years ago
- ☆14Updated last year
- GRACE - GRAdient ComprEssion for distributed deep learning☆139Updated 9 months ago
- FilterForward: Scaling Video Analytics on Constrained Edge Nodes☆28Updated 5 years ago
- MobiSys#114☆21Updated last year
- 云边协同- collaborative inference📚Dynamic adaptive DNN surgery for inference acceleration on the edge☆39Updated last year
- About DNN compression and acceleration on Edge Devices.☆55Updated 3 years ago
- INFOCOM 2024: Online Resource Allocation for Edge Intelligence with Colocated Model Retraining and Inference☆14Updated 6 months ago
- ☆45Updated 2 years ago
- A DNN model partition demo☆31Updated 5 years ago
- 基于提前退出部分样本原理而实现的带分支网络(supported by chainer)☆45Updated 6 years ago