yuanmu97 / InFi
InFi is a library for building input filters for resource-efficient inference.
☆37Updated last year
Alternatives and similar repositories for InFi:
Users that are interested in InFi are comparing it to the libraries listed below
- ☆199Updated last year
- PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices☆29Updated last year
- Source code and datasets for Ekya, a system for continuous learning on the edge.☆105Updated 3 years ago
- ☆22Updated last year
- [ICLR 2018] Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training☆217Updated 9 months ago
- ☆16Updated last year
- Autodidactic Neurosurgeon Collaborative Deep Inference for Mobile Edge Intelligence via Online Learning☆41Updated 3 years ago
- This is a list of awesome edgeAI inference related papers.☆95Updated last year
- One-size-fits-all model for mobile AI, a novel paradigm for mobile AI in which the OS and hardware co-manage a foundation model that is c…☆25Updated last year
- ☆56Updated 3 years ago
- ☆45Updated 2 years ago
- Partial implementation of paper "DEEP GRADIENT COMPRESSION: REDUCING THE COMMUNICATION BANDWIDTH FOR DISTRIBUTED TRAINING"☆31Updated 4 years ago
- Create tiny ML systems for on-device learning.☆20Updated 3 years ago
- ☆13Updated 5 years ago
- ☆98Updated last year
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆34Updated 2 years ago
- ☆9Updated last year
- DNN_Partition辅助工具,用于对pytorch模型进行简单的性能分析以及支持模型切分☆11Updated 3 years ago
- ☆28Updated 2 years ago
- Official Pytorch implementation of "Communication-Efficient Federated Learning with Compensated Overlap-FedAvg"☆22Updated 3 years ago
- PyTorch implementation of the paper: Multi-Agent Collaborative Inference via DNN Decoupling: Intermediate Feature Compression and Edge Le…☆39Updated last year
- A demo of end-to-end federated learning system.☆68Updated 2 years ago
- "Efficient Federated Learning for Modern NLP", to appear at MobiCom 2023.☆33Updated last year
- Adaptive Model Streaming for real-time video inference on edge devices☆41Updated 3 years ago
- A PyTorch Implementation for experiements in paper: Neurosurgeon: Collaborative Intelligence Between the Cloud and Mobile Edge.☆13Updated last year
- LotteryFL: Empower Edge Intelligence with Personalized and Communication-Efficient Federated Learning (2021 IEEE/ACM Symposium on Edge Co…☆42Updated 2 years ago
- ☆77Updated last year
- 云边协同- collaborative inference📚Dynamic adaptive DNN surgery for inference acceleration on the edge☆37Updated last year
- vector quantization for stochastic gradient descent.☆34Updated 4 years ago
- Source code for Jellyfish, a soft real-time inference serving system☆12Updated 2 years ago