yuanmu97 / InFi
InFi is a library for building input filters for resource-efficient inference.
☆37 Updated last year
Alternatives and similar repositories for InFi:
Users who are interested in InFi are comparing it to the libraries listed below.
- ☆192 Updated last year
- This is a list of awesome edge AI inference-related papers. ☆91 Updated last year
- ☆99 Updated last year
- PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices ☆29 Updated 11 months ago
- ☆17 Updated last year
- A curated list of research in System for Edge Intelligence and Computing (Edge MLSys), including Frameworks, Tools, Repository, etc. Paper… ☆26 Updated 3 years ago
- Create tiny ML systems for on-device learning. ☆20 Updated 3 years ago
- ☆15 Updated last year
- Source code and datasets for Ekya, a system for continuous learning on the edge. ☆104 Updated 2 years ago
- ☆43 Updated 2 years ago
- Partial implementation of paper "DEEP GRADIENT COMPRESSION: REDUCING THE COMMUNICATION BANDWIDTH FOR DISTRIBUTED TRAINING" ☆31 Updated 4 years ago
- [ICLR 2018] Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training ☆214 Updated 6 months ago
- Vector quantization for stochastic gradient descent. ☆33 Updated 4 years ago
- ☆12 Updated 5 years ago
- A demo of end-to-end federated learning system. ☆68 Updated 2 years ago
- ☆74 Updated last year
- ☆56 Updated 3 years ago
- LotteryFL: Empower Edge Intelligence with Personalized and Communication-Efficient Federated Learning (2021 IEEE/ACM Symposium on Edge Co… ☆41 Updated 2 years ago
- Autodidactic Neurosurgeon: Collaborative Deep Inference for Mobile Edge Intelligence via Online Learning ☆39 Updated 3 years ago
- ☆46 Updated last year
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup ☆34 Updated 2 years ago
- One-size-fits-all model for mobile AI, a novel paradigm for mobile AI in which the OS and hardware co-manage a foundation model that is c… ☆22 Updated 10 months ago
- Federated Dynamic Sparse Training ☆29 Updated 2 years ago
- A Distributed Camera System for Inference Scheduling and Continuous Learning in Video Analytics ☆15 Updated last year
- Adaptive Model Streaming for real-time video inference on edge devices ☆42 Updated 3 years ago
- A curated list of early exiting (LLM, CV, NLP, etc.) ☆38 Updated 4 months ago
- PyTorch implementation of the paper: Decomposing Vision Transformers for Collaborative Inference in Edge Devices ☆10 Updated 5 months ago
- Source code for Jellyfish, a soft real-time inference serving system ☆12 Updated 2 years ago
- ☆19 Updated 2 years ago
- MobiSys#114 ☆21 Updated last year