yuanmu97 / InFiLinks
InFi is a library for building input filters for resource-efficient inference.
☆38Updated last year
Alternatives and similar repositories for InFi
Users that are interested in InFi are comparing it to the libraries listed below
Sorting:
- PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices☆33Updated last year
- Source code and datasets for Ekya, a system for continuous learning on the edge.☆105Updated 3 years ago
- ☆56Updated 3 years ago
- ☆16Updated last year
- 云边协同- collaborative inference📚Dynamic adaptive DNN surgery for inference acceleration on the edge☆40Updated last year
- This is a list of awesome edgeAI inference related papers.☆96Updated last year
- Autodidactic Neurosurgeon Collaborative Deep Inference for Mobile Edge Intelligence via Online Learning☆40Updated 3 years ago
- ☆201Updated last year
- ☆9Updated last year
- ☆13Updated 5 years ago
- Deep Compressive Offloading: Speeding Up Neural Network Inference by Trading Edge Computation for Network Latency☆27Updated 4 years ago
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆34Updated 2 years ago
- ☆21Updated last year
- ☆45Updated 2 years ago
- Adaptive Model Streaming for real-time video inference on edge devices☆41Updated 3 years ago
- ☆14Updated last year
- A Distributed Camera System for Inference Scheduling and Continuous Learning in Video Analytics☆17Updated last year
- ☆77Updated 2 years ago
- a deep learning-driven scheduler for elastic training in deep learning clusters☆29Updated 4 years ago
- ☆99Updated last year
- Source code for Jellyfish, a soft real-time inference serving system☆12Updated 2 years ago
- A curated list of research in System for Edge Intelligence and Computing(Edge MLSys), including Frameworks, Tools, Repository, etc. Paper…☆30Updated 3 years ago
- ☆10Updated 4 years ago
- PyTorch implementation of the paper: Decomposing Vision Transformers for Collaborative Inference in Edge Devices☆12Updated 10 months ago
- DNN_Partition辅助工具,用于对pytorch模型进行简单的性能分析以及支持模型切分☆12Updated 4 years ago
- Server-driven Video Streaming for Deep Learning Inference☆91Updated 2 years ago
- ☆22Updated 2 years ago
- The implementation of paper : RTCoInfer: Real-time Edge-Cloud Collaborative CNN Inference for Stream Analytics on Ubiquitous Images☆14Updated 2 years ago
- A DNN model partition demo☆31Updated 5 years ago
- A curated list of early exiting (LLM, CV, NLP, etc)☆53Updated 9 months ago