LINC-BIT / legodnn
LegoDNN: a block-grained scaling tool for mobile vision systems
☆50 Updated 2 years ago
Alternatives and similar repositories for legodnn
Users who are interested in legodnn are comparing it to the repositories listed below.
- ☆13 Updated last year
- ☆130 Updated last year
- [ICLR 2018] Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training ☆222 Updated 11 months ago
- Deploying Transformer models for computer vision on mobile devices. ☆18 Updated 3 years ago
- Deep Compressive Offloading: Speeding Up Neural Network Inference by Trading Edge Computation for Network Latency ☆28 Updated 4 years ago
- ☆202 Updated last year
- A list of awesome edge AI inference-related papers. ☆95 Updated last year
- Source code and datasets for Ekya, a system for continuous learning on the edge. ☆106 Updated 3 years ago
- DNN compression and acceleration on edge devices. ☆55 Updated 4 years ago
- Cache design for CNN on mobile ☆32 Updated 6 years ago
- Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23) ☆13 Updated last year
- InFi is a library for building input filters for resource-efficient inference. ☆38 Updated last year
- Autodidactic Neurosurgeon: Collaborative Deep Inference for Mobile Edge Intelligence via Online Learning ☆40 Updated 3 years ago
- PyTorch-based early-exit network inspired by BranchyNet ☆31 Updated last month
- ☆16 Updated last year
- Experiments on BranchyNet partitioning using the PyTorch framework ☆28 Updated 5 years ago
- ☆19 Updated 3 years ago
- ☆117 Updated 6 years ago
- Adaptive Model Streaming for real-time video inference on edge devices ☆41 Updated 3 years ago
- PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices ☆35 Updated last year
- MobiSys#114 ☆21 Updated last year
- [ICLR 2022] The Unreasonable Effectiveness of Random Pruning: Return of the Most Naive Baseline for Sparse Training by Shiwei Liu, Tianlo… ☆73 Updated 2 years ago
- PyTorch implementation of the paper: Generalizable Mixed-Precision Quantization via Attribution Rank Preservation, which is… ☆25 Updated 3 years ago
- ☆56 Updated 3 years ago
- FedNAS: Federated Deep Learning via Neural Architecture Search ☆55 Updated 3 years ago
- ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training ☆200 Updated 2 years ago
- A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices. ☆354 Updated 10 months ago
- Official implementation for ECCV 2022 paper LIMPQ - "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance" ☆54 Updated 2 years ago
- A curated list of research in System for Edge Intelligence and Computing (Edge MLSys), including Frameworks, Tools, Repository, etc. Paper… ☆30 Updated 3 years ago
- Experimental deep learning framework written in Rust ☆15 Updated 2 years ago