LINC-BIT / legodnn
LegoDNN: a block-grained scaling tool for mobile vision systems
☆51 Updated 2 years ago
Alternatives and similar repositories for legodnn
Users who are interested in legodnn are comparing it to the libraries listed below.
- A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices. ☆363 Updated last year
- ☆134 Updated 2 years ago
- ☆212 Updated 2 years ago
- About DNN compression and acceleration on Edge Devices. ☆57 Updated 4 years ago
- ☆78 Updated 2 years ago
- To deploy Transformer models in CV to mobile devices. ☆18 Updated 4 years ago
- [MobiSys 2020] Fast and Scalable In-memory Deep Multitask Learning via Neural Weight Virtualization ☆15 Updated 5 years ago
- This is a list of awesome edgeAI inference related papers. ☆98 Updated 2 years ago
- Source code and datasets for Ekya, a system for continuous learning on the edge. ☆112 Updated 3 years ago
- ☆16 Updated 2 years ago
- [ICLR 2021] HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark ☆114 Updated 2 years ago
- ☆57 Updated 4 years ago
- ☆15 Updated 2 years ago
- [CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision ☆400 Updated 4 years ago
- Official implementation for ECCV 2022 paper LIMPQ - "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance" ☆61 Updated 2 years ago
- This project will realize experiments about BranchyNet partitioning using pytorch framework ☆29 Updated 5 years ago
- [CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework ☆280 Updated 2 years ago
- Cache design for CNN on mobile ☆33 Updated 7 years ago
- ☆37 Updated 3 years ago
- 2021 Summer Research Internship project (UROP) at Imperial College London. Supervised by Prof George Constantinides and Ben Biggs ☆17 Updated 3 years ago
- [ICLR 2018] Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training ☆226 Updated last year
- Pytorch-based early exit network inspired by branchynet ☆34 Updated 8 months ago
- [ICLR 2022] The Unreasonable Effectiveness of Random Pruning: Return of the Most Naive Baseline for Sparse Training by Shiwei Liu, Tianlo… ☆77 Updated 3 years ago
- PyTorch implementation for the APoT quantization (ICLR 2020) ☆281 Updated last year
- ☆19 Updated 3 years ago
- InFi is a library for building input filters for resource-efficient inference. ☆41 Updated 2 years ago
- Autodidactic Neurosurgeon: Collaborative Deep Inference for Mobile Edge Intelligence via Online Learning ☆42 Updated 4 years ago
- Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23) ☆14 Updated 2 years ago
- Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022. ☆138 Updated 3 years ago
- [ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization ☆93 Updated 3 years ago