LINC-BIT / legodnnLinks
LegoDNN: a block-grained scaling tool for mobile vision systems
☆50Updated 2 years ago
Alternatives and similar repositories for legodnn
Users that are interested in legodnn are comparing it to the libraries listed below
Sorting:
- [ICLR 2022] The Unreasonable Effectiveness of Random Pruning: Return of the Most Naive Baseline for Sparse Training by Shiwei Liu, Tianlo…☆73Updated 2 years ago
- A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.☆355Updated 11 months ago
- About DNN compression and acceleration on Edge Devices.☆55Updated 4 years ago
- To deploy Transformer models in CV to mobile devices.☆18Updated 3 years ago
- Source code and datasets for Ekya, a system for continuous learning on the edge.☆106Updated 3 years ago
- Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23)☆13Updated last year
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …☆71Updated 3 years ago
- Official implementation for ECCV 2022 paper LIMPQ - "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance"☆56Updated 2 years ago
- Arch-Net: Model Distillation for Architecture Agnostic Model Deployment☆23Updated 3 years ago
- Collections of model quantization algorithms. Any issues, please contact Peng Chen (blueardour@gmail.com)☆71Updated 3 years ago
- ☆17Updated 3 years ago
- This is the pytorch implementation for the paper: Generalizable Mixed-Precision Quantization via Attribution Rank Preservation, which is…☆25Updated 3 years ago
- InFi is a library for building input filters for resource-efficient inference.☆38Updated last year
- Code for High-Capacity Expert Binary Networks (ICLR 2021).☆27Updated 3 years ago
- ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training☆200Updated 2 years ago
- ☆130Updated last year
- ☆25Updated 3 years ago
- [CVPRW 2021] Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded Platforms☆29Updated 2 years ago
- [ICCV 2021] Code release for "Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks"☆32Updated 2 years ago
- Pytorch implementation of channel pruning and AMC☆32Updated 6 years ago
- [ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization☆95Updated 3 years ago
- Adaptive Model Streaming for real-time video inference on edge devices☆41Updated 3 years ago
- This is a list of awesome edgeAI inference related papers.☆96Updated last year
- [ICLR 2022] "Unified Vision Transformer Compression" by Shixing Yu*, Tianlong Chen*, Jiayi Shen, Huan Yuan, Jianchao Tan, Sen Yang, Ji Li…☆54Updated last year
- Spartan is an algorithm for training sparse neural network models. This repository accompanies the paper "Spartan Differentiable Sparsity…☆24Updated 2 years ago
- A highly modular PyTorch framework with a focus on Neural Architecture Search (NAS).☆23Updated 3 years ago
- Dynamic Channel Pruning: Feature Boosting and Suppression☆17Updated 6 years ago
- In progress.☆65Updated last year
- The official code for [ECCV2020] "HALO: Hardware-aware Learning to Optimize"☆10Updated 2 years ago
- Pytorch implementation of EENets☆19Updated 10 months ago