LINC-BIT / legodnn
LegoDNN: a block-grained scaling tool for mobile vision systems
☆51Updated 2 years ago
Alternatives and similar repositories for legodnn
Users who are interested in legodnn are comparing it to the libraries listed below.
- A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.☆360Updated last year
- ☆78Updated 2 years ago
- ☆133Updated 2 years ago
- ☆209Updated last year
- Source code and datasets for Ekya, a system for continuous learning on the edge.☆112Updated 3 years ago
- About DNN compression and acceleration on Edge Devices.☆57Updated 4 years ago
- InFi is a library for building input filters for resource-efficient inference.☆39Updated 2 years ago
- [ICLR 2022] The Unreasonable Effectiveness of Random Pruning: Return of the Most Naive Baseline for Sparse Training by Shiwei Liu, Tianlo…☆77Updated 2 years ago
- ☆19Updated 3 years ago
- Official implementation for ECCV 2022 paper LIMPQ - "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance"☆62Updated 2 years ago
- [MobiSys 2020] Fast and Scalable In-memory Deep Multitask Learning via Neural Weight Virtualization☆15Updated 5 years ago
- PyTorch-based early-exit network inspired by BranchyNet☆35Updated 5 months ago
- Experiments on BranchyNet partitioning using the PyTorch framework☆29Updated 5 years ago
- [ICLR 2021] HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark☆112Updated 2 years ago
- Deploying Transformer models for computer vision on mobile devices.☆18Updated 3 years ago
- A list of awesome edge AI inference-related papers.☆99Updated last year
- [ICLR 2018] Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training☆225Updated last year
- FedNAS: Federated Deep Learning via Neural Architecture Search☆54Updated 4 years ago
- ☆15Updated 2 years ago
- [CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework☆280Updated last year
- A Portable C Library for Distributed CNN Inference on IoT Edge Clusters☆86Updated 5 years ago
- PyTorch implementation of the paper "Generalizable Mixed-Precision Quantization via Attribution Rank Preservation", which is…☆24Updated 4 years ago
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …☆72Updated 3 years ago
- [ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization☆94Updated 3 years ago
- Adaptive Model Streaming for real-time video inference on edge devices☆41Updated 4 years ago
- [CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision☆398Updated 4 years ago
- Collections of model quantization algorithms. Any issues, please contact Peng Chen (blueardour@gmail.com)☆72Updated 4 years ago
- PyTorch implementation of EENets☆19Updated last year
- PyTorch Implementation of AutoPruner☆23Updated 5 years ago
- PyTorch implementation for the APoT quantization (ICLR 2020)☆277Updated 10 months ago