LINC-BIT / legodnn
LegoDNN: a block-grained scaling tool for mobile vision systems (☆51, updated last year). A rough concept sketch of block-grained scaling follows below.
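LegoDNN's own API is not reproduced here. As a minimal concept sketch of the block-grained idea — each original block has several compressed "descendant" variants, and the runtime picks one variant per block to fit a resource budget — here is a hypothetical greedy selector in Python. `select_variants` and all profile numbers are invented for illustration, not LegoDNN's interface.

```python
# Concept sketch only: pick one descendant variant per block so that total
# latency stays within a budget, greedily maximizing accuracy gain per ms.
# All names and numbers are hypothetical, not LegoDNN's API.

def select_variants(blocks, budget_ms):
    """blocks[i] holds (latency_ms, accuracy_proxy) tuples for block i."""
    # Start from the cheapest variant of every block.
    choice = [min(range(len(vs)), key=lambda j: vs[j][0]) for vs in blocks]
    spent = sum(blocks[i][choice[i]][0] for i in range(len(blocks)))

    improved = True
    while improved:
        improved = False
        best = None  # (accuracy gain per extra ms, block index, variant index)
        for i, vs in enumerate(blocks):
            cur_lat, cur_acc = vs[choice[i]]
            for j, (lat, acc) in enumerate(vs):
                extra = lat - cur_lat
                if acc > cur_acc and spent + extra <= budget_ms:
                    gain = (acc - cur_acc) / max(extra, 1e-6)
                    if best is None or gain > best[0]:
                        best = (gain, i, j)
        if best:
            _, i, j = best
            spent += blocks[i][j][0] - blocks[i][choice[i]][0]
            choice[i] = j
            improved = True
    return choice, spent

# Hypothetical per-block latency/accuracy profiles: two blocks, three variants.
blocks = [[(5.0, 0.60), (8.0, 0.70), (12.0, 0.74)],
          [(4.0, 0.55), (9.0, 0.68), (15.0, 0.72)]]
print(select_variants(blocks, budget_ms=20.0))  # -> ([1, 1], 17.0)
```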
Alternatives and similar repositories for legodnn:
Users interested in legodnn compare it with the libraries listed below.
- A curated list of papers on edge-AI inference (☆92, updated last year)
- Source code and datasets for Ekya, a system for continuous learning on the edge (☆103, updated 2 years ago)
- Deploying computer-vision Transformer models to mobile devices (☆17, updated 3 years ago)
- InFi, a library for building input filters for resource-efficient inference (☆37, updated last year)
- Deep Compressive Offloading: Speeding Up Neural Network Inference by Trading Edge Computation for Network Latency (☆26, updated 4 years ago)
- Autodidactic Neurosurgeon: Collaborative Deep Inference for Mobile Edge Intelligence via Online Learning (☆40, updated 3 years ago)
- A DNN inference latency prediction toolkit for accurately modeling and predicting latency on diverse edge devices (☆342, updated 6 months ago)
- DNN compression and acceleration on edge devices (☆55, updated 3 years ago)
- Cache design for CNNs on mobile devices (☆32, updated 6 years ago)
- Measuring and predicting on-device metrics (latency, power, etc.) of machine learning models (☆66, updated last year)
- Adaptive Model Streaming for real-time video inference on edge devices (☆41, updated 3 years ago)
- Experiments on BranchyNet partitioning, implemented in PyTorch (☆27, updated 4 years ago)
- A PyTorch-based early-exit network inspired by BranchyNet (☆31, updated 2 weeks ago); a minimal early-exit sketch appears after this list
- [ICLR 2018] Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training (☆218, updated 7 months ago); a toy sparsification sketch appears after this list
- A curated list of research on systems for edge intelligence and computing (Edge MLSys), including frameworks, tools, repositories, papers, etc. (☆26, updated 3 years ago)
- PyTorch implementation of the paper "Generalizable Mixed-Precision Quantization via Attribution Rank Preservation" (ICCV 2021) (☆25, updated 3 years ago)
- [MobiSys 2020] Fast and Scalable In-memory Deep Multitask Learning via Neural Weight Virtualization (☆16, updated 4 years ago)
- Server-driven Video Streaming for Deep Learning Inference (☆87, updated 2 years ago)
- Two-stage pruning to favor distributed inference: the local device computes half of the model and uploads the features for further computation on a stronger machine (☆23, updated 6 years ago); a split-inference sketch appears after this list
- Source code for Jellyfish, a soft real-time inference serving system (☆12, updated 2 years ago)
- Official PyTorch implementation of HELP: Hardware-adaptive Efficient Latency Prediction for NAS via Meta-Learning (NeurIPS 2021 Spotlight) (☆61, updated 6 months ago)
- PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices (☆29, updated last year)
- MobiSys #114 (☆21, updated last year)
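For the BranchyNet-inspired early-exit entries above, here is a minimal BranchyNet-style sketch in PyTorch. The module, layer sizes, and entropy threshold are assumptions for illustration, not the API of any repository listed here: a side branch after an early stage returns its prediction when the softmax entropy is low enough.

```python
# Minimal early-exit sketch: exit at the first head when it is confident
# (low softmax entropy), otherwise continue to the final head.

import torch
import torch.nn as nn
import torch.nn.functional as F

class EarlyExitNet(nn.Module):
    def __init__(self, num_classes=10, entropy_threshold=0.5):
        super().__init__()
        self.stage1 = nn.Sequential(
            nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2))                                # 32x32 -> 16x16
        self.exit1 = nn.Linear(16 * 16 * 16, num_classes)   # early head
        self.stage2 = nn.Sequential(
            nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1))
        self.exit2 = nn.Linear(32, num_classes)             # final head
        self.entropy_threshold = entropy_threshold

    def forward(self, x):
        h = self.stage1(x)
        logits1 = self.exit1(h.flatten(1))
        if not self.training:
            p = F.softmax(logits1, dim=1)
            entropy = -(p * p.clamp_min(1e-12).log()).sum(dim=1)
            if bool((entropy < self.entropy_threshold).all()):
                return logits1                              # confident: exit early
        return self.exit2(self.stage2(h).flatten(1))

model = EarlyExitNet().eval()
with torch.no_grad():
    out = model(torch.randn(1, 3, 32, 32))
print(out.shape)  # torch.Size([1, 10])
```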
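For the Deep Gradient Compression entry, here is a toy sketch of the core gradient-sparsification idea: send only the top-k gradient entries and accumulate the rest locally as a residual for later rounds. Class and method names are made up, and DGC's momentum correction and clipping are omitted.

```python
# Toy top-k gradient sparsification with local residual accumulation.

import torch

class TopKCompressor:
    def __init__(self, ratio=0.01):
        self.ratio = ratio       # fraction of gradient entries to send
        self.residual = {}       # per-tensor leftover gradient, kept locally

    def compress(self, name, grad):
        # Add back what was withheld in earlier rounds.
        g = grad + self.residual.get(name, torch.zeros_like(grad))
        k = max(1, int(g.numel() * self.ratio))
        flat = g.flatten()
        idx = flat.abs().topk(k).indices
        values = flat[idx]
        # Everything not sent stays behind as the new residual.
        rest = flat.clone()
        rest[idx] = 0
        self.residual[name] = rest.view_as(g)
        return idx, values, g.shape

comp = TopKCompressor(ratio=0.1)
idx, vals, shape = comp.compress("layer1.weight", torch.randn(4, 4))
print(idx.numel(), "of", shape.numel(), "entries sent")
```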
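For the two-stage pruning entry, here is a toy split-inference sketch: run the first half of a model on the device, ship the intermediate feature tensor, and finish on a stronger machine. The backbone and split point are hypothetical, and the network transfer is elided.

```python
# Toy split inference: device runs layers [0, split), the server runs the rest.

import torch
import torch.nn as nn

backbone = nn.Sequential(
    nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),  # device half
    nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(32, 10),    # server half
)
split = 3
device_half = backbone[:split]
server_half = backbone[split:]

with torch.no_grad():
    feature = device_half(torch.randn(1, 3, 32, 32))  # the "uploaded" tensor
    logits = server_half(feature)
print(feature.shape, logits.shape)
```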