LINC-BIT / legodnn
LegoDNN: a block-grained scaling tool for mobile vision systems
☆50Updated last year
Alternatives and similar repositories for legodnn:
Users who are interested in legodnn are comparing it to the repositories listed below
- ☆14Updated last year
- This is a list of awesome edgeAI inference related papers.☆95Updated last year
- ☆127Updated last year
- Autodidactic Neurosurgeon: Collaborative Deep Inference for Mobile Edge Intelligence via Online Learning☆41Updated 3 years ago
- [CVPRW 2021] Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded Platforms☆29Updated 2 years ago
- Source code and datasets for Ekya, a system for continuous learning on the edge.☆105Updated 3 years ago
- This is the PyTorch implementation for the paper: Generalizable Mixed-Precision Quantization via Attribution Rank Preservation, which is…☆25Updated 3 years ago
- Measuring and predicting on-device metrics (latency, power, etc.) of machine learning models☆66Updated 2 years ago
- Deploying computer vision Transformer models to mobile devices.☆18Updated 3 years ago
- Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23)☆13Updated last year
- Deep Compressive Offloading: Speeding Up Neural Network Inference by Trading Edge Computation for Network Latency☆27Updated 4 years ago
- PyTorch-based early-exit network inspired by BranchyNet☆31Updated last week
- ☆25Updated 3 years ago
- [ICLR 2022] The Unreasonable Effectiveness of Random Pruning: Return of the Most Naive Baseline for Sparse Training by Shiwei Liu, Tianlo…☆73Updated 2 years ago
- ☆16Updated last year
- [ICPR 2020] "Neural Compression and Filtering for Edge-assisted Real-time Object Detection in Challenged Networks" and [ACM MobiCom EMDL …☆25Updated last year
- PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices☆30Updated last year
- ☆77Updated last year
- Cache design for CNN on mobile☆32Updated 6 years ago
- ☆56Updated 3 years ago
- ☆19Updated 3 years ago
- Dynamic Channel Pruning: Feature Boosting and Suppression☆17Updated 5 years ago
- MobiSys#114☆21Updated last year
- Official implementation for ECCV 2022 paper LIMPQ - "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance"☆53Updated 2 years ago
- A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.☆351Updated 8 months ago
- Splits Keras models with a TensorFlow backend into two or more submodels.☆18Updated 2 years ago
- InFi is a library for building input filters for resource-efficient inference.☆37Updated last year
- Adaptive Model Streaming for real-time video inference on edge devices☆41Updated 3 years ago
- FedNAS: Federated Deep Learning via Neural Architecture Search☆54Updated 3 years ago
- 2021 Summer Research Internship project (UROP) at Imperial College London. Supervised by Prof George Constantinides and Ben Biggs☆17Updated 2 years ago