learning1234embed / NeuralWeightVirtualization

[MobiSys 2020] Fast and Scalable In-memory Deep Multitask Learning via Neural Weight Virtualization

☆15

Alternatives and similar repositories for NeuralWeightVirtualization

Users who are interested in NeuralWeightVirtualization are comparing it to the libraries listed below.


Kyrie-Zhao / awesome-real-time-AI
This is a list of awesome edge AI inference-related papers.
☆98 Updated last year
csu-eis / CoDL
☆78 Updated 2 years ago
microsoft / nn-Meter
A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.
☆359 Updated last year
Soroosh129 / NeuOS
Source code for the paper: "A Latency-Predictable Multi-Dimensional Optimization Framework for DNN-driven Autonomous Systems"
☆22 Updated 4 years ago
xumengwei / Edge-AI-Paper-List
☆208 Updated last year
yhhhli / APoT_Quantization
PyTorch implementation for the APoT quantization (ICLR 2020)
☆278 Updated 10 months ago
amirgholami / ZeroQ
[CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework
☆281 Updated last year
mit-han-lab / inter-operator-scheduler
[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration
☆199 Updated 3 years ago
A-suozhang / awesome-quantization-and-fixed-point-training
Neural Network Quantization & Low-Bit Fixed Point Training For Hardware-Friendly Algorithm Design
☆160 Updated 4 years ago
GATECH-EIC / HW-NAS-Bench
[ICLR 2021] HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark
☆113 Updated 2 years ago
jeho-lee / Awesome-On-Device-AI-Systems
☆86 Updated last month
eis-lab / sage
Experimental deep learning framework written in Rust
☆15 Updated 2 years ago
mit-han-lab / haq
[CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision
☆396 Updated 4 years ago
itayhubara / CalibTIP
Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming
☆98 Updated 4 years ago
submission2019 / cnn-quantization
Quantization of Convolutional Neural networks.
☆243 Updated last year
xumengwei / DeepCache
Cache design for CNN on mobile
☆34 Updated 7 years ago
1hunters / LIMPQ
Official implementation for ECCV 2022 paper LIMPQ - "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance"
☆62 Updated 2 years ago
SHI-Labs / Any-Precision-DNNs
Any-Precision Deep Neural Networks (AAAI 2021)
☆61 Updated 5 years ago
zzzxxxttt / pytorch_DoReFaNet
A PyTorch implementation of DoReFa-Net
☆132 Updated 5 years ago
snap-research / F8Net
[ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization
☆94 Updated 3 years ago
lcm97 / TX2-dnn-power-measurements
☆20 Updated 5 years ago
bzantium / pytorch-admm-pruning
Prune DNN using Alternating Direction Method of Multipliers (ADMM)
☆99 Updated 5 years ago
ztt-21 / zTT
zTT: Learning-based DVFS with Zero Thermal Throttling for Mobile Devices [MobiSys'21] - Artifact Evaluation
☆26 Updated 4 years ago
UbiquitousLearning / Mandheling-DSP-Training
The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading" [MobiCom'2022]
☆19 Updated 3 years ago
HayeonLee / HELP
Official PyTorch Implementation of HELP: Hardware-adaptive Efficient Latency Prediction for NAS via Meta-Learning (NeurIPS 2021 Spotlight…
☆63 Updated last year
qipengwang / Melon
MobiSys#114
☆22 Updated 2 years ago
Zhen-Dong / HAWQ
Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.
☆450 Updated 2 years ago
yeshaokai / ADMM-NN
☆36 Updated 6 years ago
jun-fang / PWLQ
Code for our paper at ECCV 2020: Post-Training Piecewise Linear Quantization for Deep Neural Networks
☆68 Updated 3 years ago
enyac-group / NeuralPower
The code for paper: Neuralpower: Predict and deploy energy-efficient convolutional neural networks
☆21 Updated 6 years ago