learning1234embed / NeuralWeightVirtualization
[MobiSys 2020] Fast and Scalable In-memory Deep Multitask Learning via Neural Weight Virtualization
☆16Updated 4 years ago
Alternatives and similar repositories for NeuralWeightVirtualization:
Users who are interested in NeuralWeightVirtualization compare it to the libraries listed below.
- This is a list of awesome edge AI inference-related papers.☆95Updated last year
- BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)☆40Updated 4 years ago
- PyTorch implementation of "Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference"☆55Updated 5 years ago
- Measuring and predicting on-device metrics (latency, power, etc.) of machine learning models☆66Updated last year
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆96Updated 3 years ago
- [ICLR 2021] HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark☆110Updated last year
- ☆36Updated 6 years ago
- Code for our paper at ECCV 2020: Post-Training Piecewise Linear Quantization for Deep Neural Networks☆68Updated 3 years ago
- Official implementation for paper LIMPQ, "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance", ECCV 2022☆53Updated 2 years ago
- ☆125Updated last year
- zTT: Learning-based DVFS with Zero Thermal Throttling for Mobile Devices [MobiSys'21] - Artifact Evaluation☆24Updated 3 years ago
- Post-training sparsity-aware quantization☆34Updated 2 years ago
- This repository contains the PyTorch scripts to train mixed-precision networks for microcontroller deployment, based on the memory contr…☆49Updated 10 months ago
- MobiSys#114☆21Updated last year
- ☆77Updated last year
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆34Updated last year
- [ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization☆94Updated 2 years ago
- ProxQuant: Quantized Neural Networks via Proximal Operators☆29Updated 6 years ago
- Official PyTorch Implementation of HELP: Hardware-adaptive Efficient Latency Prediction for NAS via Meta-Learning (NeurIPS 2021 Spotlight…☆62Updated 7 months ago
- BitSplit Post-training Quantization☆49Updated 3 years ago
- Source code for the paper: "A Latency-Predictable Multi-Dimensional Optimization Framework for DNN-driven Autonomous Systems"☆22Updated 4 years ago
- About DNN compression and acceleration on Edge Devices.☆56Updated 3 years ago
- ☆39Updated 2 years ago
- Experimental deep learning framework written in Rust☆14Updated 2 years ago
- ☆35Updated 2 years ago
- CMix-NN: Mixed Low-Precision CNN Library for Memory-Constrained Edge Devices☆41Updated 5 years ago
- Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022.☆129Updated 2 years ago
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation☆27Updated 5 years ago
- PyTorch implementation for the APoT quantization (ICLR 2020)☆271Updated 3 months ago
- DNN quantization with outlier channel splitting☆112Updated 5 years ago