learning1234embed / NeuralWeightVirtualization
[MobiSys 2020] Fast and Scalable In-memory Deep Multitask Learning via Neural Weight Virtualization
☆16Updated 4 years ago
Alternatives and similar repositories for NeuralWeightVirtualization:
Users that are interested in NeuralWeightVirtualization are comparing it to the libraries listed below
- Experimental deep learning framework written in Rust☆14Updated 2 years ago
- This is a list of awesome edgeAI inference related papers.☆96Updated last year
- Source code for the paper: "A Latency-Predictable Multi-Dimensional Optimization Framework forDNN-driven Autonomous Systems"☆22Updated 4 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆96Updated 3 years ago
- Any-Precision Deep Neural Networks (AAAI 2021)☆60Updated 5 years ago
- [ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization☆94Updated 3 years ago
- Code for our paper at ECCV 2020: Post-Training Piecewise Linear Quantization for Deep Neural Networks☆69Updated 3 years ago
- BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)☆40Updated 4 years ago
- Official implementation for ECCV 2022 paper LIMPQ - "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance"☆54Updated 2 years ago
- PyTorch implementation of "Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference"☆55Updated 5 years ago
- ☆77Updated last year
- Measuring and predicting on-device metrics (latency, power, etc.) of machine learning models☆66Updated 2 years ago
- Post-training sparsity-aware quantization☆34Updated 2 years ago
- BitSplit Post-trining Quantization☆49Updated 3 years ago
- Conditional channel- and precision-pruning on neural networks☆73Updated 5 years ago
- Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20)☆27Updated last year
- A pytorch implementation of DoReFa-Net☆133Updated 5 years ago
- The code for paper: Neuralpower: Predict and deploy energy-efficient convolutional neural networks☆21Updated 5 years ago
- Create tiny ML systems for on-device learning.☆20Updated 3 years ago
- About DNN compression and acceleration on Edge Devices.☆55Updated 3 years ago
- ProxQuant: Quantized Neural Networks via Proximal Operators☆29Updated 6 years ago
- ☆39Updated 2 years ago
- MobiSys#114☆21Updated last year
- ☆19Updated 3 years ago
- A curated list of early exiting (LLM, CV, NLP, etc)☆49Updated 8 months ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆34Updated last year
- The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading"[MobiCom'2022]☆19Updated 2 years ago
- ☆128Updated last year
- CMix-NN: Mixed Low-Precision CNN Library for Memory-Constrained Edge Devices☆41Updated 5 years ago
- Quantize pytorch model, support post-training quantization and quantization aware training methods☆13Updated last year