learning1234embed / NeuralWeightVirtualization
[MobiSys 2020] Fast and Scalable In-memory Deep Multitask Learning via Neural Weight Virtualization
☆15Updated 4 years ago
Alternatives and similar repositories for NeuralWeightVirtualization:
Users who are interested in NeuralWeightVirtualization are comparing it to the repositories listed below
- This is a list of awesome edgeAI inference related papers.☆91Updated last year
- ☆74Updated last year
- zTT: Learning-based DVFS with Zero Thermal Throttling for Mobile Devices [MobiSys'21] - Artifact Evaluation☆22Updated 3 years ago
- Measuring and predicting on-device metrics (latency, power, etc.) of machine learning models☆66Updated last year
- Create tiny ML systems for on-device learning.☆20Updated 3 years ago
- Experimental deep learning framework written in Rust☆14Updated 2 years ago
- [ICLR 2021] HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark☆105Updated last year
- Source code for the paper: "A Latency-Predictable Multi-Dimensional Optimization Framework for DNN-driven Autonomous Systems"☆20Updated 4 years ago
- MobiSys#114☆21Updated last year
- The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading" [MobiCom'2022]☆18Updated 2 years ago
- ☆124Updated last year
- Cache design for CNN on mobile☆32Updated 6 years ago
- ☆192Updated last year
- A curated list of early exiting (LLM, CV, NLP, etc)☆38Updated 4 months ago
- BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)☆38Updated 4 years ago
- [ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization☆94Updated 2 years ago
- Official implementation for paper LIMPQ, "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance", ECCV 2022☆50Updated last year
- ☆36Updated 5 years ago
- ☆20Updated 4 years ago
- PyTorch implementation of "Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference"☆55Updated 5 years ago
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆196Updated 2 years ago
- ☆34Updated 2 years ago
- μNAS is a neural architecture search (NAS) system that designs small-yet-powerful microcontroller-compatible neural networks.☆78Updated 3 years ago
- Conditional channel- and precision-pruning on neural networks☆72Updated 4 years ago
- Post-training sparsity-aware quantization☆34Updated last year
- This repository contains the PyTorch scripts to train mixed-precision networks for microcontroller deployment, based on the memory contr…☆49Updated 8 months ago
- PyTorch implementation for the APoT quantization (ICLR 2020)☆270Updated last month
- ☆19Updated 2 years ago
- Code for our paper at ECCV 2020: Post-Training Piecewise Linear Quantization for Deep Neural Networks☆68Updated 3 years ago
- ☆37Updated 3 years ago