SKKU-ESLAB / Auto-Compression
Automatic DNN compression tool with various model compression and neural architecture search techniques
☆20Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for Auto-Compression
- CNN functions for dense matrices resident in flash storage☆23Updated 5 years ago
- arm compute library implementation of efficient low precision neural network☆24Updated 3 years ago
- ANT framework's model database that provides DNN models for the various range of IoT devices☆16Updated 3 years ago
- Enhanced version of IoT.js for ANT Framework - Platform for Internet of Things with JavaScript☆15Updated 3 years ago
- Virtual Connection: Framework for P2P Communication Abstraction☆23Updated 4 years ago
- ANT (AI-based Networked Things) Framework☆26Updated 10 months ago
- IoT.js of ANT based on Tizen RT☆14Updated 4 years ago
- Neural Network Acceleration using CPU/GPU, ASIC, FPGA☆60Updated 4 years ago
- Post-training sparsity-aware quantization☆33Updated last year
- nnq_cnd_study stands for Neural Network Quantization & Compact Networks Design Study☆13Updated 4 years ago
- LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale☆58Updated last month
- Codes for Binary Ensemble Neural Network: More Bits per Network or More Networks per Bit?☆31Updated 5 years ago
- ☆36Updated 5 years ago
- BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)☆36Updated 3 years ago
- Measuring and predicting on-device metrics (latency, power, etc.) of machine learning models☆66Updated last year
- PyTorch implementation of EdMIPS: https://arxiv.org/pdf/2004.05795.pdf☆57Updated 4 years ago
- Implement of Dynamic Model Pruning with Feedback with pytorch☆39Updated 2 years ago
- NEST-SNN☆13Updated 2 years ago
- ☆47Updated 2 years ago
- A pytorch implementation of DoReFa-Net☆131Updated 4 years ago
- Neural Network Acceleration such as ASIC, FPGA, GPU, and PIM☆51Updated 4 years ago
- PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction.☆258Updated last year
- Official PyTorch Implementation of HELP: Hardware-adaptive Efficient Latency Prediction for NAS via Meta-Learning (NeurIPS 2021 Spotlight…☆60Updated 3 months ago
- Conditional channel- and precision-pruning on neural networks☆72Updated 4 years ago
- ☆12Updated 4 years ago
- Source code for the paper: "A Latency-Predictable Multi-Dimensional Optimization Framework forDNN-driven Autonomous Systems"☆19Updated 3 years ago
- PyTorch implementation for the APoT quantization (ICLR 2020)☆269Updated 2 years ago
- The code for paper: Neuralpower: Predict and deploy energy-efficient convolutional neural networks☆21Updated 5 years ago
- ☆45Updated 5 years ago