stepbuystep / LightNAS
You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms
β10Updated last year
Alternatives and similar repositories for LightNAS:
Users that are interested in LightNAS are comparing it to the libraries listed below
- Post-training sparsity-aware quantizationβ34Updated 2 years ago
- Code for ICML 2022 paper "SPDY: Accurate Pruning with Speedup Guarantees"β18Updated last year
- Personal Digest of NAS (Under Construction π )β25Updated 4 years ago
- The code for Joint Neural Architecture Search and Quantizationβ13Updated 5 years ago
- β43Updated last year
- Code for ICML 2021 submissionβ35Updated 3 years ago
- TQT's pytorch implementation.β21Updated 3 years ago
- β25Updated 3 years ago
- Artifact for IPDPS'21: DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolutions.β13Updated 3 years ago
- [ICLR 2021] HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmarkβ109Updated last year
- Official implementation for paper LIMPQ, "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance", ECCV 2022β51Updated last year
- An external memory allocator example for PyTorch.β14Updated 3 years ago
- [ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yongaβ¦β15Updated 3 years ago
- β17Updated 3 years ago
- This is the implementation for paper: AdaTune: Adaptive Tensor Program CompilationMade Efficient (NeurIPS 2020).β13Updated 3 years ago
- Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and optiβ¦β47Updated last year
- The official PyTorch implementation of the NeurIPS2022 (spotlight) paper, Outlier Suppression: Pushing the Limit of Low-bit Transformer Lβ¦β48Updated 2 years ago
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilationβ27Updated 5 years ago
- Measuring and predicting on-device metrics (latency, power, etc.) of machine learning modelsβ66Updated last year
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapoβ18Updated last year
- A Out-of-box PyTorch Scaffold for Neural Network Quantization-Aware-Training (QAT) Research. Website: https://github.com/zhutmost/neuralzβ¦β26Updated 2 years ago
- A collection of research papers on efficient training of DNNsβ70Updated 2 years ago
- code for the paper "A Statistical Framework for Low-bitwidth Training of Deep Neural Networks"β27Updated 4 years ago
- Official PyTorch Implementation of HELP: Hardware-adaptive Efficient Latency Prediction for NAS via Meta-Learning (NeurIPS 2021 Spotlightβ¦β61Updated 6 months ago
- Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20)β27Updated last year
- [ICCV-2023] EMQ: Evolving Training-free Proxies for Automated Mixed Precision Quantizationβ25Updated last year
- BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)β40Updated 4 years ago
- [ICML 2023] This project is the official implementation of our accepted ICML 2023 paper BiBench: Benchmarking and Analyzing Network Binarβ¦β54Updated 11 months ago
- BitSplit Post-trining Quantizationβ49Updated 3 years ago
- β18Updated 3 years ago