agural / memory-optimal-direct-convolutions

Code for reproducing the work of the ICML 2019 paper: Memory-Optimal Direct Convolutions for Maximizing Classification Accuracy in Embedded Applications

☆12

Alternatives and similar repositories for memory-optimal-direct-convolutions

Users that are interested in memory-optimal-direct-convolutions are comparing it to the libraries listed below

anony-sub / chameleon
Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation
☆27 Updated 5 years ago
fastconvnets / cvpr2020
Code for "Fast Sparse ConvNets" CVPR 2020 submissions
☆13 Updated 5 years ago
YukeWang96 / DSXplore_IPDPS21
Artifact for IPDPS'21: DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolutions.
☆13 Updated 4 years ago
yukang2017 / NAS-quantization
The code for Joint Neural Architecture Search and Quantization
☆13 Updated 6 years ago
GATECH-EIC / SuperTickets
[ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning
☆20 Updated 3 years ago
mlzxy / qsparse
Train neural networks with joint quantization and pruning on both weights and activations using any PyTorch modules
☆42 Updated 2 years ago
GATECH-EIC / CPT
[ICLR 2021 Spotlight] "CPT: Efficient Deep Neural Network Training via Cyclic Precision" by Yonggan Fu, Han Guo, Meng Li, Xin Yang, Yinin…
☆31 Updated last year
wangmaolin / niti
Implementation of "NITI: Training Integer Neural Networks Using Integer-only Arithmetic" on arXiv
☆84 Updated 2 years ago
gilshm / sparq
Post-training sparsity-aware quantization
☆34 Updated 2 years ago
jaewoosong / pocketnn
The official, proof-of-concept C++ implementation of PocketNN.
☆34 Updated last year
kumasento / gconv-prune
Code repository for paper "Efficient Structured Pruning and Architecture Searching for Group Convolution" https://arxiv.org/abs/1811.0934…
☆8 Updated 3 years ago
parsa-epfl / HBFPEmulator
ColTraIn HBFP Training Emulator
☆16 Updated 2 years ago
AamirRaihan / SWAT
Official implementation of NeurIPS 2020 "Sparse Weight Activation Training" paper.
☆27 Updated 3 years ago
Jangho-Kim / PSG-pytorch
Position-based Scaled Gradient for Model Quantization and Pruning Code (NeurIPS 2020)
☆26 Updated 4 years ago
Torment123 / DFS
☆15 Updated 5 years ago
HayeonLee / HELP
Official PyTorch Implementation of HELP: Hardware-adaptive Efficient Latency Prediction for NAS via Meta-Learning (NeurIPS 2021 Spotlight…
☆63 Updated 11 months ago
VITA-Group / WeakNAS
[NeurIPS 2021] “Stronger NAS with Weaker Predictors”, Junru Wu, Xiyang Dai, Dongdong Chen, Yinpeng Chen, Mengchen Liu, Ye Yu, Zhangyang W…
☆27 Updated 2 years ago
YashasSamaga / ConvolutionBuildingBlocks
GEMM and Winograd based convolutions using CUTLASS
☆26 Updated 5 years ago
plumerai / rethinking-bnn-optimization
Implementation for the paper "Latent Weights Do Not Exist: Rethinking Binarized Neural Network Optimization"
☆74 Updated 5 years ago
pytorchfi / pytorchfi
A runtime fault injection tool for PyTorch
☆115 Updated 11 months ago
BayesWatch / pytorch-blockswap
Code for BlockSwap (ICLR 2020).
☆33 Updated 4 years ago
ysbsb / awesome-quantization
Awesome Quantization Paper lists with Codes
☆11 Updated 4 years ago
iwls2020-lsml-contest / iwls2020-lsml-contest
☆21 Updated 2 years ago
yaozhewei / HAP
☆43 Updated last year
limenghao / AdaTune
This is the implementation for the paper: AdaTune: Adaptive Tensor Program Compilation Made Efficient (NeurIPS 2020).
☆14 Updated 4 years ago
1adrianb / expert-binary-networks
Code for High-Capacity Expert Binary Networks (ICLR 2021).
☆27 Updated 3 years ago
Zhen-Dong / CoDeNet
[FPGA'21] CoDeNet is an efficient object detection model on PyTorch, with SOTA performance on VOC and COCO based on CenterNet and Co-Desi…
☆26 Updated 2 years ago
XinDongol / DNNAC
All about acceleration and compression of Deep Neural Networks
☆33 Updated 5 years ago
ARM-software / scalpel
This is a PyTorch implementation of Scalpel. Node pruning for five benchmark networks and SIMD-aware weight pruning for LeNet-300-100…
☆41 Updated 6 years ago
GATECH-EIC / Auto-NBA
[ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga…
☆16 Updated 3 years ago