agural / memory-optimal-direct-convolutions
Code for reproducing work of ICML 2019 paper: Memory-Optimal Direct Convolutions for Maximizing Classification Accuracy in Embedded Applications
☆12Updated 5 years ago
Alternatives and similar repositories for memory-optimal-direct-convolutions:
Users that are interested in memory-optimal-direct-convolutions are comparing it to the libraries listed below
- Code for "Fast Sparse ConvNets" CVPR2020 submissions☆13Updated 5 years ago
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation☆27Updated 5 years ago
- [ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga…☆15Updated 3 years ago
- Artifact for IPDPS'21: DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolutions.☆13Updated 3 years ago
- [ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning☆19Updated 2 years ago
- You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms☆10Updated last year
- The official, proof-of-concept C++ implementation of PocketNN.☆31Updated 7 months ago
- Accelerator simulation framework using nn_dataflow traces and energy, etc. post-processing☆7Updated 5 years ago
- The code for our paper "Neural Architecture Search as Program Transformation Exploration"☆18Updated 3 years ago
- [ICLR 2021 Spotlight] "CPT: Efficient Deep Neural Network Training via Cyclic Precision" by Yonggan Fu, Han Guo, Meng Li, Xin Yang, Yinin…☆30Updated 10 months ago
- The code for Joint Neural Architecture Search and Quantization☆13Updated 5 years ago
- This is a PyTorch implementation of the Scalpel. Node pruning for five benchmark networks and SIMD-aware weight pruning for LeNet-300-100…☆41Updated 6 years ago
- The code for paper: Neuralpower: Predict and deploy energy-efficient convolutional neural networks☆21Updated 5 years ago
- ☆12Updated last year
- ☆14Updated 4 years ago
- GEMM and Winograd based convolutions using CUTLASS☆26Updated 4 years ago
- ☆20Updated last year
- Post-training sparsity-aware quantization☆34Updated last year
- ColTraIn HBFP Training Emulator☆16Updated last year
- An implementation of a BinaryConnect network for cifar10☆11Updated 5 years ago
- A 8-/16-/32-/64-bit floating point number family☆16Updated 2 years ago
- Training with Block Minifloat number representation☆14Updated 3 years ago
- Code for High-Capacity Expert Binary Networks (ICLR 2021).☆27Updated 3 years ago
- Implementation of the Winograd algorithm.☆23Updated 6 years ago
- Awesome Quantization Paper lists with Codes☆12Updated 3 years ago
- Code for BlockSwap (ICLR 2020).☆33Updated 3 years ago
- Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20)☆27Updated last year
- An external memory allocator example for PyTorch.☆14Updated 3 years ago
- Implementation of "NITI: Training Integer Neural Networks Using Integer-only Arithmetic" on arxiv☆79Updated 2 years ago