agural / memory-optimal-direct-convolutions
Code for reproducing work of ICML 2019 paper: Memory-Optimal Direct Convolutions for Maximizing Classification Accuracy in Embedded Applications
☆12Updated 5 years ago
Alternatives and similar repositories for memory-optimal-direct-convolutions:
Users that are interested in memory-optimal-direct-convolutions are comparing it to the libraries listed below
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation☆27Updated 5 years ago
- Code for "Fast Sparse ConvNets" CVPR2020 submissions☆13Updated 5 years ago
- This is the implementation for paper: AdaTune: Adaptive Tensor Program CompilationMade Efficient (NeurIPS 2020).☆13Updated 3 years ago
- [ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga…☆15Updated 3 years ago
- GEMM and Winograd based convolutions using CUTLASS☆26Updated 4 years ago
- [ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning☆20Updated 2 years ago
- An external memory allocator example for PyTorch.☆14Updated 3 years ago
- Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse.☆24Updated 4 years ago
- The code for Joint Neural Architecture Search and Quantization☆13Updated 6 years ago
- ☆12Updated 3 years ago
- ☆17Updated 3 years ago
- Artifact for IPDPS'21: DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolutions.☆13Updated 4 years ago
- ColTraIn HBFP Training Emulator☆16Updated 2 years ago
- Code repository for paper "Efficient Structured Pruning and Architecture Searching for Group Convolution" https://arxiv.org/abs/1811.0934…☆8Updated 3 years ago
- The code for paper: Neuralpower: Predict and deploy energy-efficient convolutional neural networks☆21Updated 5 years ago
- A Hackable Quantization Library for PyTorch☆20Updated 4 years ago
- The code for our paper "Neural Architecture Search as Program Transformation Exploration"☆18Updated 3 years ago
- ☆21Updated 2 months ago
- Accelerate convolution neural network for face recognition using GPU☆12Updated 4 years ago
- ☆29Updated 4 years ago
- You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms☆11Updated 2 years ago
- Accelerator simulation framework using nn_dataflow traces and energy, etc. post-processing☆7Updated 6 years ago
- An Attention Superoptimizer☆21Updated 3 months ago
- SAMO: Streaming Architecture Mapping Optimisation☆32Updated last year
- Post-training sparsity-aware quantization☆34Updated 2 years ago
- Memory Optimizations for Deep Learning (ICML 2023)☆64Updated last year
- An implementation of a BinaryConnect network for cifar10☆11Updated 5 years ago
- ☆10Updated 3 years ago
- Low Precision Arithmetic Simulation in PyTorch - extension for posit and beyond☆13Updated last year
- ParaDnn: A systematic performance analysis methodology for deep learning.☆39Updated 5 years ago