xingyul / sparse-winograd-cnnView external linksLinks
Efficient Sparse-Winograd Convolutional Neural Networks (ICLR 2018)
☆193May 7, 2019Updated 6 years ago
Alternatives and similar repositories for sparse-winograd-cnn
Users that are interested in sparse-winograd-cnn are comparing it to the libraries listed below
Sorting:
- Winograd minimal convolution algorithm generator for convolutional neural networks.☆626Updated this week
- Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20)☆27Oct 3, 2023Updated 2 years ago
- ☆26Dec 1, 2016Updated 9 years ago
- Fast CUDA Kernels for ResNet Inference.☆182May 26, 2019Updated 6 years ago
- Binary Neural Network on IceStick FPGA.☆54Jul 11, 2018Updated 7 years ago
- Deep learning with a multiplication budget☆47Jul 15, 2018Updated 7 years ago
- I'm going to use the Winograd’s minimal filtering algorithms to introduce a new class of fast algorithms for convolutional neural networks…☆12Mar 22, 2018Updated 7 years ago
- Implementation of ICLR 2018 paper "Loss-aware Weight Quantization of Deep Networks"☆27Oct 24, 2019Updated 6 years ago
- Implementation of the Winograd algorithm.☆24Nov 6, 2018Updated 7 years ago
- GPU implementation of Winograd convolution☆10Oct 23, 2017Updated 8 years ago
- Codes for Accepted Paper : "MetaQuant: Learning to Quantize by Learning to Penetrate Non-differentiable Quantization" in NeurIPS 2019☆54May 8, 2020Updated 5 years ago
- A Winograd Minimal Filter Implementation in CUDA☆28Aug 25, 2021Updated 4 years ago
- ☆19Aug 26, 2021Updated 4 years ago
- Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)☆1,086May 2, 2024Updated last year
- This repository represents training examples for the CVPR 2018 paper "SYQ:Learning Symmetric Quantization For Efficient Deep Neural Netwo…☆31Jul 25, 2019Updated 6 years ago
- Caffe Implementation for Incremental network quantization☆190Jul 29, 2018Updated 7 years ago
- This is originally a collection of papers on neural network accelerators. Now it's more like my selection of research on deep learning an…☆2,048Nov 8, 2025Updated 3 months ago
- Code example for the ICLR 2018 oral paper☆152May 31, 2018Updated 7 years ago
- Test winograd convolution written in TVM for CUDA and AMDGPU☆41Oct 12, 2018Updated 7 years ago
- Codes for Binary Ensemble Neural Network: More Bits per Network or More Networks per Bit?☆31Oct 7, 2019Updated 6 years ago
- An OpenCL-Based FPGA Accelerator for Compressed YOLOv2☆39May 27, 2021Updated 4 years ago
- Graph Transforms to Quantize and Retrain Deep Neural Nets in TensorFlow☆169Dec 9, 2019Updated 6 years ago
- Low-precision matrix multiplication☆1,832Jan 29, 2024Updated 2 years ago
- Assembler for NVIDIA Volta and Turing GPUs☆239Jan 13, 2022Updated 4 years ago
- CNNs in Halide☆23Oct 22, 2015Updated 10 years ago
- Caffe for Sparse and Low-rank Deep Neural Networks☆382Mar 8, 2020Updated 5 years ago
- Automatic Schedule Exploration and Optimization Framework for Tensor Computations☆182Apr 25, 2022Updated 3 years ago
- CondenseNet: Light weighted CNN for mobile devices☆691Nov 11, 2019Updated 6 years ago
- An efficient framework for convolutional neural networks☆278Aug 30, 2023Updated 2 years ago
- Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators☆1,549Aug 28, 2019Updated 6 years ago
- Caffe implementation for dynamic network surgery.☆189Aug 15, 2017Updated 8 years ago
- Training Low-bits DNNs with Stochastic Quantization☆74Aug 4, 2017Updated 8 years ago
- An HLS based winograd systolic CNN accelerator☆54Jul 18, 2021Updated 4 years ago
- Explore the energy-efficient dataflow scheduling for neural networks.☆233Aug 24, 2020Updated 5 years ago
- collection of works aiming at reducing model sizes or the ASIC/FPGA accelerator for machine learning☆566Feb 3, 2024Updated 2 years ago
- Rethinking the Value of Network Pruning (Pytorch) (ICLR 2019)☆1,516Jun 7, 2020Updated 5 years ago
- Tensorflow codes for "Rethinking the Smaller-Norm-Less-Informative Assumption in Channel Pruning of Convolution Layers"☆30Oct 14, 2019Updated 6 years ago
- Implementation of ICLR 2017 paper "Loss-aware Binarization of Deep Networks"☆20Feb 24, 2019Updated 6 years ago
- Code for paper "Energy-Constrained Compression for Deep Neural Networks via Weighted Sparse Projection and Layer Input Masking"☆18May 7, 2019Updated 6 years ago