jwfromm / Riptide
Simple Training and Deployment of Fast End-to-End Binary Networks
☆159Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for Riptide
- Graph Transforms to Quantize and Retrain Deep Neural Nets in TensorFlow☆168Updated 4 years ago
- Efficient Sparse-Winograd Convolutional Neural Networks (ICLR 2018)☆190Updated 5 years ago
- [CVPR 2020] APQ: Joint Search for Network Architecture, Pruning and Quantization Policy☆156Updated 4 years ago
- Quantization of Convolutional Neural networks.☆239Updated 3 months ago
- Low Precision Arithmetic Simulation in PyTorch☆265Updated 6 months ago
- DNN quantization with outlier channel splitting☆112Updated 4 years ago
- Code example for the ICLR 2018 oral paper☆149Updated 6 years ago
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆195Updated 2 years ago
- ☆66Updated 5 years ago
- LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks☆239Updated 2 years ago
- ☆213Updated 6 years ago
- A PyTorch implementation of "Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights"☆164Updated 4 years ago
- Mayo: Auto-generation of hardware-friendly deep neural networks. Dynamic Channel Pruning: Feature Boosting and Suppression.☆114Updated 4 years ago
- [CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision☆370Updated 3 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆95Updated 3 years ago
- TVM integration into PyTorch☆453Updated 4 years ago
- PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction.☆258Updated last year
- [CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework☆274Updated 11 months ago
- Place for meetup slides☆140Updated 4 years ago
- Reference implementations of popular Binarized Neural Networks☆104Updated 3 weeks ago
- Network acceleration methods☆178Updated 3 years ago
- tophub autotvm log collections☆70Updated last year
- Benchmark of TVM quantized model on CUDA☆112Updated 4 years ago
- [ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing☆330Updated 4 months ago
- Repository containing pruned models and related information☆36Updated 3 years ago
- BMXNet 2: An Open-Source Binary Neural Network Implementation Based on MXNet☆231Updated 2 years ago
- Implements quantized distillation. Code for our paper "Model compression via distillation and quantization"☆330Updated 3 months ago
- Caffe implementation of accurate low-precision neural networks☆118Updated 6 years ago
- Quantize weights and activations in Recurrent Neural Networks.☆94Updated 6 years ago
- PyProf2: PyTorch Profiling tool☆83Updated 4 years ago