lz1313 / BlockCIrculantRNN
BlockCIrculantRNN (LSTM and GRU) using TensorFlow
☆14 Updated 6 years ago
Alternatives and similar repositories for BlockCIrculantRNN:
Users who are interested in BlockCIrculantRNN are comparing it to the libraries listed below.
- ☆14 Updated 5 years ago
- Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20) ☆27 Updated last year
- This is a collection of works on neural networks and neural accelerators. ☆40 Updated 6 years ago
- PyTorch implementation of DiracDeltaNet from paper Synetgy: Algorithm-hardware Co-design for ConvNet Accelerators on Embedded FPGAs ☆31 Updated 5 years ago
- ☆36 Updated 6 years ago
- A general framework for optimizing DNN dataflow on systolic array ☆34 Updated 4 years ago
- Simulator for BitFusion ☆97 Updated 4 years ago
- ☆33 Updated 5 years ago
- ☆70 Updated 5 years ago
- Approximate layers - TensorFlow extension ☆27 Updated 11 months ago
- Code for Binary Ensemble Neural Network: More Bits per Network or More Networks per Bit? ☆31 Updated 5 years ago
- This repository represents training examples for the CVPR 2018 paper "SYQ: Learning Symmetric Quantization For Efficient Deep Neural Netwo… ☆31 Updated 5 years ago
- MAERI: A DNN accelerator with reconfigurable interconnects to support flexible dataflow (http://synergy.ece.gatech.edu/tools/maeri/) ☆65 Updated 3 years ago
- ☆32 Updated 4 years ago
- Provides the code for the paper "EBPC: Extended Bit-Plane Compression for Deep Neural Network Inference and Training Accelerators" by Luk… ☆17 Updated 5 years ago
- Reproduction of WAGE in PyTorch. ☆41 Updated 6 years ago
- ☆14 Updated last year
- ☆33 Updated 3 years ago
- FlexASR: A Reconfigurable Hardware Accelerator for Attention-based Seq-to-Seq Networks ☆42 Updated last month
- This repository contains the PyTorch scripts to train mixed-precision networks for microcontroller deployment, based on the memory contr… ☆49 Updated 10 months ago
- Residual Binarized Neural Network ☆43 Updated 7 years ago
- Python code to show how a systolic array works. Written for https://medium.com/@antonpaquin/whats-inside-a-tpu-c013eb51973e ☆28 Updated 6 years ago
- PyTorch implementation of TQT. ☆21 Updated 3 years ago
- ☆19 Updated 4 years ago
- Docker container with tools for the Timeloop/Accelergy tutorial ☆22 Updated 11 months ago
- Post-training sparsity-aware quantization ☆34 Updated 2 years ago
- ColTraIn HBFP Training Emulator ☆16 Updated 2 years ago
- PyTorch fixed-point training tool/framework ☆34 Updated 4 years ago
- HLS-implemented systolic array structure ☆41 Updated 7 years ago
- DNN quantization with outlier channel splitting ☆112 Updated 5 years ago