MXHX7199 / ICCV_2021_AFPLinks
AFP is a hardware-friendly quantization framework for DNNs, which is contributed by Fangxin Liu and Wenbo Zhao.
☆13Updated 4 years ago
Alternatives and similar repositories for ICCV_2021_AFP
Users that are interested in ICCV_2021_AFP are comparing it to the libraries listed below
Sorting:
- BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)☆42Updated 4 years ago
- ☆31Updated 8 months ago
- ☆18Updated 2 years ago
- ☆33Updated this week
- Open-source of MSD framework☆16Updated 2 years ago
- Here are some implementations of basic hardware units in RTL language (verilog for now), which can be used for area/power evaluation and …☆12Updated 2 years ago
- Fast Emulation of Approximate DNN Accelerators in PyTorch☆28Updated last year
- bitfusion verilog implementation☆12Updated 3 years ago
- FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with Fractional Activations☆95Updated 4 years ago
- ☆35Updated 5 years ago
- [TCAD'23] AccelTran: A Sparsity-Aware Accelerator for Transformers☆54Updated 2 years ago
- Sparse CNN Accelerator targeting Intel FPGA☆12Updated 4 years ago
- [ICASSP'20] DNN-Chip Predictor: An Analytical Performance Predictor for DNN Accelerators with Various Dataflows and Hardware Architecture…☆25Updated 3 years ago
- [ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga…☆16Updated 3 years ago
- A bit-level sparsity-awared multiply-accumulate process element.☆18Updated last year
- ☆19Updated 4 years ago
- A co-design architecture on sparse attention☆54Updated 4 years ago
- Neural Network Quantization With Fractional Bit-widths☆11Updated 4 years ago
- [HPCA 2023] ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design☆124Updated 2 years ago
- ☆42Updated last year
- ☆47Updated 4 years ago
- ☆19Updated 4 years ago
- The official implementation of HPCA 2025 paper, Prosperity: Accelerating Spiking Neural Networks via Product Sparsity☆36Updated 4 months ago
- ☆113Updated 2 years ago
- Simulator for BitFusion☆102Updated 5 years ago
- Official implementation of EMNLP'23 paper "Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?"☆24Updated 2 years ago
- A framework for fast exploration of the depth-first scheduling space for DNN accelerators☆43Updated 2 years ago
- ☆71Updated 5 years ago
- [ASP-DAC 2025] "NeuronQuant: Accurate and Efficient Post-Training Quantization for Spiking Neural Networks" Official Implementation☆13Updated 9 months ago
- ☆16Updated 3 years ago