Zhen-Dong / CoDeNet
[FPGA'21] CoDeNet is an efficient object detection model on PyTorch, with SOTA performance on VOC and COCO based on CenterNet and Co-Designed deformable convolution.
☆25Updated 2 years ago
Alternatives and similar repositories for CoDeNet:
Users that are interested in CoDeNet are comparing it to the libraries listed below
- ☆19Updated 3 years ago
- [ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga…☆15Updated 3 years ago
- ☆32Updated 4 years ago
- [TCAD 2021] Block Convolution: Towards Memory-Efficient Inference of Large-Scale CNNs on FPGA☆16Updated 2 years ago
- BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.☆51Updated 2 years ago
- A Out-of-box PyTorch Scaffold for Neural Network Quantization-Aware-Training (QAT) Research. Website: https://github.com/zhutmost/neuralz…☆26Updated 2 years ago
- ☆69Updated 4 years ago
- BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)☆38Updated 4 years ago
- Algorithm-hardware Co-design for Deformable Convolution☆24Updated 4 years ago
- Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20)☆27Updated last year
- Post-training sparsity-aware quantization☆34Updated last year
- ☆18Updated 3 years ago
- A FPGA-based neural network inference accelerator, which won the third place in DAC-SDC☆28Updated 2 years ago
- ☆19Updated last year
- [TCAD'23] AccelTran: A Sparsity-Aware Accelerator for Transformers☆34Updated last year
- BitSplit Post-trining Quantization☆48Updated 3 years ago
- ☆25Updated 2 months ago
- Neural Network Quantization With Fractional Bit-widths☆12Updated 4 years ago
- Static Block Floating Point Quantization for CNN☆32Updated 3 years ago
- [HPCA 2023] ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design☆102Updated last year
- DeiT implementation for Q-ViT☆24Updated 2 years ago
- Designs for finalist teams of the DAC System Design Contest☆36Updated 4 years ago
- The code for Joint Neural Architecture Search and Quantization☆13Updated 5 years ago
- This is an implementation of YOLO using LSQ network quantization method.☆22Updated 2 years ago
- Approximate layers - TensorFlow extension☆27Updated 9 months ago
- Simulator for BitFusion☆95Updated 4 years ago
- ☆1Updated 4 years ago
- [ICASSP'20] DNN-Chip Predictor: An Analytical Performance Predictor for DNN Accelerators with Various Dataflows and Hardware Architecture…☆23Updated 2 years ago
- The official PyTorch implementation of the NeurIPS2022 (spotlight) paper, Outlier Suppression: Pushing the Limit of Low-bit Transformer L…☆48Updated 2 years ago