minhhn2910 / QPyTorch
Low Precision Arithmetic Simulation in PyTorch - extension for posit and beyond
☆13 Updated last year
Alternatives and similar repositories for QPyTorch:
Users who are interested in QPyTorch are comparing it to the libraries listed below:
- ☆14 Updated 5 years ago
- Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20) ☆27 Updated last year
- ☆57 Updated 4 years ago
- Training with Block Minifloat number representation ☆14 Updated 3 years ago
- ☆21 Updated 2 years ago
- ☆71 Updated 2 years ago
- Provides the hardware code for the paper "EBPC: Extended Bit-Plane Compression for Deep Neural Network Inference and Training Accelerator… ☆24 Updated 4 years ago
- Generate Versal system design from ONNX model. AI engine kernels. Sub-microsecond speeds for autoencoders. ☆9 Updated 2 months ago
- ColTraIn HBFP Training Emulator ☆16 Updated 2 years ago
- FlexASR: A Reconfigurable Hardware Accelerator for Attention-based Seq-to-Seq Networks ☆42 Updated 2 weeks ago
- MAERI: A DNN accelerator with reconfigurable interconnects to support flexible dataflow (http://synergy.ece.gatech.edu/tools/maeri/) ☆63 Updated 3 years ago
- FPGA-based hardware acceleration for dropout-based Bayesian Neural Networks. ☆23 Updated last year
- SAMO: Streaming Architecture Mapping Optimisation ☆32 Updated last year
- ☆70 Updated 4 years ago
- This is a collection of works on neural networks and neural accelerators. ☆40 Updated 6 years ago
- A Generic Distributed Auto-Tuning Infrastructure ☆22 Updated 3 years ago
- A Deep Learning Framework for the Posit Number System ☆27 Updated 7 months ago
- ☆19 Updated last month
- ☆19 Updated 3 years ago
- ☆32 Updated 4 years ago
- [ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga… ☆15 Updated 3 years ago
- Approximate layers - TensorFlow extension ☆27 Updated 10 months ago
- QuickEst repository: Quick Estimation of Quality of Results ☆26 Updated 6 years ago
- Adaptive floating-point based numerical format for resilient deep learning ☆14 Updated 2 years ago
- ☆23 Updated 2 years ago
- A floating-point matrix multiplication implemented in hardware ☆31 Updated 4 years ago
- A Spatial Accelerator Generation Framework for Tensor Algebra. ☆55 Updated 3 years ago
- PyTorch implementation of DiracDeltaNet from paper Synetgy: Algorithm-hardware Co-design for ConvNet Accelerators on Embedded FPGAs ☆31 Updated 5 years ago
- Simulator for BitFusion ☆96 Updated 4 years ago
- ☆28 Updated 4 months ago