IBM / pfloat
An 8-/16-/32-/64-bit floating point number family
☆16 · Updated 3 years ago
Alternatives and similar repositories for pfloat
Users who are interested in pfloat are comparing it to the libraries listed below.
- An implementation of a BinaryConnect network for CIFAR-10 ☆11 · Updated 6 years ago
- Fork of upstream onnxruntime focused on supporting RISC-V accelerators ☆88 · Updated 2 years ago
- An out-of-the-box PyTorch scaffold for neural network quantization-aware training (QAT) research. Website: https://github.com/zhutmost/neuralz… ☆25 · Updated 2 years ago
- PyTorch implementation of TQT ☆21 · Updated 3 years ago
- Approximate layers: a TensorFlow extension ☆26 · Updated 7 months ago
- Provides the code for the paper "EBPC: Extended Bit-Plane Compression for Deep Neural Network Inference and Training Accelerators" by Luk… ☆19 · Updated 6 years ago
- HW/SW co-design of sentence-level energy optimizations for latency-aware multi-task NLP inference ☆53 · Updated last year
- Training with Block Minifloat number representation ☆17 · Updated 4 years ago
- Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20) ☆27 · Updated 2 years ago
- Eyeriss chip simulator ☆39 · Updated 5 years ago
- Linux Docker image for the DNN accelerator exploration infrastructure composed of Accelergy and Timeloop ☆60 · Updated last month
- Post-training sparsity-aware quantization ☆34 · Updated 2 years ago
- This repository contains the PyTorch scripts to train mixed-precision networks for microcontroller deployment, based on the memory contr… ☆50 · Updated last year
- BISMO: A Scalable Bit-Serial Matrix Multiplication Overlay for Reconfigurable Computing ☆146 · Updated 5 years ago
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation ☆27 · Updated 6 years ago
- DAC System Design Contest 2020 ☆29 · Updated 5 years ago
- PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware ☆112 · Updated last year
- Static Block Floating Point Quantization for CNNs ☆37 · Updated 4 years ago
- CSV spreadsheets and other material for AI accelerator survey papers ☆183 · Updated 2 weeks ago
- [FPGA'21] CoDeNet is an efficient object detection model on PyTorch, with SOTA performance on VOC and COCO based on CenterNet and Co-Desi… ☆27 · Updated 2 years ago