facebookresearch / deepfloat
An exploration of log domain "alternative floating point" for hardware ML/AI accelerators.
☆389Updated last year
Related projects ⓘ
Alternatives and complementary repositories for deepfloat
- Simple Training and Deployment of Fast End-to-End Binary Networks☆159Updated 2 years ago
- ☆47Updated 4 years ago
- Graph Transforms to Quantize and Retrain Deep Neural Nets in TensorFlow☆168Updated 4 years ago
- Symbolic Expression and Statement Module for new DSLs☆206Updated 4 years ago
- Ristretto: Caffe-based approximation of convolutional neural networks.☆292Updated 5 years ago
- Caffe for Sparse Convolutional Neural Network☆238Updated last year
- Efficient Sparse-Winograd Convolutional Neural Networks (ICLR 2018)☆190Updated 5 years ago
- CaffePresso: An Optimized Library for Deep Learning on Embedded Accelerator-based platforms☆88Updated last month
- (New version is out: https://github.com/hpi-xnor/BMXNet-v2) BMXNet: An Open-Source Binary Neural Network Implementation Based on MXNet☆351Updated 5 years ago
- ☆119Updated 6 years ago
- Winograd minimal convolution algorithm generator for convolutional neural networks.☆605Updated 4 years ago
- Caffe implementation of accurate low-precision neural networks☆118Updated 6 years ago
- Neural network visualizer and analyzer☆164Updated 6 years ago
- Code example for the ICLR 2018 oral paper☆149Updated 6 years ago
- Binarized Neural Network TF training code + C matrix / eval library.☆99Updated 7 years ago
- BISMO: A Scalable Bit-Serial Matrix Multiplication Overlay for Reconfigurable Computing☆131Updated 4 years ago
- FireSim-NVDLA: NVIDIA Deep Learning Accelerator (NVDLA) Integrated with RISC-V Rocket Chip SoC Running on the Amazon FPGA Cloud☆161Updated 2 years ago
- Quantize weights and activations in Recurrent Neural Networks.☆94Updated 6 years ago
- BinaryNets in TensorFlow with XNOR GEMM op☆155Updated 7 years ago
- Implementation for Trained Ternary Network.☆108Updated 7 years ago
- HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Heterogeneous Computing☆326Updated 7 months ago
- Explore the energy-efficient dataflow scheduling for neural networks.☆216Updated 4 years ago
- collection of works aiming at reducing model sizes or the ASIC/FPGA accelerator for machine learning☆555Updated 9 months ago
- Training deep neural networks with low precision multiplications☆63Updated 9 years ago
- This repository contains the results and code for the MLPerf™ Inference v0.5 benchmark.☆55Updated last year
- NVDLA SW☆489Updated 3 years ago
- Fast matrix multiplication for few-bit integer matrices on CPUs.☆27Updated 5 years ago
- A open source reimplementation of Google's Tensor Processing Unit (TPU).☆364Updated 6 years ago
- Highly optimized inference engine for Binarized Neural Networks☆243Updated 3 weeks ago
- Training Deep Neural Networks with binary weights during propagations☆378Updated 8 years ago