facebookresearch / deepfloat
An exploration of log domain "alternative floating point" for hardware ML/AI accelerators.
☆392Updated 2 years ago
Alternatives and similar repositories for deepfloat:
Users that are interested in deepfloat are comparing it to the libraries listed below
- BinaryNets in TensorFlow with XNOR GEMM op☆156Updated 7 years ago
- Simple Training and Deployment of Fast End-to-End Binary Networks☆158Updated 3 years ago
- Binarized Neural Network TF training code + C matrix / eval library.☆99Updated 7 years ago
- Ristretto: Caffe-based approximation of convolutional neural networks.☆290Updated 5 years ago
- Training deep neural networks with low precision multiplications☆63Updated 9 years ago
- (New version is out: https://github.com/hpi-xnor/BMXNet-v2) BMXNet: An Open-Source Binary Neural Network Implementation Based on MXNet☆350Updated 5 years ago
- ☆47Updated 5 years ago
- Implementation for Trained Ternary Network.☆108Updated 8 years ago
- Caffe implementation of accurate low-precision neural networks☆117Updated 6 years ago
- Winograd minimal convolution algorithm generator for convolutional neural networks.☆614Updated 4 years ago
- Caffe for Sparse Convolutional Neural Network☆238Updated 2 years ago
- ☆119Updated 7 years ago
- Efficient Sparse-Winograd Convolutional Neural Networks (ICLR 2018)☆190Updated 5 years ago
- Symbolic Expression and Statement Module for new DSLs☆205Updated 4 years ago
- Explore the energy-efficient dataflow scheduling for neural networks.☆220Updated 4 years ago
- Graph Transforms to Quantize and Retrain Deep Neural Nets in TensorFlow☆168Updated 5 years ago
- CaffePresso: An Optimized Library for Deep Learning on Embedded Accelerator-based platforms☆88Updated 5 months ago
- Training Deep Neural Networks with binary weights during propagations☆378Updated 9 years ago
- FireSim-NVDLA: NVIDIA Deep Learning Accelerator (NVDLA) Integrated with RISC-V Rocket Chip SoC Running on the Amazon FPGA Cloud☆161Updated 3 years ago
- Implementation of Ternary Weight Networks In Caffe☆63Updated 8 years ago
- Code example for the ICLR 2018 oral paper☆151Updated 6 years ago
- Open single and half precision gemm implementations☆378Updated last year
- Neural network visualizer and analyzer☆164Updated 6 years ago
- BISMO: A Scalable Bit-Serial Matrix Multiplication Overlay for Reconfigurable Computing☆134Updated 5 years ago
- Quantize weights and activations in Recurrent Neural Networks.☆94Updated 6 years ago
- Repository for the tools and non-commercial data used for the "Accelerator wall" paper.☆49Updated 6 years ago
- HLS branch of Halide☆77Updated 6 years ago
- Fast matrix multiplication for few-bit integer matrices on CPUs.☆27Updated 6 years ago
- Getting Started with Xilinx ML Suite☆337Updated 4 years ago
- ☆81Updated last month