facebookresearch / deepfloat
An exploration of log domain "alternative floating point" for hardware ML/AI accelerators.
☆388 Updated last year
Related projects:
- Simple Training and Deployment of Fast End-to-End Binary Networks ☆159 Updated 2 years ago
- Caffe for Sparse Convolutional Neural Network ☆238 Updated last year
- BinaryNets in TensorFlow with XNOR GEMM op ☆154 Updated 7 years ago
- Ristretto: Caffe-based approximation of convolutional neural networks. ☆292 Updated 5 years ago
- Efficient Sparse-Winograd Convolutional Neural Networks (ICLR 2018) ☆190 Updated 5 years ago
- Explore the energy-efficient dataflow scheduling for neural networks. ☆214 Updated 4 years ago
- Winograd minimal convolution algorithm generator for convolutional neural networks. ☆600 Updated 3 years ago
- Symbolic Expression and Statement Module for new DSLs ☆205 Updated 3 years ago
- Graph Transforms to Quantize and Retrain Deep Neural Nets in TensorFlow ☆169 Updated 4 years ago
- ☆119 Updated 6 years ago
- (New version is out: https://github.com/hpi-xnor/BMXNet-v2) BMXNet: An Open-Source Binary Neural Network Implementation Based on MXNet ☆350 Updated 4 years ago
- FireSim-NVDLA: NVIDIA Deep Learning Accelerator (NVDLA) Integrated with RISC-V Rocket Chip SoC Running on the Amazon FPGA Cloud ☆160 Updated 2 years ago
- A CUDNN minimal deep learning training code sample using LeNet. ☆257 Updated last year
- Neural network visualizer and analyzer ☆164 Updated 5 years ago
- Open single and half precision gemm implementations ☆364 Updated last year
- HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Heterogeneous Computing ☆322 Updated 5 months ago
- Caffe implementation of accurate low-precision neural networks ☆118 Updated 5 years ago
- BISMO: A Scalable Bit-Serial Matrix Multiplication Overlay for Reconfigurable Computing ☆128 Updated 4 years ago
- Spatial: "Specify Parameterized Accelerators Through Inordinately Abstract Language" ☆271 Updated 3 months ago
- Training Deep Neural Networks with binary weights during propagations ☆377 Updated 8 years ago
- Binarized Neural Network TF training code + C matrix / eval library. ☆98 Updated 6 years ago
- CaffePresso: An Optimized Library for Deep Learning on Embedded Accelerator-based platforms ☆88 Updated 6 years ago
- Code example for the ICLR 2018 oral paper ☆150 Updated 6 years ago
- Implementation for Trained Ternary Network. ☆108 Updated 7 years ago
- Low Precision Arithmetic Simulation in PyTorch ☆258 Updated 4 months ago
- Reference workloads for modern deep learning methods. ☆73 Updated last year
- Quantize weights and activations in Recurrent Neural Networks. ☆95 Updated 6 years ago
- ☆51 Updated 6 years ago
- LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks ☆239 Updated 2 years ago
- Highly optimized inference engine for Binarized Neural Networks ☆242 Updated last month