guoheng / bfloat16
Convert single precision float to bfloat16 (Brain Floating Point) floating-point format
☆14 Updated 5 years ago
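
For context, bfloat16 keeps the sign bit, all 8 exponent bits, and the top 7 mantissa bits of an IEEE 754 single-precision value, so a conversion amounts to dropping the low 16 bits with some rounding rule. The Python sketch below is not taken from the repository; it shows one common approach, assuming round-to-nearest-even and NaN preservation. The function names are hypothetical.

```python
import struct


def float32_to_bfloat16_bits(x: float) -> int:
    """Return the 16-bit bfloat16 pattern for x (hypothetical helper).

    Assumes round-to-nearest-even; the repository may round differently.
    Raises OverflowError if x is finite but too large for float32.
    """
    # Reinterpret x as the 32-bit pattern of a single-precision float.
    bits = struct.unpack("<I", struct.pack("<f", x))[0]

    # NaN: exponent all ones and a non-zero mantissa. Truncate and force a
    # mantissa bit so rounding cannot turn the NaN into infinity.
    if (bits & 0x7F800000) == 0x7F800000 and (bits & 0x007FFFFF) != 0:
        return (bits >> 16) | 0x0040

    # Round to nearest, ties to even: add 0x7FFF plus the lowest kept bit,
    # then drop the low 16 bits.
    rounding_bias = 0x7FFF + ((bits >> 16) & 1)
    return ((bits + rounding_bias) >> 16) & 0xFFFF


def bfloat16_bits_to_float32(b: int) -> float:
    """Widen a bfloat16 bit pattern back to a float (hypothetical helper)."""
    return struct.unpack("<f", struct.pack("<I", (b & 0xFFFF) << 16))[0]
```

For example, `float32_to_bfloat16_bits(1.0)` gives `0x3F80`, and widening the pattern for pi back with `bfloat16_bits_to_float32` gives `3.140625`, which shows the precision lost by keeping only 7 mantissa bits.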
Alternatives and similar repositories for bfloat16
Users who are interested in bfloat16 are comparing it to the libraries listed below.
- ☆29 Updated 4 years ago
- Implementation of convolution layer in different flavors ☆68 Updated 7 years ago
- Lightweight C implementation of CNNs for Embedded Systems ☆62 Updated 2 years ago
- Example code and instructions on getting Tensorflow Lite running on a Xilinx Zynq ☆49 Updated 7 years ago
- Fast matrix multiplication for few-bit integer matrices on CPUs. ☆28 Updated 6 years ago
- ☆59 Updated 3 years ago
- Open Source Compiler Framework using ONNX as Frontend and IR ☆32 Updated 2 years ago
- Highly optimized inference engine for Binarized Neural Networks ☆251 Updated last week
- Fork of upstream onnxruntime focused on supporting risc-v accelerators ☆87 Updated 2 years ago
- TFLite model analyzer & memory optimizer ☆129 Updated last year
- Binary Neural Network on IceStick FPGA. ☆52 Updated 7 years ago
- BISMO: A Scalable Bit-Serial Matrix Multiplication Overlay for Reconfigurable Computing ☆141 Updated 5 years ago
- ☆37 Updated 3 years ago
- A tool to deploy Deep Neural Networks on PULP-based SoC's ☆83 Updated last week
- Jupyter notebook examples on image classification with quantized neural networks ☆69 Updated 5 years ago
- Machine learning inference library for ARC EM and HS Processors ☆30 Updated 6 months ago
- An exploration of log domain "alternative floating point" for hardware ML/AI accelerators. ☆394 Updated 2 years ago
- Open Source Specialized Computing Stack for Accelerating Deep Neural Networks. ☆219 Updated 6 years ago
- GUINNESS: A GUI-based binarized deep Neural NEtwork SyntheSizer toward an FPGA ☆181 Updated 6 years ago
- umbrella project helps you to build up onnc from scratch ☆24 Updated 3 years ago
- implementing a Recurrent Neural Network with binarized weight format on FPGA ☆22 Updated 7 years ago
- Tutorials on Quantized Neural Network using Tensorflow Lite ☆87 Updated 6 years ago
- ☆83 Updated last year
- Matrix Operation Library for FPGA https://xilinx.github.io/gemx/ ☆63 Updated 5 years ago
- The official, proof-of-concept C++ implementation of PocketNN. ☆34 Updated last year
- Linear model training using stochastic gradient descent (SGD) on PYNQ with full to low precision. ☆55 Updated 7 years ago
- NEural Minimizer for pytOrch ☆44 Updated last year
- npcomp - An aspirational MLIR based numpy compiler ☆51 Updated 5 years ago
- FireSim-NVDLA: NVIDIA Deep Learning Accelerator (NVDLA) Integrated with RISC-V Rocket Chip SoC Running on the Amazon FPGA Cloud ☆163 Updated 3 years ago
- This is a collection of works on neural networks and neural accelerators. ☆40 Updated 6 years ago