guoheng / bfloat16Links
Convert single precision float to bfloat16 (Brain Floating Point) floating-point format
☆14Updated 6 years ago
Alternatives and similar repositories for bfloat16
Users that are interested in bfloat16 are comparing it to the libraries listed below
Sorting:
- ☆29Updated 4 years ago
- Implementation of convolution layer in different flavors☆68Updated 8 years ago
- Highly optimized inference engine for Binarized Neural Networks☆251Updated this week
- Lightweight C implementation of CNNs for Embedded Systems☆62Updated 2 years ago
- A self-contained version of the tutorial which can be easily cloned and viewed by others.☆24Updated 6 years ago
- ☆61Updated 3 years ago
- Fork of upstream onnxruntime focused on supporting risc-v accelerators☆88Updated 2 years ago
- Reference implementations of popular Binarized Neural Networks☆109Updated last week
- Fast matrix multiplication for few-bit integer matrices on CPUs.☆28Updated 6 years ago
- Tutorials on Quantized Neural Network using Tensorflow Lite☆87Updated 6 years ago
- Open Source Compiler Framework using ONNX as Frontend and IR☆33Updated 3 years ago
- TFLite model analyzer & memory optimizer☆135Updated last year
- ☆37Updated 3 years ago
- BISMO: A Scalable Bit-Serial Matrix Multiplication Overlay for Reconfigurable Computing☆148Updated 6 years ago
- TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together☆64Updated 7 years ago
- npcomp - An aspirational MLIR based numpy compiler☆51Updated 5 years ago
- ☆68Updated 2 years ago
- 🧠 Benchmark facility to train networks on different datasets for PyTorch/Brevitas☆27Updated 2 years ago
- Jupyter notebook examples on image classification with quantized neural networks☆71Updated 5 years ago
- GPTPU for SC 2021☆52Updated 2 years ago
- A tool to deploy Deep Neural Networks on PULP-based SoC's☆91Updated 5 months ago
- XLA integration of Open Neural Network Exchange (ONNX)☆19Updated 7 years ago
- nnq_cnd_study stands for Neural Network Quantization & Compact Networks Design Study☆13Updated 5 years ago
- Tool for the deployment and analysis of TinyML applications on TFLM and MicroTVM backends☆33Updated this week
- Library for fast image convolution in neural networks on Intel Architecture☆30Updated 8 years ago
- Graph Transforms to Quantize and Retrain Deep Neural Nets in TensorFlow☆168Updated 6 years ago
- Matrix Operation Library for FPGA https://xilinx.github.io/gemx/☆63Updated 6 years ago
- Parse TFLite models (*.tflite) EASILY with Python. Check the API at https://zhenhuaw.me/tflite/docs/☆104Updated 11 months ago
- Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20)☆27Updated 2 years ago
- Low Precision Arithmetic Simulation in PyTorch☆289Updated last year