larq / compute-engineLinks
Highly optimized inference engine for Binarized Neural Networks
☆251Updated this week
Alternatives and similar repositories for compute-engine
Users that are interested in compute-engine are comparing it to the libraries listed below
Sorting:
- Reference implementations of popular Binarized Neural Networks☆108Updated last week
- An Open-Source Library for Training Binarized Neural Networks☆721Updated last year
- Simple Training and Deployment of Fast End-to-End Binary Networks☆157Updated 3 years ago
- Parse TFLite models (*.tflite) EASILY with Python. Check the API at https://zhenhuaw.me/tflite/docs/☆101Updated 7 months ago
- Graph Transforms to Quantize and Retrain Deep Neural Nets in TensorFlow☆168Updated 5 years ago
- ☆69Updated 2 years ago
- PyTorch interface for the IPU☆181Updated last year
- TFLite model analyzer & memory optimizer☆131Updated last year
- ☆29Updated 4 years ago
- Implementation of convolution layer in different flavors☆68Updated 7 years ago
- Low Precision Arithmetic Simulation in PyTorch☆284Updated last year
- Scailable ONNX python tools☆97Updated 10 months ago
- QKeras: a quantization deep learning library for Tensorflow Keras☆570Updated 3 months ago
- Codebase associated with the PyTorch compiler tutorial☆46Updated 6 years ago
- A self-contained version of the tutorial which can be easily cloned and viewed by others.☆24Updated 6 years ago
- XLA integration of Open Neural Network Exchange (ONNX)☆19Updated 7 years ago
- An exploration of log domain "alternative floating point" for hardware ML/AI accelerators.☆394Updated 2 years ago
- Quantized Neural Networks - networks trained for inference at arbitrary low precision.☆147Updated 7 years ago
- Efficient Sparse-Winograd Convolutional Neural Networks (ICLR 2018)☆193Updated 6 years ago
- ☆313Updated last month
- tophub autotvm log collections☆69Updated 2 years ago
- The official, proof-of-concept C++ implementation of PocketNN.☆34Updated last year
- Repository containing pruned models and related information☆37Updated 4 years ago
- Programmable Neural Network Compression☆149Updated 3 years ago
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX☆157Updated this week
- TensorFlow-nGraph bridge☆136Updated 4 years ago
- TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together☆64Updated 7 years ago
- Fast matrix multiplication for few-bit integer matrices on CPUs.☆28Updated 6 years ago
- PyTorch C++ API Documentation☆235Updated 2 weeks ago
- Fast sparse deep learning on CPUs☆55Updated 2 years ago