larq / compute-engineLinks
Highly optimized inference engine for Binarized Neural Networks
☆250Updated 2 weeks ago
Alternatives and similar repositories for compute-engine
Users that are interested in compute-engine are comparing it to the libraries listed below
Sorting:
- Reference implementations of popular Binarized Neural Networks☆107Updated last month
- Simple Training and Deployment of Fast End-to-End Binary Networks☆157Updated 3 years ago
- An Open-Source Library for Training Binarized Neural Networks☆720Updated 9 months ago
- Low Precision Arithmetic Simulation in PyTorch☆277Updated last year
- Graph Transforms to Quantize and Retrain Deep Neural Nets in TensorFlow☆168Updated 5 years ago
- Scailable ONNX python tools☆97Updated 7 months ago
- ☆69Updated 2 years ago
- Parse TFLite models (*.tflite) EASILY with Python. Check the API at https://zhenhuaw.me/tflite/docs/☆100Updated 4 months ago
- tophub autotvm log collections☆69Updated 2 years ago
- Fast sparse deep learning on CPUs☆53Updated 2 years ago
- BMXNet 2: An Open-Source Binary Neural Network Implementation Based on MXNet☆230Updated 3 years ago
- PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction.☆262Updated last year
- Quantization of Convolutional Neural networks.☆243Updated 9 months ago
- TFLite model analyzer & memory optimizer☆127Updated last year
- A small library for managing deep learning models, hyperparameters and datasets☆24Updated last year
- Butterfly matrix multiplication in PyTorch☆168Updated last year
- A tensor-aware point-to-point communication primitive for machine learning☆257Updated 2 years ago
- Customized matrix multiplication kernels☆54Updated 3 years ago
- torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters i…☆178Updated 5 months ago
- Repository containing pruned models and related information☆37Updated 4 years ago
- PyProf2: PyTorch Profiling tool☆82Updated 4 years ago
- TensorFlow-nGraph bridge☆136Updated 4 years ago
- Programmable Neural Network Compression☆148Updated 3 years ago
- TVM integration into PyTorch☆452Updated 5 years ago
- A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.☆352Updated 10 months ago
- Training neural networks in TensorFlow 2.0 with 5x less memory☆131Updated 3 years ago
- [ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing☆334Updated 10 months ago
- PyTorch library to facilitate development and standardized evaluation of neural network pruning methods.☆430Updated last year
- Implementation of convolution layer in different flavors☆68Updated 7 years ago
- Efficient Sparse-Winograd Convolutional Neural Networks (ICLR 2018)☆191Updated 6 years ago