larq / compute-engine
Highly optimized inference engine for Binarized Neural Networks
☆243Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for compute-engine
- Reference implementations of popular Binarized Neural Networks☆104Updated 3 weeks ago
- An Open-Source Library for Training Binarized Neural Networks☆707Updated 3 months ago
- Simple Training and Deployment of Fast End-to-End Binary Networks☆159Updated 2 years ago
- Graph Transforms to Quantize and Retrain Deep Neural Nets in TensorFlow☆168Updated 4 years ago
- Scailable ONNX python tools☆96Updated 3 weeks ago
- Low Precision Arithmetic Simulation in PyTorch☆265Updated 6 months ago
- A small library for managing deep learning models, hyperparameters and datasets☆23Updated 9 months ago
- Parse TFLite models (*.tflite) EASILY with Python. Check the API at https://zhenhuaw.me/tflite/docs/☆96Updated last year
- TVM integration into PyTorch☆453Updated 4 years ago
- PyTorch interface for the IPU☆177Updated last year
- PyProf2: PyTorch Profiling tool☆83Updated 4 years ago
- A library of GPU kernels for sparse matrix operations.☆249Updated 4 years ago
- Codebase associated with the PyTorch compiler tutorial☆44Updated 5 years ago
- ☆67Updated last year
- ☆398Updated this week
- Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure☆769Updated this week
- PyTorch RFCs (experimental)☆130Updated 2 months ago
- Programmable Neural Network Compression☆147Updated 2 years ago
- Training neural networks in TensorFlow 2.0 with 5x less memory☆129Updated 2 years ago
- The Tensor Algebra SuperOptimizer for Deep Learning☆692Updated last year
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX☆127Updated 3 weeks ago
- ☆303Updated last week
- torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters i…☆176Updated this week
- QKeras: a quantization deep learning library for Tensorflow Keras☆541Updated last month
- A self-contained version of the tutorial which can be easily cloned and viewed by others.☆24Updated 5 years ago
- A tensor-aware point-to-point communication primitive for machine learning☆249Updated last year
- Fast sparse deep learning on CPUs☆51Updated 2 years ago
- Customized matrix multiplication kernels☆53Updated 2 years ago
- Python bindings for NVTX☆66Updated last year