jrk / gradient-halideLinks
☆101Updated 6 years ago
Alternatives and similar repositories for gradient-halide
Users that are interested in gradient-halide are comparing it to the libraries listed below
Sorting:
- Python Binding to NVRTC☆79Updated last year
- CNNs in Halide☆23Updated 10 years ago
- ☆22Updated 7 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆137Updated 8 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision☆64Updated 6 years ago
- Example code used in the CVPR 2015 tutorial☆42Updated 10 years ago
- CuPy fused PyTorch neural networks ops☆273Updated 7 years ago
- Simple example of implementing a new Tensorflow operation and its gradient in C++.☆56Updated 6 years ago
- Efficient forward propagation for BCNNs☆49Updated 8 years ago
- an example of a CUDA extension for PyTorch using CuPy which computes the Hadamard product of two tensors☆119Updated 7 months ago
- Proof-of-Concept CNN in Halide☆22Updated 9 years ago
- An exploration of log domain "alternative floating point" for hardware ML/AI accelerators.☆399Updated 2 years ago
- ☆62Updated 7 years ago
- Examples of C extensions for PyTorch☆256Updated 2 years ago
- Programmable Neural Network Compression☆149Updated 3 years ago
- Development a customized op in TensorFlow for convolution with sparse kernel☆28Updated 6 years ago
- Binarized Neural Network TF training code + C matrix / eval library.☆101Updated 8 years ago
- Elemental code snippets written in Halide language.☆88Updated 6 years ago
- ☆44Updated 6 years ago
- Python code for the fast bilateral solver☆239Updated 5 years ago
- Case Studies for Halide performance against C++ and OpenCL☆37Updated 12 years ago
- Multi-core CPU implementation of deep learning for 2D and 3D sliding window convolutional networks (ConvNets).☆94Updated 9 years ago
- Caffe: a fast open framework for deep learning. With OpenCL and CUDA support.☆86Updated 7 years ago
- Deep learning with a multiplication budget☆47Updated 7 years ago
- Accelerating DNN Convolutional Layers with Micro-batches☆63Updated 5 years ago
- A raytracer written in PyTorch (raynet?)☆245Updated 5 years ago
- Library for fast image convolution in neural networks on Intel Architecture☆30Updated 8 years ago
- Caffe implementation of accurate low-precision neural networks☆119Updated 7 years ago
- (New version is out: https://github.com/hpi-xnor/BMXNet-v2) BMXNet: An Open-Source Binary Neural Network Implementation Based on MXNet☆351Updated 6 years ago
- This is a demo project that shows how you can utilize Caffe2's modular design and build a library on top of it.☆40Updated 6 years ago