jrk / gradient-halide
☆102 · Updated 5 years ago
Alternatives and similar repositories for gradient-halide
Users interested in gradient-halide are comparing it to the libraries listed below:
- Python Binding to NVRTC ☆79 · Updated 6 months ago
- An example of a CUDA extension for PyTorch using CuPy that computes the Hadamard product of two tensors ☆118 · Updated 3 months ago
- Simple example of implementing a new TensorFlow operation and its gradient in C++. ☆56 · Updated 6 years ago
- CNNs in Halide ☆23 · Updated 9 years ago
- ☆22 · Updated 6 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision ☆64 · Updated 5 years ago
- This is a demo project that shows how you can utilize Caffe2's modular design and build a library on top of it. ☆40 · Updated 6 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL ☆135 · Updated 8 years ago
- Examples of C extensions for PyTorch ☆255 · Updated 2 years ago
- Efficient forward propagation for BCNNs ☆50 · Updated 7 years ago
- CuPy fused PyTorch neural networks ops ☆273 · Updated 7 years ago
- Accelerating DNN Convolutional Layers with Micro-batches ☆63 · Updated 4 years ago
- Example code used in the CVPR 2015 tutorial ☆40 · Updated 9 years ago
- Proof-of-Concept CNN in Halide ☆22 · Updated 8 years ago
- Multi-core CPU implementation of deep learning for 2D and 3D sliding window convolutional networks (ConvNets). ☆94 · Updated 8 years ago
- Python code for the fast bilateral solver ☆235 · Updated 4 years ago
- Elemental code snippets written in the Halide language. ☆88 · Updated 5 years ago
- An implementation of Deep Joint Demosaicking and Denoising - SIGGRAPH Asia 2016 ☆114 · Updated last year
- Photographic Image Synthesis with Cascaded Refinement Networks - PyTorch Implementation ☆64 · Updated 7 years ago
- Port of permutohedral bilateral filtering to TensorFlow as an op ☆22 · Updated 8 years ago
- ☆47 · Updated 5 years ago
- ☆62 · Updated 7 years ago
- Case Studies for Halide performance against C++ and OpenCL ☆37 · Updated 11 years ago
- Implementation of Bilateral Space Video Segmentation [Maerki et al., CVPR 2016] ☆65 · Updated 6 years ago
- Tutorial to optimize GEMM performance on Android ☆51 · Updated 9 years ago
- A portable high-level API with CUDA or OpenCL back-end ☆54 · Updated 7 years ago
- Caffe implementation of accurate low-precision neural networks ☆117 · Updated 6 years ago
- ☆138 · Updated 8 years ago
- Binarized Neural Network TF training code + C matrix / eval library. ☆99 · Updated 7 years ago
- Torch FFI-bindings for NNPACK ☆30 · Updated 7 years ago