hughperkins / coriander-dnn
Partial implementation of NVIDIA® cuDNN API for Coriander, OpenCL 1.2
☆22Updated 7 years ago
Alternatives and similar repositories for coriander-dnn:
Users that are interested in coriander-dnn are comparing it to the libraries listed below
- Training deep neural networks with low precision multiplications☆63Updated 9 years ago
- (Deprecated) hipCaffe: the HIP port of Caffe☆124Updated 11 months ago
- OpenCL backend for Torch nn neural networks library☆125Updated 8 years ago
- Fast matrix multiplication for few-bit integer matrices on CPUs.☆27Updated 6 years ago
- Source code for ``Neural Networks with Few Multiplications'' published at ICLR 2016☆81Updated 8 years ago
- ArrayFire's Machine Learning Library.☆103Updated 6 years ago
- A CUDA implementation of the Tsetlin Machine based on bitwise operators☆26Updated 5 years ago
- High Efficiency Convolution Kernel for Maxwell GPU Architecture☆134Updated 7 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆135Updated 7 years ago
- An ONNX backend using PlaidML☆28Updated 6 years ago
- Compiler toolkit for neuFlow.☆26Updated 11 years ago
- An OpenCL backend for torch.☆296Updated 8 years ago
- nGraph™ Backend for ONNX☆42Updated 2 years ago
- OpenCL Torch☆147Updated 6 years ago
- Distributed Learning by Pair-Wise Averaging☆53Updated 7 years ago
- A portable high-level API with CUDA or OpenCL back-end☆54Updated 7 years ago
- Implements a message passing interface (MPI) wrapper that makes it easy to do massively parallel computations inside the Torch deep-learn…☆109Updated 6 years ago
- Library for fast image convolution in neural networks on Intel Architecture☆29Updated 7 years ago
- Kernel Fusion and Runtime Compilation Based on NNVM☆70Updated 8 years ago
- Benchmarks for CNTK and other toolkits.☆44Updated 9 years ago
- This repository contains the results and code for the MLPerf™ Training v0.5 benchmark.☆35Updated last year
- OpenCL support for TensorFlow via SYCL☆65Updated 6 years ago
- ☆14Updated 6 years ago
- Caffe: a fast open framework for deep learning. With OpenCL and CUDA support.☆86Updated 6 years ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms☆12Updated 7 years ago
- Python wrappers for the NVIDIA cuDNN libraries☆140Updated 7 years ago
- Quantize weights and activations in Recurrent Neural Networks.☆94Updated 6 years ago
- ☆35Updated 7 years ago
- Reference workloads for modern deep learning methods.☆73Updated 2 years ago
- DLPack for Tensorflow☆36Updated 4 years ago