hughperkins / coriander-dnn
Partial implementation of NVIDIA® cuDNN API for Coriander, OpenCL 1.2
☆22Updated 7 years ago
Alternatives and similar repositories for coriander-dnn:
Users that are interested in coriander-dnn are comparing it to the libraries listed below
- Training deep neural networks with low precision multiplications☆63Updated 9 years ago
- ArrayFire's Machine Learning Library.☆103Updated 6 years ago
- High Efficiency Convolution Kernel for Maxwell GPU Architecture☆134Updated 7 years ago
- image to column☆30Updated 10 years ago
- (Deprecated) hipCaffe: the HIP port of Caffe☆124Updated 8 months ago
- OpenCL backend for Torch nn neural networks library☆125Updated 8 years ago
- Fast matrix multiplication for few-bit integer matrices on CPUs.☆27Updated 5 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆135Updated 7 years ago
- Kernel Fusion and Runtime Compilation Based on NNVM☆70Updated 8 years ago
- Source code for ``Neural Networks with Few Multiplications'' published at ICLR 2016☆81Updated 8 years ago
- nGraph™ Backend for ONNX☆42Updated 2 years ago
- PyProf2: PyTorch Profiling tool☆82Updated 4 years ago
- ☆14Updated 5 years ago
- Customized matrix multiplication kernels☆53Updated 2 years ago
- OpenCL support for TensorFlow via SYCL☆65Updated 6 years ago
- Convolutional neural networks C++ framework with CPU and GPU (CUDA) backends☆178Updated 6 years ago
- Training neural networks with back-prop, feedback-alignment and direct feedback-alignment☆100Updated 7 years ago
- An OpenCL Torch Utility Library☆59Updated 8 years ago
- ☆47Updated 4 years ago
- Reference workloads for modern deep learning methods.☆73Updated 2 years ago
- A fast deep neural network library (CPU) for speech recognition☆84Updated 5 years ago
- An ONNX backend using PlaidML☆28Updated 6 years ago
- This is not the official kaldi repository. It is better to fork https://github.com/kaldi-asr/kaldi or https://github.com/vimalmanohar/kal…☆33Updated 9 years ago
- An exploration of log domain "alternative floating point" for hardware ML/AI accelerators.☆391Updated last year
- Quantize weights and activations in Recurrent Neural Networks.☆94Updated 6 years ago
- Deep learning with a multiplication budget☆47Updated 6 years ago
- Deep Reinforcement Learning Agent☆19Updated 9 years ago
- Binarized Neural Network TF training code + C matrix / eval library.☆99Updated 7 years ago
- GEMM and Winograd based convolutions using CUTLASS☆26Updated 4 years ago
- Research Blog☆24Updated 7 years ago