hughperkins / neonCl-underconstruction
experimental port of nervana neon kernels in OpenCL
☆11Updated 8 years ago
Alternatives and similar repositories for neonCl-underconstruction:
Users that are interested in neonCl-underconstruction are comparing it to the libraries listed below
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆135Updated 7 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision☆63Updated 5 years ago
- Multi-core CPU implementation of deep learning for 2D and 3D sliding window convolutional networks (ConvNets).☆94Updated 8 years ago
- Python Binding to NVRTC☆79Updated 3 months ago
- Open single and half precision gemm implementations☆374Updated last year
- NNVM for ROCm Examples☆19Updated 7 years ago
- A portable high-level API with CUDA or OpenCL back-end☆54Updated 7 years ago
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory☆296Updated 6 years ago
- Python wrappers for the NVIDIA cuDNN libraries☆140Updated 7 years ago
- High Efficiency Convolution Kernel for Maxwell GPU Architecture☆134Updated 7 years ago
- Library for fast image convolution in neural networks on Intel Architecture☆29Updated 7 years ago
- OpenCL backend for Torch nn neural networks library☆125Updated 8 years ago
- Distributed Learning by Pair-Wise Averaging☆53Updated 7 years ago
- The repo is obsolete. Use at your own risk.☆12Updated 6 years ago
- (Deprecated) hipCaffe: the HIP port of Caffe☆124Updated 8 months ago
- Original Python version of Intel® Nervana™ Graph☆215Updated 2 years ago
- OpenCL Torch☆147Updated 6 years ago
- Partial implementation of NVIDIA® cuDNN API for Coriander, OpenCL 1.2☆22Updated 7 years ago
- Source code for ``Neural Networks with Few Multiplications'' published at ICLR 2016☆81Updated 8 years ago
- ArrayFire's Machine Learning Library.☆102Updated 6 years ago
- Convolution op for Theano based on CuFFT using scikits.cuda☆51Updated 10 years ago
- Caffe deep learning framework - optimized for Xeon Phi☆14Updated 9 years ago
- The Operator Vectorization Library, or OVL, is a python productivity library for defining high performance custom operators for the Tenso…☆68Updated 8 years ago
- Binarized Neural Network TF training code + C matrix / eval library.☆99Updated 7 years ago
- A CUDNN minimal deep learning training code sample using LeNet.☆264Updated last year
- Neural network training using iterated projections.☆89Updated 8 years ago
- Caffe: a fast open framework for deep learning.☆14Updated 9 years ago
- This repository contains the results and code for the MLPerf™ Training v0.5 benchmark.☆35Updated last year
- Developer repository for PyViennaCL. Visit http://viennacl.sourceforge.net/ for latest releases.☆32Updated 9 years ago
- ☆47Updated 4 years ago