jetpacapp / pi-gemm
A Raspberry Pi GPU-accelerated implementation of the GEMM matrix-multiply function
☆88Updated 10 years ago
Alternatives and similar repositories for pi-gemm:
Users that are interested in pi-gemm are comparing it to the libraries listed below
- An assembler/disassembler for the QPU processors on the Raspberry Pi☆121Updated 9 years ago
- Math Kernel Library for VideoCore IV QPU☆68Updated 6 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆135Updated 7 years ago
- Docker images that support different OpenCl Runtime☆34Updated 8 years ago
- High Efficiency Convolution Kernel for Maxwell GPU Architecture☆134Updated 7 years ago
- Intel® Optimization for Chainer*☆82Updated 2 years ago
- Binarized Neural Network TF training code + C matrix / eval library.☆99Updated 7 years ago
- Scripts to install TensorFlow on the NVIDIA Jetson TX1 Development Kit☆62Updated 7 years ago
- Caffe: a fast open framework for deep learning.☆43Updated 8 years ago
- ViNN - an OpenCL accelerated neural networks library☆33Updated 9 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision☆64Updated 5 years ago
- Convolution op for Theano based on CuFFT using scikits.cuda☆51Updated 10 years ago
- (Deprecated) hipCaffe: the HIP port of Caffe☆124Updated 10 months ago
- tutorial to optimize GEMM performance on android☆51Updated 9 years ago
- kmeans☆54Updated 8 years ago
- Python Binding to NVRTC☆79Updated 5 months ago
- ChainerMN: Scalable distributed deep learning with Chainer☆206Updated 5 years ago
- Original Python version of Intel® Nervana™ Graph☆215Updated 2 years ago
- CaffePresso: An Optimized Library for Deep Learning on Embedded Accelerator-based platforms☆88Updated 5 months ago
- Accelerating DNN Convolutional Layers with Micro-batches☆63Updated 4 years ago
- Convolutional neural networks C++ framework with CPU and GPU (CUDA) backends☆178Updated 6 years ago
- Python wrappers for the NVIDIA cuDNN libraries☆140Updated 7 years ago
- AMD OpenVX Core -- a sub-module of amdovx-modules:☆149Updated 6 years ago
- Compiler toolkit for neuFlow.☆26Updated 11 years ago
- Easy to run kernels using OpenCL☆185Updated 7 years ago
- Example code used in the CVPR 2015 tutorial☆40Updated 9 years ago
- Raspberry Pi Projects☆84Updated 8 years ago
- ArrayFire's Machine Learning Library.☆103Updated 6 years ago
- Node based Gui for creating caffe networks☆103Updated 4 years ago
- experimental binary net implementation in chainer☆101Updated 9 years ago