jetpacapp / pi-gemmLinks
A Raspberry Pi GPU-accelerated implementation of the GEMM matrix-multiply function
☆88Updated 11 years ago
Alternatives and similar repositories for pi-gemm
Users that are interested in pi-gemm are comparing it to the libraries listed below
Sorting:
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆137Updated 8 years ago
- An assembler/disassembler for the QPU processors on the Raspberry Pi☆121Updated 10 years ago
- Math Kernel Library for VideoCore IV QPU☆69Updated 7 years ago
- Docker images that support different OpenCl Runtime☆33Updated 9 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision☆64Updated 6 years ago
- High Efficiency Convolution Kernel for Maxwell GPU Architecture☆135Updated 8 years ago
- Scripts to install TensorFlow on the NVIDIA Jetson TX1 Development Kit☆62Updated 8 years ago
- Original Python version of Intel® Nervana™ Graph☆215Updated 2 years ago
- AMD OpenVX Core -- a sub-module of amdovx-modules:☆147Updated 6 years ago
- ViNN - an OpenCL accelerated neural networks library☆33Updated 9 years ago
- Caffe: a fast open framework for deep learning.☆43Updated 9 years ago
- Compute Library for Deep Neural Networks (clDNN)☆576Updated 2 years ago
- kmeans☆55Updated 9 years ago
- tutorial to optimize GEMM performance on android☆51Updated 9 years ago
- Convolutional neural networks C++ framework with CPU and GPU (CUDA) backends☆182Updated 6 years ago
- CaffePresso: An Optimized Library for Deep Learning on Embedded Accelerator-based platforms☆87Updated 11 months ago
- Caffe2 implementation of Open Neural Network Exchange (ONNX)☆166Updated 7 years ago
- An OpenCL backend for torch.☆300Updated 8 years ago
- (Deprecated) hipCaffe: the HIP port of Caffe☆124Updated last year
- Easy to run kernels using OpenCL☆186Updated 5 months ago
- The NNEF Tools repository contains tools to generate and consume NNEF documents☆226Updated this week
- Python wrappers for the NVIDIA cuDNN libraries☆141Updated 8 years ago
- OpenCL Torch☆146Updated 6 years ago
- Binarized Neural Network TF training code + C matrix / eval library.☆101Updated 7 years ago
- Convolution op for Theano based on CuFFT using scikits.cuda☆52Updated 11 years ago
- Caffe: a fast open framework for deep learning. With OpenCL and CUDA support.☆86Updated 7 years ago
- A fast deep neural network library (CPU) for speech recognition☆84Updated 6 years ago
- Facebook's CUDA extensions.☆285Updated 6 years ago
- ONNX model format support for Apache MXNet☆96Updated 6 years ago
- A CUDNN minimal deep learning training code sample using LeNet.☆268Updated 2 years ago