jetpacapp / pi-gemm
A Raspberry Pi GPU-accelerated implementation of the GEMM matrix-multiply function
☆88 Updated 10 years ago
Alternatives and similar repositories for pi-gemm:
Users who are interested in pi-gemm are comparing it to the libraries listed below
- An assembler/disassembler for the QPU processors on the Raspberry Pi ☆121 Updated 9 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL ☆135 Updated 7 years ago
- Math Kernel Library for VideoCore IV QPU ☆68 Updated 6 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision ☆63 Updated 5 years ago
- Python wrappers for the NVIDIA cuDNN libraries ☆140 Updated 7 years ago
- Library for fast image convolution in neural networks on Intel Architecture ☆29 Updated 7 years ago
- ViNN - an OpenCL accelerated neural networks library ☆33 Updated 9 years ago
- Binarized Neural Network TF training code + C matrix / eval library. ☆99 Updated 7 years ago
- CaffePresso: An Optimized Library for Deep Learning on Embedded Accelerator-based platforms ☆88 Updated 4 months ago
- Docker images that support different OpenCL runtimes ☆34 Updated 8 years ago
- Original Python version of Intel® Nervana™ Graph ☆215 Updated 2 years ago
- AMD OpenVX Core -- a sub-module of amdovx-modules: ☆149 Updated 6 years ago
- Compiler for the VC4CL OpenCL implementation ☆118 Updated last year
- DeepDetect performance sheet ☆93 Updated 5 years ago
- (Deprecated) hipCaffe: the HIP port of Caffe ☆124 Updated 9 months ago
- Tutorial to optimize GEMM performance on Android ☆51 Updated 8 years ago
- High Efficiency Convolution Kernel for Maxwell GPU Architecture ☆134 Updated 7 years ago
- Easy to run kernels using OpenCL ☆183 Updated 7 years ago
- Python Binding to NVRTC ☆79 Updated 4 months ago
- RenderScript based implementation of Convolutional Neural Networks for Android phones ☆52 Updated 7 years ago
- Caffe: a fast open framework for deep learning. ☆43 Updated 8 years ago
- Scripts to install TensorFlow on the NVIDIA Jetson TX1 Development Kit ☆62 Updated 7 years ago
- Raspberry Pi Projects ☆83 Updated 8 years ago
- Accelerating DNN Convolutional Layers with Micro-batches ☆64 Updated 4 years ago
- kmeans ☆54 Updated 8 years ago
- Collective Knowledge repository for NVIDIA's TensorRT ☆37 Updated 3 years ago
- Intel® Optimization for Chainer* ☆82 Updated 2 years ago
- experimental binary net implementation in chainer ☆101 Updated 9 years ago
- Tools to convert Caffe models to neon's serialization format ☆39 Updated 2 years ago
- Open single and half precision gemm implementations ☆375 Updated last year