jacqt / OpenCL-Neural-Network
OpenCL implementation of a NN and CNN
☆22 · Updated 6 years ago
Alternatives and similar repositories for OpenCL-Neural-Network:
Users interested in OpenCL-Neural-Network are comparing it to the libraries listed below.
- ONNX Parser is a tool that automatically generates OpenVX inference code (CNN) from ONNX binary model files. ☆18 · Updated 6 years ago
- Implementing CNN code in CUDA and OpenCL to evaluate its performance on NVIDIA GPUs, AMD GPUs, and an FPGA platform. ☆54 · Updated 8 years ago
- Caffe implementation of accurate low-precision neural networks ☆117 · Updated 6 years ago
- Optimizing Mobile Deep Learning on ARM GPU with TVM ☆181 · Updated 6 years ago
- Caffe Computation Graph Optimization. ☆29 · Updated 5 years ago
- CNNs in Halide ☆23 · Updated 9 years ago
- ☆62 · Updated 7 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL ☆135 · Updated 8 years ago
- Implementation of convolution layer in different flavors ☆68 · Updated 7 years ago
- I'm going to use Winograd’s minimal filtering algorithms to introduce a new class of fast algorithms for convolutional neural networks… ☆12 · Updated 7 years ago
- ☆36 · Updated 7 years ago
- Implementation of the Winograd minimal convolution algorithm on Intel Architecture ☆39 · Updated 7 years ago
- Tutorial to optimize GEMM performance on Android ☆51 · Updated 9 years ago
- Simple pruning example using Caffe ☆33 · Updated 7 years ago
- Code for testing the native float16 matrix multiplication performance on Tesla P100 and V100 GPUs based on cublasHgemm ☆34 · Updated 5 years ago
- ☆26 · Updated 8 years ago
- PyTorch -> ONNX -> TVM for autotuning ☆24 · Updated 5 years ago
- PyTorch implementation of Near-Lossless Post-Training Quantization of Deep Neural Networks via a Piecewise Linear Approximation ☆21 · Updated 5 years ago
- Test Winograd convolution written in TVM for CUDA and AMDGPU ☆41 · Updated 6 years ago
- This is an implementation of a forward and reverse computational photography pipeline ☆32 · Updated 6 years ago
- Fast and energy-efficient computation of HOG features ☆17 · Updated 6 years ago
- Efficient forward propagation for BCNNs ☆50 · Updated 7 years ago
- Caffe: a fast open framework for deep learning. ☆14 · Updated 8 years ago
- Example code used in the CVPR 2015 tutorial ☆40 · Updated 9 years ago
- Library for fast image convolution in neural networks on Intel Architecture ☆29 · Updated 7 years ago
- Fork of https://source.codeaurora.org/quic/hexagon_nn/nnlib ☆57 · Updated 2 years ago
- Flexible-GEMM conv of deepcore ☆17 · Updated 5 years ago
- ☆19 · Updated last year
- Optimized half precision GEMM assembly kernels (deprecated due to ROCm) ☆47 · Updated 7 years ago
- Accelerating the CNN convolution operation on GPUs by using memory-efficient data access patterns. ☆14 · Updated 7 years ago