ppwwyyxx / haDNNLinks

Proof-of-Concept CNN in Halide

☆22

Alternatives and similar repositories for haDNN

Users that are interested in haDNN are comparing it to the libraries listed below

Sorting:

ravi-teja-mullapudi / Halide-NN
CNNs in Halide
☆23Updated 9 years ago
strin / gemm-android
tutorial to optimize GEMM performance on android
☆51Updated 9 years ago
moskewcz / boda
Boda: A C++ Framework for Efficient Experiments in Computer Vision
☆64Updated 5 years ago
naibaf7 / libdnn
Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL
☆136Updated 8 years ago
MatthieuCourbariaux / deep-learning-multipliers
Training deep neural networks with low precision multiplications
☆63Updated 10 years ago
hyln9 / GCNGEMM
Optimized half precision gemm assembly kernels (deprecated due to ROCm)
☆47Updated 8 years ago
eBay / maxDNN
High Efficiency Convolution Kernel for Maxwell GPU Architecture
☆134Updated 8 years ago
gplhegde / caffepresso
CaffePresso: An Optimized Library for Deep Learning on Embedded Accelerator-based platforms
☆87Updated 9 months ago
codekansas / tinier-nn
Binarized Neural Network TF training code + C matrix / eval library.
☆101Updated 7 years ago
masahi / nnvm-vision-demo
Demos interesting image-in, image-out networks running on both NVIDIA and AMD GPUs, with NNVM
☆49Updated 7 years ago
ColfaxResearch / FALCON
Library for fast image convolution in neural networks on Intel Architecture
☆31Updated 8 years ago
IntelLabs / SkimCaffe
Caffe for Sparse Convolutional Neural Network
☆238Updated 2 years ago
cc-hpc-itwm / TensorQuant
☆47Updated 5 years ago
bondhugula / polymage-benchmarks
Base code and optimized code for the benchmarks used in the PolyMage paper published at ASPLOS 2015
☆19Updated 9 years ago
jrk / gradient-halide
☆102Updated 5 years ago
masahi / tvm-winograd
Test winograd convolution written in TVM for CUDA and AMDGPU
☆41Updated 6 years ago
ajtulloch / caffe
Caffe: a fast open framework for deep learning.
☆14Updated 9 years ago
dmlc / HalideIR
Symbolic Expression and Statement Module for new DSLs
☆205Updated 4 years ago
rodrigob / cudatemplates
The "CUDA templates" are a collection of C++ template classes and functions which provide a consistent interface to NVIDIA's "Compute Uni…
☆27Updated 13 years ago
linnanwang / BLASX
a heterogeneous multiGPU level-3 BLAS library
☆45Updated 5 years ago
zhaoweicai / hwgq
Caffe implementation of accurate low-precision neural networks
☆117Updated 6 years ago
aaalgo / xnn
a C++ wrapper of Caffe and mxnet to make predictions
☆49Updated 7 years ago
naibaf7 / caffe
Caffe: a fast open framework for deep learning. With OpenCL and CUDA support.
☆86Updated 6 years ago
dmlc / nnvm-fusion
Kernel Fusion and Runtime Compilation Based on NNVM
☆70Updated 8 years ago
zhxfl / CUDA-CNN
CNN accelerated by cuda. Test on mnist and finilly get 99.76%
☆186Updated 7 years ago
Orion34-lanbo / tvm-batch-matmul-example
☆24Updated 7 years ago
MichalBusta / caffe
Ristretto: Caffe-based approximation of convolutional neural networks.
☆30Updated 6 years ago
Maratyszcza / caffe-nnpack
Caffe with NNPACK integration
☆58Updated 9 years ago
zhangxinqian / example-of-nnvm-in-cpp
An Example of MXNet Models Comilation and Deployment with NNVM in C++
☆16Updated 7 years ago
mlzxy / caffe-fpga-opencl
☆35Updated 8 years ago