5antelope / H-PipeLinks

☆9

Alternatives and similar repositories for H-Pipe

Users that are interested in H-Pipe are comparing it to the libraries listed below

Sorting:

ravi-teja-mullapudi / Halide-NN
CNNs in Halide
☆23Updated 9 years ago
hyln9 / GCNGEMM
Optimized half precision gemm assembly kernels (deprecated due to ROCm)
☆47Updated 8 years ago
naibaf7 / libdnn
Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL
☆136Updated 8 years ago
gplhegde / caffepresso
CaffePresso: An Optimized Library for Deep Learning on Embedded Accelerator-based platforms
☆87Updated 9 months ago
XiuYuLi / flexible-gemm
flexible-gemm conv of deepcore
☆17Updated 5 years ago
ColfaxResearch / FALCON
Library for fast image convolution in neural networks on Intel Architecture
☆30Updated 8 years ago
XiaoMi / nnlib
Fork of https://source.codeaurora.org/quic/hexagon_nn/nnlib
☆58Updated 2 years ago
bondhugula / polymage-benchmarks
Base code and optimized code for the benchmarks used in the PolyMage paper published at ASPLOS 2015
☆19Updated 9 years ago
ppwwyyxx / haDNN
Proof-of-Concept CNN in Halide
☆22Updated 8 years ago
strin / gemm-android
tutorial to optimize GEMM performance on android
☆51Updated 9 years ago
merrymercy / tvm-mali
Optimizing Mobile Deep Learning on ARM GPU with TVM
☆181Updated 6 years ago
jacqt / OpenCL-Neural-Network
OpenCL implementation of a NN and CNN
☆22Updated 7 years ago
tobegit3hub / tftvm
TensorFlow and TVM integration
☆37Updated 5 years ago
lcskrishna / onnx-parser
ONNX Parser is a tool that automatically generates openvx inference code (CNN) from onnx binary model files.
☆18Updated 6 years ago
mz24cn / clnet
OpenCL for Nets - A Deep Learning Framework based on OpenCL, written by C++. Supports popular MLP, RNN(LSTM), CNN(ResNet). Friendly debug…
☆68Updated 6 years ago
xzhai1 / latte
CMU 15-418/618 Final Project: Implementing Fully Convolutional Network using Halide and evaluate against Caffe version
☆7Updated 9 years ago
CSshengxy / MEC
ICML2017 MEC: Memory-efficient Convolution for Deep Neural Network C++实现(非官方)
☆17Updated 6 years ago
zhiqi-0 / RDMA-MXNet-ps-lite
RDMA Optimization on MXNet
☆14Updated 7 years ago
victoroliv2 / halide-casestudies
Case Studies for Halide performance against C++ and OpenCL
☆37Updated 11 years ago
hipacc / hipacc
A domain-specific language and compiler for image processing
☆76Updated 4 years ago
zhaoweicai / hwgq
Caffe implementation of accurate low-precision neural networks
☆117Updated 6 years ago
dmlc / HalideIR
Symbolic Expression and Statement Module for new DSLs
☆205Updated 4 years ago
spcl / ucudnn
Accelerating DNN Convolutional Layers with Micro-batches
☆63Updated 5 years ago
lankas / SqueezeNet
☆39Updated 8 years ago
intel / light-model-transformer
☆72Updated 8 months ago
moskewcz / boda
Boda: A C++ Framework for Efficient Experiments in Computer Vision
☆64Updated 5 years ago
codeplaysoftware / visioncpp
A machine vision library written in SYCL and C++ that shows performance-portable implementation of graph algorithms
☆162Updated last year
codeplaysoftware / portDNN
portDNN is a library implementing neural network algorithms written using SYCL
☆113Updated last year
jeremyfix / FFTConvolution
Some C++ codes for computing a 1D and 2D convolution product using the FFT implemented with the GSL or FFTW
☆58Updated 12 years ago
masahi / tvm-winograd
Test winograd convolution written in TVM for CUDA and AMDGPU
☆41Updated 6 years ago