rodrigob / cudatemplates

The "CUDA templates" are a collection of C++ template classes and functions that provide a consistent interface to NVIDIA's "Compute Unified Device Architecture" (CUDA), hiding much of the complexity of the underlying CUDA functions from the programmer. Original author: Markus Grabner.

☆27

Alternatives and similar repositories for cudatemplates

Users who are interested in cudatemplates are comparing it to the libraries listed below.


NervanaSystems / caffe2neon
Tools to convert Caffe models to neon's serialization format
☆39 · Updated 2 years ago
Bihaqo / tf-memonger
Sublinear memory optimization for deep learning, reducing GPU memory cost to train deeper nets
☆28 · Updated 9 years ago
aaalgo / xnn
A C++ wrapper for Caffe and MXNet for making predictions
☆49 · Updated 7 years ago
masahi / nnvm-vision-demo
Demos interesting image-in, image-out networks running on both NVIDIA and AMD GPUs, with NNVM
☆49 · Updated 7 years ago
seung-lab / znn-release
Multi-core CPU implementation of deep learning for 2D and 3D sliding window convolutional networks (ConvNets).
☆94 · Updated 8 years ago
naibaf7 / caffe
Caffe: a fast open framework for deep learning. With OpenCL and CUDA support.
☆86 · Updated 6 years ago
soumith / cuda-convnet2.torch
Torch7 bindings for cuda-convnet2 kernels!
☆40 · Updated 8 years ago
yuruofeifei / mms
MXNet Model Serving
☆25 · Updated 7 years ago
hannes-brt / cudnn-python-wrappers
Python wrappers for the NVIDIA cuDNN libraries
☆140 · Updated 8 years ago
moskewcz / boda
Boda: A C++ Framework for Efficient Experiments in Computer Vision
☆64 · Updated 5 years ago
rbgirshick / DeepPyramid
Deep feature pyramids for various computer vision algorithms (DPMs, pyramid R-CNN, etc.)
☆129 · Updated 8 years ago
NervanaSystems / cuda-convnet2
Custom fork containing our own Python backend for integration into neon
☆15 · Updated 2 years ago
weichengkuo / DeepBox
Code release for DeepBox paper in ICCV 2015
☆127 · Updated 8 years ago
naibaf7 / libdnn
Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL
☆136 · Updated 8 years ago
hunterlang / mpmz
miniplaces2 deep residual network in neon
☆16 · Updated 9 years ago
Maratyszcza / caffe-nnpack
Caffe with NNPACK integration
☆58 · Updated 9 years ago
una-dinosauria / local-search-quantization
State-of-the-art method for large-scale ANN search as of Oct 2016. Presented at ECCV 16.
☆75 · Updated 7 years ago
zmonoid / mxasyn
Asynchronous one-step Q-learning implemented with MXNet
☆20 · Updated 8 years ago
ikkiChung / Prediction-Example-With-Caffe
C++ Prediction Example With Caffe
☆42 · Updated 10 years ago
facebookarchive / caffe2_bhtsne
This is a demo project that shows how you can utilize Caffe2's modular design and build a library on top of it.
☆40 · Updated 6 years ago
jaberg / DeepLearningBenchmarks
☆73 · Updated 13 years ago
panweihit / DropNeuron
DropNeuron: Simplifying the Structure of Deep Neural Networks
☆59 · Updated 9 years ago
MatthieuCourbariaux / binary-matrix-product
Fast binary matrix product on CPU
☆10 · Updated 9 years ago
Teaonly / FMD.torch
Fully convolutional MultiBox Detector (like SSD) implemented in Torch.
☆40 · Updated 8 years ago
uoguelph-mlrg / Theano-MPI
MPI Parallel framework for training deep learning models built in Theano
☆54 · Updated 7 years ago
edgarriba / DeepRosetta
A universal deep learning model converter
☆141 · Updated 8 years ago
HUJI-Deep / caffe-simnets
The SimNets Architecture's Implementation in Caffe
☆13 · Updated 8 years ago
terrychenism / caffe-windows-cudnn
Caffe with cuDNN
☆54 · Updated 9 years ago
jhjin / flattened-cnn
Flattened convolutional neural networks (1D convolution modules for Torch nn)
☆61 · Updated 9 years ago
HazyResearch / CaffeConTroll
☆76 · Updated 9 years ago