rodrigob / cudatemplates
The "CUDA templates" are a collection of C++ template classes and functions which provide a consistent interface to NVIDIA's "Compute Unified Device Architecture" (CUDA), hiding much of the complexity of the underlying CUDA functions from the programmer (see the brief overview of the main features). Original author: Markus Grabner
☆27Updated 13 years ago
Alternatives and similar repositories for cudatemplates:
Users that are interested in cudatemplates are comparing it to the libraries listed below
- Demos interesting image-in, image-out networks running on both NVIDIA and AMD GPUs, with NNVM☆49Updated 7 years ago
- This is a demo project that shows how you can utilize Caffe2's modular design and build a library on top of it.☆40Updated 5 years ago
- Deep neural network framework for multiple GPUs☆33Updated 9 years ago
- Tools to convert Caffe models to neon's serialization format☆39Updated 2 years ago
- Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets☆28Updated 8 years ago
- FastHOG library that has been fixed to work with CUDA 5.x on Ubuntu 12.04☆20Updated 11 years ago
- Caffe with NNPACK integration☆58Updated 8 years ago
- a C++ wrapper of Caffe and mxnet to make predictions☆49Updated 7 years ago
- detection-developing☆20Updated 10 years ago
- VGG16 architecture with BatchNorm☆14Updated 7 years ago
- some new implementation of caffe☆24Updated 8 years ago
- Custom fork containing our own python backend for integration into neon☆15Updated 2 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision☆64Updated 5 years ago
- Caffe: a fast open framework for deep learning.☆14Updated 8 years ago
- SqueezeNet Generator☆31Updated 6 years ago
- Object Segmentation (NIPS 2014)☆25Updated 9 years ago
- DelugeNets: Deep Networks with Efficient and Flexible Cross-layer Information Inflows☆26Updated 8 years ago
- Torch7 bindings for cuda-convnet2 kernels!☆40Updated 8 years ago
- ☆23Updated 8 years ago
- Direct C++ Interface to PyTorch☆80Updated 6 years ago
- Fast binary matrix product on CPU☆10Updated 9 years ago
- MXNet Model Serving☆25Updated 7 years ago
- Implementation of the CVPR 2015 paper: Learning to propose objects☆90Updated 9 years ago
- C++ Prediction Example With Caffe☆42Updated 10 years ago
- Deep neural network framework (C/C++/CUDA).☆31Updated 9 years ago
- caffe with cudnn☆54Updated 9 years ago
- kmeans implementation☆11Updated 9 years ago
- ☆28Updated 8 years ago
- Code release for DeepBox paper in ICCV 2015☆127Updated 8 years ago
- ☆42Updated 5 years ago