rodrigob / cudatemplates
The "CUDA templates" are a collection of C++ template classes and functions which provide a consistent interface to NVIDIA's "Compute Unified Device Architecture" (CUDA), hiding much of the complexity of the underlying CUDA functions from the programmer (see the brief overview of the main features). Original author: Markus Grabner
☆27Updated 13 years ago
Alternatives and similar repositories for cudatemplates:
Users that are interested in cudatemplates are comparing it to the libraries listed below
- FastHOG library that has been fixed to work with CUDA 5.x on Ubuntu 12.04☆20Updated 11 years ago
- a C++ wrapper of Caffe and mxnet to make predictions☆49Updated 7 years ago
- Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets☆28Updated 8 years ago
- Demos interesting image-in, image-out networks running on both NVIDIA and AMD GPUs, with NNVM☆49Updated 7 years ago
- Tools to convert Caffe models to neon's serialization format☆39Updated 2 years ago
- Fast binary matrix product on CPU☆10Updated 9 years ago
- Caffe with NNPACK integration☆58Updated 9 years ago
- This is a demo project that shows how you can utilize Caffe2's modular design and build a library on top of it.☆40Updated 6 years ago
- Caffe: a fast open framework for deep learning.☆11Updated 8 years ago
- Deep neural network framework (C/C++/CUDA).☆31Updated 9 years ago
- Implementation of HoG feature extractor that uses SSE instructions.☆45Updated 11 years ago
- MXNet Model Serving☆25Updated 7 years ago
- SqueezeNet Generator☆31Updated 6 years ago
- DelugeNets: Deep Networks with Efficient and Flexible Cross-layer Information Inflows☆26Updated 8 years ago
- Custom fork containing our own python backend for integration into neon☆15Updated 2 years ago
- Deep neural network framework for multiple GPUs☆33Updated 9 years ago
- Caffe: a fast open framework for deep learning.☆14Updated 8 years ago
- miniplaces2 deep residual network in neon☆16Updated 9 years ago
- Darwin: A Framework for Machine Learning Research and Development☆55Updated 3 years ago
- a fork of the densecrf package implementing alternative inference scheme☆28Updated 9 years ago
- Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning☆33Updated 8 years ago
- caffe with cudnn☆54Updated 9 years ago
- Implementation of Residual Learning with Stochastic Depth http://arxiv.org/pdf/1603.09382v2.pdf☆10Updated 8 years ago
- Caffe model zoo and scripts☆24Updated 8 years ago
- Faster R-CNN, an MXNet implementation with distributed implementation and data parallelization.☆36Updated 8 years ago
- ☆28Updated 8 years ago
- Implementation of the CVPR 2015 paper: Learning to propose objects☆90Updated 9 years ago
- Collective Knowledge repository for NVIDIA's TensorRT☆37Updated 3 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision☆64Updated 5 years ago
- ☆13Updated 7 years ago