rodrigob / cudatemplates
The "CUDA templates" are a collection of C++ template classes and functions which provide a consistent interface to NVIDIA's "Compute Unified Device Architecture" (CUDA), hiding much of the complexity of the underlying CUDA functions from the programmer (see the brief overview of the main features). Original author: Markus Grabner
☆27Updated 13 years ago
Alternatives and similar repositories for cudatemplates:
Users that are interested in cudatemplates are comparing it to the libraries listed below
- Fast binary matrix product on CPU☆10Updated 9 years ago
- Demos interesting image-in, image-out networks running on both NVIDIA and AMD GPUs, with NNVM☆49Updated 7 years ago
- a C++ wrapper of Caffe and mxnet to make predictions☆49Updated 7 years ago
- FastHOG library that has been fixed to work with CUDA 5.x on Ubuntu 12.04☆20Updated 11 years ago
- detection-developing☆20Updated 10 years ago
- This is a demo project that shows how you can utilize Caffe2's modular design and build a library on top of it.☆40Updated 6 years ago
- Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets☆28Updated 8 years ago
- SqueezeNet Generator☆31Updated 6 years ago
- Caffe with NNPACK integration☆58Updated 9 years ago
- Implementation of HoG feature extractor that uses SSE instructions.☆45Updated 11 years ago
- Darwin: A Framework for Machine Learning Research and Development☆55Updated 3 years ago
- caffe with cudnn☆54Updated 9 years ago
- Torch7 bindings for cuda-convnet2 kernels!☆40Updated 8 years ago
- Tools to convert Caffe models to neon's serialization format☆39Updated 2 years ago
- OpenCV matrices to HDF5 datasets and vice versa☆14Updated 12 years ago
- MXNet Model Serving☆25Updated 7 years ago
- Deep neural network framework for multiple GPUs☆33Updated 9 years ago
- Implementation of Residual Learning with Stochastic Depth http://arxiv.org/pdf/1603.09382v2.pdf☆10Updated 8 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision☆64Updated 5 years ago
- Caffe: a fast open framework for deep learning.☆11Updated 8 years ago
- VGG16 architecture with BatchNorm☆14Updated 7 years ago
- DelugeNets: Deep Networks with Efficient and Flexible Cross-layer Information Inflows☆26Updated 8 years ago
- Simple example of implementing a new Tensorflow operation and its gradient in C++.☆56Updated 6 years ago
- Object Segmentation (NIPS 2014)☆25Updated 9 years ago
- Mxnet Implementation of Google's MobileNets v2☆11Updated 7 years ago
- kmeans implementation☆11Updated 9 years ago
- Windows port for caffe using cmake. Deprecated. Use Microsoft's port instead.☆15Updated 10 years ago
- ☆13Updated 7 years ago
- Symbolic differentiation engine for optimization-based machine learning models.☆42Updated 7 years ago
- Standalone C TH library☆58Updated 7 years ago