rodrigob / cudatemplates
The "CUDA templates" are a collection of C++ template classes and functions which provide a consistent interface to NVIDIA's "Compute Unified Device Architecture" (CUDA), hiding much of the complexity of the underlying CUDA functions from the programmer (see the brief overview of the main features). Original author: Markus Grabner
☆27Updated 13 years ago
Alternatives and similar repositories for cudatemplates:
Users that are interested in cudatemplates are comparing it to the libraries listed below
- Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets☆28Updated 9 years ago
- Fast binary matrix product on CPU☆10Updated 9 years ago
- Demos interesting image-in, image-out networks running on both NVIDIA and AMD GPUs, with NNVM☆49Updated 7 years ago
- Caffe with NNPACK integration☆58Updated 9 years ago
- Tools to convert Caffe models to neon's serialization format☆39Updated 2 years ago
- detection-developing☆20Updated 10 years ago
- SqueezeNet Generator☆31Updated 7 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision☆64Updated 5 years ago
- a C++ wrapper of Caffe and mxnet to make predictions☆49Updated 7 years ago
- Caffe: a fast open framework for deep learning.☆11Updated 8 years ago
- Torch7 bindings for cuda-convnet2 kernels!☆40Updated 8 years ago
- MXNet Model Serving☆25Updated 7 years ago
- ☆13Updated 8 years ago
- FastHOG library that has been fixed to work with CUDA 5.x on Ubuntu 12.04☆20Updated 11 years ago
- Collective Knowledge repository for NVIDIA's TensorRT☆37Updated 3 years ago
- Deep neural network framework (C/C++/CUDA).☆31Updated 9 years ago
- Custom fork containing our own python backend for integration into neon☆15Updated 2 years ago
- DelugeNets: Deep Networks with Efficient and Flexible Cross-layer Information Inflows☆26Updated 8 years ago
- ☆15Updated 7 years ago
- ☆16Updated 7 years ago
- miniplaces2 deep residual network in neon☆16Updated 9 years ago
- This is a demo project that shows how you can utilize Caffe2's modular design and build a library on top of it.☆40Updated 6 years ago
- Windows port for caffe using cmake. Deprecated. Use Microsoft's port instead.☆15Updated 10 years ago
- Deep neural network framework for multiple GPUs☆33Updated 9 years ago
- caffe with cudnn☆54Updated 9 years ago
- OpenCL Inference Engine for pytorch☆51Updated 7 years ago
- Multi-core CPU implementation of deep learning for 2D and 3D sliding window convolutional networks (ConvNets).☆94Updated 8 years ago
- Keras implementation of the article "Solving internal covariate shift in deep learning with linked neurons"☆13Updated 7 years ago
- Mxnet Implementation of Google's MobileNets v2☆11Updated 7 years ago
- Pre-trained models for MixDCNN paper☆36Updated 6 years ago