rodrigob / cudatemplates
The "CUDA templates" are a collection of C++ template classes and functions which provide a consistent interface to NVIDIA's "Compute Unified Device Architecture" (CUDA), hiding much of the complexity of the underlying CUDA functions from the programmer (see the brief overview of the main features). Original author: Markus Grabner
☆27Updated 13 years ago
Related projects ⓘ
Alternatives and complementary repositories for cudatemplates
- Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets☆29Updated 8 years ago
- Tools to convert Caffe models to neon's serialization format☆39Updated last year
- This is a demo project that shows how you can utilize Caffe2's modular design and build a library on top of it.☆40Updated 5 years ago
- detection-developing☆20Updated 10 years ago
- Fast binary matrix product on CPU☆10Updated 8 years ago
- Deep neural network framework for multiple GPUs☆30Updated 9 years ago
- Demos interesting image-in, image-out networks running on both NVIDIA and AMD GPUs, with NNVM☆49Updated 7 years ago
- Batch Normalization Layer for Caffe☆35Updated 8 years ago
- FastHOG library that has been fixed to work with CUDA 5.x on Ubuntu 12.04☆19Updated 10 years ago
- Custom fork containing our own python backend for integration into neon☆15Updated last year
- miniplaces2 deep residual network in neon☆16Updated 8 years ago
- a C++ wrapper of Caffe and mxnet to make predictions☆50Updated 6 years ago
- SqueezeNet Generator☆32Updated 6 years ago
- MXNet Model Serving☆25Updated 7 years ago
- Torch7 bindings for cuda-convnet2 kernels!☆40Updated 8 years ago
- Object Segmentation (NIPS 2014)☆25Updated 9 years ago
- Multi-core CPU implementation of deep learning for 2D and 3D sliding window convolutional networks (ConvNets).☆94Updated 7 years ago
- Caffe with NNPACK integration☆59Updated 8 years ago
- DelugeNets: Deep Networks with Efficient and Flexible Cross-layer Information Inflows☆26Updated 7 years ago
- Object detection with segmentation and context in deep networks☆27Updated 9 years ago
- Direct C++ Interface to PyTorch☆80Updated 6 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision☆63Updated 5 years ago
- Full convolution MultiBox Detector ( like SSD) implemented in Torch.☆40Updated 8 years ago
- lua-torch code to load KITTI dataset☆18Updated 9 years ago
- Asynchronous One Step Q Learning implemented with MXNET☆20Updated 7 years ago