xingdi-eric-yuan / cuda-deep-neural-nets
Deep neural network framework (C/C++/CUDA).
☆31Updated 9 years ago
Alternatives and similar repositories for cuda-deep-neural-nets:
Users that are interested in cuda-deep-neural-nets are comparing it to the libraries listed below
- Direct C++ Interface to PyTorch☆80Updated 6 years ago
- Simple MXNet sequence-to-sequence model (neural machine translation)☆24Updated 7 years ago
- Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets☆28Updated 8 years ago
- detection-developing☆20Updated 10 years ago
- Training deep neural networks with low precision multiplications☆63Updated 9 years ago
- ☆16Updated 6 years ago
- A fast deep neural network library (CPU) for speech recognition☆84Updated 5 years ago
- SqueezeNet Generator☆31Updated 6 years ago
- GPU/CPU (CUDA) Implementation of "Recurrent Memory Array Structures", Simple RNN, LSTM, Array LSTM..☆25Updated 4 years ago
- MXNet implementation of AC-BLSTM☆23Updated 5 years ago
- OpenCL Inference Engine for pytorch☆51Updated 7 years ago
- Demos interesting image-in, image-out networks running on both NVIDIA and AMD GPUs, with NNVM☆49Updated 7 years ago
- a C++ wrapper of Caffe and mxnet to make predictions☆50Updated 6 years ago
- DelugeNets: Deep Networks with Efficient and Flexible Cross-layer Information Inflows☆26Updated 7 years ago
- Fast binary matrix product on CPU☆10Updated 9 years ago
- Caffe version of code for our paper "Joint unsupervised learning of deep representations and image clusters"☆16Updated 7 years ago
- PyTorch Framework Integration for Tensor Comprehensions☆14Updated 6 years ago
- Flattened convolutional neural networks (1D convolution modules for Torch nn)☆61Updated 9 years ago
- ☆57Updated 6 years ago
- ☆16Updated 7 years ago
- Long Short-Term Memory Recurrent Neural Networks☆27Updated 9 years ago
- Implementation of Residual Learning with Stochastic Depth http://arxiv.org/pdf/1603.09382v2.pdf☆10Updated 8 years ago
- Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning☆33Updated 8 years ago
- Training neural networks with 8-bit computations☆28Updated 8 years ago
- Wide Residual Networks implemented in TensorLayer and TensorFlow.☆44Updated 8 years ago
- Faster Deep Neural Networks☆36Updated 7 years ago
- ☆14Updated 7 years ago
- Simple fully-connected highway networks using TensorFlow.☆25Updated 7 years ago
- DNN Inference with CPU, C++, ONNX support: Instant☆56Updated 6 years ago
- PyTorch development for onnx☆21Updated 7 years ago