jnbntz / gpu-edu-workshops
Code examples for CUDA and OpenACC
☆34 · Updated last year
Alternatives and similar repositories for gpu-edu-workshops
Users who are interested in gpu-edu-workshops are comparing it to the repositories listed below.
- The Operator Vectorization Library, or OVL, is a python productivity library for defining high performance custom operators for the Tenso… ☆68 · Updated 8 years ago
- Machine Learning Toolkit for Extreme Scale (MaTEx) ☆112 · Updated 7 years ago
- Multi-core CPU implementation of deep learning for 2D and 3D sliding window convolutional networks (ConvNets). ☆94 · Updated 8 years ago
- Fork of magma to include more BLAS ☆28 · Updated 8 years ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms ☆12 · Updated 8 years ago
- Python Binding to NVRTC ☆79 · Updated last year
- Distributed learning with mpi4py ☆48 · Updated 6 years ago
- Python wrappers for the NVIDIA cuDNN libraries ☆142 · Updated 8 years ago
- A visualization tool to show a TensorFlow's graph like TensorBoard ☆44 · Updated 4 years ago
- Scientific library for high-precision computations and research ☆49 · Updated 8 years ago
- review of Deep Learning for Nature ☆31 · Updated 10 years ago
- MPI Parallel framework for training deep learning models built in Theano ☆54 · Updated 8 years ago
- ☆30 · Updated 5 years ago
- A conda-smithy repository for tensorflow. ☆93 · Updated last week
- A Cython interface to FLANN ☆24 · Updated 4 years ago
- neon tutorials ☆93 · Updated 2 years ago
- Examples of building probabilistic models with MXNet linear algebra operators ☆23 · Updated 8 years ago
- Simple utility to show nVidia GPU memory usage wrt. CUDA device IDs. ☆40 · Updated 8 years ago
- Intel(R) Machine Learning Scaling Library is a library providing an efficient implementation of communication patterns used in deep learn… ☆108 · Updated 2 years ago
- [Outdated. Please use https://github.com/numba/numba-examples] Examples of NumbaPro in use. ☆171 · Updated 3 years ago
- Deep neural network framework (C/C++/CUDA). ☆32 · Updated 10 years ago
- Rectified Factor Networks ☆37 · Updated 6 years ago
- ☆15 · Updated 7 years ago
- Benchmarks for CNTK and other toolkits. ☆44 · Updated 9 years ago
- Scheduling GPU cluster workloads with Slurm ☆76 · Updated 7 years ago
- stochs: fast stochastic solvers for machine learning in C++ and Cython ☆26 · Updated 3 years ago
- Distributed NMF/NTF Library ☆48 · Updated 10 months ago
- DropNeuron: Simplifying the Structure of Deep Neural Networks ☆59 · Updated 9 years ago
- Library to manipulate tensors on the GPU. ☆188 · Updated 2 years ago
- Backpropagate derivatives through the Cholesky decomposition ☆58 · Updated 5 years ago