jnbntz / gpu-edu-workshops
Code examples for CUDA and OpenACC
☆34 Updated 9 months ago
Alternatives and similar repositories for gpu-edu-workshops
Users who are interested in gpu-edu-workshops are comparing it to the libraries listed below.
- Generating Families of Practical Fast Matrix Multiplication Algorithms ☆12 Updated 7 years ago
- Examples of building probabilistic models with MXNet linear algebra operators ☆23 Updated 7 years ago
- The Operator Vectorization Library, or OVL, is a python productivity library for defining high performance custom operators for the Tenso… ☆68 Updated 8 years ago
- Distributed learning with mpi4py ☆48 Updated 5 years ago
- Python Binding to NVRTC ☆79 Updated 7 months ago
- GPU implementation of classical molecular dynamics proxy application. ☆31 Updated 8 years ago
- Extreme Learning Machine - C++ library ☆33 Updated 10 years ago
- ☆15 Updated 7 years ago
- The "CUDA templates" are a collection of C++ template classes and functions which provide a consistent interface to NVIDIA's "Compute Uni… ☆27 Updated 13 years ago
- Sources for OpenCL and CUDA tutorials. http://jlaning.com ☆20 Updated 9 years ago
- This repository contains easy-to-read Python/CUDA implementations of fundamental GPU computing primitives. ☆36 Updated 9 years ago
- Rectified Factor Networks ☆38 Updated 5 years ago
- Distributed Learning by Pair-Wise Averaging ☆52 Updated 7 years ago
- Convolution op for Theano based on CuFFT using scikits.cuda ☆52 Updated 10 years ago
- MXNet Model Serving ☆25 Updated 7 years ago
- Python wrappers for the NVIDIA cuDNN libraries ☆140 Updated 8 years ago
- Multi-core CPU implementation of deep learning for 2D and 3D sliding window convolutional networks (ConvNets). ☆94 Updated 8 years ago
- A GPU / CPU implementation of a feed forward neural network ☆31 Updated 10 years ago
- ☆19 Updated 7 years ago
- stochs: fast stochastic solvers for machine learning in C++ and Cython ☆26 Updated 2 years ago
- Machine Learning Toolkit for Extreme Scale (MaTEx) ☆110 Updated 6 years ago
- Scientific library for high-precision computations and research ☆49 Updated 7 years ago
- OpenCL porting of the GROMACS molecular simulation toolkit ☆25 Updated 9 years ago
- NGC VMI Example Scripts ☆24 Updated 6 years ago
- ☆38 Updated 7 years ago
- Experiments from the article "Tensorial Mixture Models" ☆25 Updated 7 years ago
- Deep neural network framework (C/C++/CUDA). ☆31 Updated 9 years ago
- Research Blog ☆24 Updated 7 years ago
- ☆73 Updated 13 years ago
- Fork of magma to include more BLAS ☆28 Updated 8 years ago