jnbntz / gpu-edu-workshops
Code examples for CUDA and OpenACC
☆34Updated last year
Alternatives and similar repositories for gpu-edu-workshops
Users that are interested in gpu-edu-workshops are comparing it to the libraries listed below
- The Operator Vectorization Library, or OVL, is a python productivity library for defining high performance custom operators for the Tenso…☆68Updated 9 years ago
- Distributed learning with mpi4py☆48Updated 6 years ago
- neon tutorials☆93Updated 3 years ago
- Machine Learning Toolkit for Extreme Scale (MaTEx)☆111Updated 7 years ago
- A visualization tool to show a TensorFlow's graph like TensorBoard☆44Updated 4 years ago
- Examples of building probabilistic models with MXNet linear algebra operators☆23Updated 8 years ago
- Sources for OpenCL and CUDA tutorials. http://jlaning.com☆20Updated 10 years ago
- Multi-core CPU implementation of deep learning for 2D and 3D sliding window convolutional networks (ConvNets).☆94Updated 9 years ago
- Python wrappers for the NVIDIA cuDNN libraries☆142Updated 8 years ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms☆12Updated 8 years ago
- Benchmarks for CNTK and other toolkits.☆44Updated 10 years ago
- Fork of magma to include more BLAS☆28Updated 9 years ago
- Tutorials for Horovod☆85Updated 4 years ago
- NGC VMI Example Scripts☆26Updated 7 years ago
- A GPU / CPU implementation of a feed forward neural network☆31Updated 10 years ago
- This repository contains easy-to-read Python/CUDA implementations of fundamental GPU computing primitives.☆36Updated 10 years ago
- ☆34Updated 8 years ago
- Scientific library for high-precision computations and research☆49Updated 8 years ago
- MPI parallel map and cluster scheduling☆59Updated this week
- Intel(R) Machine Learning Scaling Library is a library providing an efficient implementation of communication patterns used in deep learn…☆108Updated 3 years ago
- Pythonic Deep Learning Framework Inspired by Torch's Neural Network package☆28Updated 7 years ago
- Python Binding to NVRTC☆79Updated last year
- Scheduling GPU cluster workloads with Slurm☆78Updated 7 years ago
- MPI Parallel framework for training deep learning models built in Theano☆54Updated 8 years ago
- This fork of Theano/Theano is dedicated to improve its performance on CPU device, in particular Intel® Xeon® processors and Intel® Xeon P…☆58Updated 3 years ago
- AutoDiff DAG constructor, built on numpy and Cython. A Neural Turing Machine and DeepQ agent run on it. Clean code for educational purpos…☆79Updated 5 years ago
- Benchmarking State-of-the-Art Deep Learning Software Tools☆171Updated 8 years ago
- A Cython interface to FLANN☆24Updated 5 years ago
- A conda-smithy repository for tensorflow.☆93Updated 2 months ago
- Scripts for building Singularity images☆10Updated 6 years ago