jnbntz / gpu-edu-workshopsLinks
Code examples for CUDA and OpenACC
☆34Updated 10 months ago
Alternatives and similar repositories for gpu-edu-workshops
Users that are interested in gpu-edu-workshops are comparing it to the libraries listed below
Sorting:
- Fork of magma to include more BLAS☆28Updated 8 years ago
- Machine Learning Toolkit for Extreme Scale (MaTEx)☆110Updated 6 years ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms☆12Updated 7 years ago
- Python Binding to NVRTC☆79Updated 8 months ago
- The Operator Vectorization Library, or OVL, is a python productivity library for defining high performance custom operators for the Tenso…☆68Updated 8 years ago
- Distributed learning with mpi4py☆48Updated 6 years ago
- Simple utility to show nVidia GPU memory usage wrt. CUDA device IDs.☆40Updated 8 years ago
- Python wrappers for the NVIDIA cuDNN libraries☆140Updated 8 years ago
- C++ library for numerical arrays and tensor objects and operations with them, designed to allow Matlab-style programming.☆52Updated 2 years ago
- MPI Parallel framework for training deep learning models built in Theano☆54Updated 7 years ago
- Multi-core CPU implementation of deep learning for 2D and 3D sliding window convolutional networks (ConvNets).☆94Updated 8 years ago
- A visualization tool to show a TensorFlow's graph like TensorBoard☆44Updated 4 years ago
- Catamount is a compute graph analysis tool to load, construct, and modify deep learning models and to symbolically analyze their compute …☆14Updated 4 years ago
- Examples of building probabilistic models with MXNet linear algebra operators☆23Updated 7 years ago
- ☆19Updated 6 years ago
- A Cython interface to FLANN☆24Updated 4 years ago
- Scheduling GPU cluster workloads with Slurm☆74Updated 6 years ago
- Research Blog☆24Updated 7 years ago
- Distributed Learning by Pair-Wise Averaging☆52Updated 7 years ago
- The "CUDA templates" are a collection of C++ template classes and functions which provide a consistent interface to NVIDIA's "Compute Uni…☆27Updated 13 years ago
- This repository contains the results and code for the MLPerf™ Training v0.5 benchmark.☆35Updated last month
- ☆14Updated 6 years ago
- Distributed NMF/NTF Library☆46Updated 6 months ago
- Convolution op for Theano based on CuFFT using scikits.cuda☆52Updated 10 years ago
- Scientific library for high-precision computations and research☆49Updated 7 years ago
- An ONNX backend using PlaidML☆28Updated 7 years ago
- Tools and extensions for CUDA profiling☆65Updated 5 years ago
- Custom fork containing our own python backend for integration into neon☆15Updated 2 years ago
- a heterogeneous multiGPU level-3 BLAS library☆45Updated 5 years ago
- Benchmarks for CNTK and other toolkits.☆44Updated 9 years ago