eth-cscs / pascal-trainingLinks
Teaching materials, slides and exercises, for the GPU & CUDA training in 2017
☆13Updated 8 years ago
Alternatives and similar repositories for pascal-training
Users that are interested in pascal-training are comparing it to the libraries listed below
Sorting:
- Benchmark Suite for Heterogenuous FFT Implementations☆35Updated last year
- Contains sources related to the lectures and labs for the NVIDIA OpenACC course.☆50Updated 5 years ago
- MATAR is a C++ software library to allow developers to easily create and use dense and sparse data representations that are also portable…☆29Updated last week
- ☆87Updated 8 years ago
- ReMPI (MPI Record-and-Replay)☆39Updated last year
- Interoperability examples for OpenACC.☆48Updated 4 years ago
- A tool for debugging and assessing floating point precision and reproducibility.☆84Updated last month
- ☆35Updated 5 years ago
- Error-Free Transformations as building blocks for compensated algorithms☆15Updated 2 years ago
- Autonomic Performance Environment for eXascale (APEX)☆49Updated last month
- Data repository supplementing my blog post comparing hardware characteristics of CPUs, GPUs, and MICs☆35Updated 3 years ago
- DLA-Future☆77Updated last week
- Code repo for lotsofcores.com book 1, here since dropbox doesn't work for everyone☆26Updated 9 years ago
- Exercises and Solutions for "Programming Your GPU with OpenMP: A Hands-On Introduction"☆146Updated 5 months ago
- nvptx-tools: a collection of tools for use with nvptx-none GCC toolchains.☆52Updated 11 months ago
- Performance engineering for the rest of us.☆31Updated last month
- mirror from http://lotsofcores.com book 2, since dropbox isn't good for everyone☆38Updated 9 years ago
- NPBench - A Benchmarking Suite for High-Performance NumPy☆87Updated 3 months ago
- Loop Kernel Analysis and Performance Modeling Toolkit☆94Updated 5 months ago
- Tutorials for Timemory☆20Updated last year
- High-Performance Reproducible BLAS using posit arithmetic☆12Updated 3 years ago
- Geant4 EM physics simulation R&D project looking for solutions to reduce the computing performance bottleneck experienced by HEP detector…☆12Updated 2 months ago
- OpenSHMEM Application Programming Interface☆58Updated 9 months ago
- BLAS implementation for Intel FPGA☆77Updated 4 years ago
- OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub…☆59Updated this week
- tools to create performance and roofline plots from measured data☆59Updated 11 years ago
- An MPI ABI compatibility layer☆33Updated last week
- CUDA tool set for non-C++ languages that provides similar functionality like Thrust, with NVRTC at its core.☆59Updated 3 years ago
- Subset of BLAS routines optimized for NVIDIA GPUs☆71Updated 2 years ago
- 3D Tensors for Blaze (https://bitbucket.org/blaze-lib/blaze)☆37Updated 4 years ago