eth-cscs / pascal-trainingLinks
Teaching materials, slides and exercises, for the GPU & CUDA training in 2017
☆13Updated 8 years ago
Alternatives and similar repositories for pascal-training
Users that are interested in pascal-training are comparing it to the libraries listed below
Sorting:
- Tutorials for Timemory☆21Updated last year
- High-Performance Reproducible BLAS using posit arithmetic☆12Updated 3 years ago
- BLAS implementation for Intel FPGA☆78Updated 5 years ago
- Contains sources related to the lectures and labs for the NVIDIA OpenACC course.☆50Updated 6 years ago
- MATAR is a C++ software library to allow developers to easily create and use dense and sparse data representations that are also portable…☆34Updated last month
- Very-Low Overhead Checkpointing System☆59Updated 6 months ago
- Loop Kernel Analysis and Performance Modeling Toolkit☆96Updated 10 months ago
- Data repository supplementing my blog post comparing hardware characteristics of CPUs, GPUs, and MICs☆35Updated 3 years ago
- Autonomic Performance Environment for eXascale (APEX)☆50Updated 6 months ago
- ReMPI (MPI Record-and-Replay)☆40Updated last year
- C++ User interface for the Platform independent Library Alpaka☆39Updated 2 months ago
- CUDA tool set for non-C++ languages that provides similar functionality like Thrust, with NVRTC at its core.☆59Updated 3 years ago
- Absinthe is an optimization framework to fuse and tile stencil codes in one shot☆14Updated 6 years ago
- Examples for using SYCL on CUDA☆63Updated 5 months ago
- A tool for debugging and assessing floating point precision and reproducibility.☆93Updated last week
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆37Updated last week
- An MPI ABI compatibility layer☆34Updated 5 months ago
- HDF5 Performance Analysis Checklist☆13Updated last year
- Interoperability examples for OpenACC.☆49Updated 5 years ago
- MPI wrapper generator, for writing PMPI tool libraries☆36Updated 10 months ago
- NPBench - A Benchmarking Suite for High-Performance NumPy☆91Updated last week
- Error-Free Transformations as building blocks for compensated algorithms☆16Updated 2 years ago
- ☆35Updated 5 years ago
- Scalable High-performance Algorithms and Data-structures☆136Updated 2 months ago
- Distributed-memory, arbitrary-precision, dense and sparse-direct linear algebra, conic optimization, and lattice reduction☆71Updated 10 months ago
- Custom-Precision Floating-point numbers.☆41Updated this week
- DLA-Future☆82Updated last week
- Ravel MPI trace visualization tool☆30Updated 4 years ago
- Orio is an open-source extensible framework for the definition of domain-specific languages and generation of optimized code for multiple…☆37Updated last month
- Subset of BLAS routines optimized for NVIDIA GPUs☆76Updated 2 years ago