eth-cscs / pascal-trainingLinks
Teaching materials, slides and exercises, for the GPU & CUDA training in 2017
☆13Updated 8 years ago
Alternatives and similar repositories for pascal-training
Users that are interested in pascal-training are comparing it to the libraries listed below
Sorting:
- Simple starter code for SYCL and Eigen☆18Updated 8 years ago
- Data repository supplementing my blog post comparing hardware characteristics of CPUs, GPUs, and MICs☆35Updated 3 years ago
- ☆87Updated 8 years ago
- BLAS implementation for Intel FPGA☆78Updated 5 years ago
- Autonomic Performance Environment for eXascale (APEX)☆50Updated 5 months ago
- High-Performance Reproducible BLAS using posit arithmetic☆12Updated 3 years ago
- Benchmark Suite for Heterogenuous FFT Implementations☆35Updated last year
- C++ User interface for the Platform independent Library Alpaka☆40Updated last month
- Range-based for loops to iterate over a range of numbers or values☆34Updated 9 years ago
- ReMPI (MPI Record-and-Replay)☆40Updated last year
- nvptx-tools: a collection of tools for use with nvptx-none GCC toolchains.☆52Updated last year
- ☆30Updated 2 years ago
- Examples for using SYCL on CUDA☆62Updated 3 months ago
- Kernel Tuning Toolkit☆64Updated last month
- Contains sources related to the lectures and labs for the NVIDIA OpenACC course.☆50Updated 6 years ago
- Interoperability examples for OpenACC.☆48Updated 5 years ago
- Tutorials for Timemory☆21Updated last year
- 3D Tensors for Blaze (https://bitbucket.org/blaze-lib/blaze)☆37Updated 5 years ago
- A tool for debugging and assessing floating point precision and reproducibility.☆90Updated 2 months ago
- Header only framework for data analysis in massively parallel platforms.☆113Updated 3 months ago
- A domain-specific language and compiler for image processing☆77Updated 4 years ago
- tools to create performance and roofline plots from measured data☆60Updated 11 years ago
- NPBench - A Benchmarking Suite for High-Performance NumPy☆91Updated 3 weeks ago
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆95Updated 4 years ago
- A library to benchmark CUDA code, similar to google benchmark.☆30Updated 4 years ago
- A task benchmark☆44Updated last year
- Subset of BLAS routines optimized for NVIDIA GPUs☆75Updated 2 years ago
- CUDA tool set for non-C++ languages that provides similar functionality like Thrust, with NVRTC at its core.☆59Updated 3 years ago
- WIP · CUDA compatibility for Blaze · https://bitbucket.org/blaze-lib/blaze☆20Updated 6 years ago
- Omni Compiler for C and Fortran programs with XcalableMP and OpenACC directives☆62Updated 2 years ago