eth-cscs / pascal-training
Teaching materials, slides and exercises, for the GPU & CUDA training in 2017
☆13 Updated 7 years ago
Related projects
Alternatives and complementary repositories for pascal-training
- Benchmark Suite for Heterogeneous FFT Implementations ☆34 Updated 10 months ago
- Data repository supplementing my blog post comparing hardware characteristics of CPUs, GPUs, and MICs ☆34 Updated 2 years ago
- Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware ☆16 Updated 2 years ago
- High-Performance Reproducible BLAS using posit arithmetic ☆12 Updated 2 years ago
- Autonomic Performance Environment for eXascale (APEX) ☆38 Updated 2 weeks ago
- Tutorials for Timemory ☆19 Updated 3 months ago
- Interoperability examples for OpenACC. ☆48 Updated 4 years ago
- DLA-Future ☆65 Updated this week
- 3D Tensors for Blaze (https://bitbucket.org/blaze-lib/blaze) ☆36 Updated 4 years ago
- Absinthe is an optimization framework to fuse and tile stencil codes in one shot ☆14 Updated 5 years ago
- Implementation of MPI that supports large counts ☆45 Updated last year
- TAU Performance System Public Mirror (Updated every night at midnight, USA Pacific Time) ☆39 Updated this week
- Aries Network Performance Counters Monitoring Library ☆11 Updated 3 years ago
- C++ User interface for the Platform independent Library Alpaka ☆37 Updated 2 months ago
- A library for C++/Fortran computer simulations (e.g. stencil codes, mesh-free, unstructured grids, n-body & particle methods). Scales fro… ☆38 Updated 3 years ago
- Performance engineering for the rest of us. ☆29 Updated last year
- Mirror from http://lotsofcores.com book 2, since Dropbox isn't good for everyone ☆38 Updated 8 years ago
- nvptx-tools: a collection of tools for use with nvptx-none GCC toolchains. ☆46 Updated 2 months ago
- HiCMA: Hierarchical Computations on Manycore Architectures ☆28 Updated last year
- Contains sources related to the lectures and labs for the NVIDIA OpenACC course. ☆52 Updated 5 years ago
- A fast shared & distributed memory task-based runtime in C++ ☆26 Updated 3 years ago
- NPBench - A Benchmarking Suite for High-Performance NumPy ☆73 Updated this week
- Code repo for lotsofcores.com book 1, here since Dropbox doesn't work for everyone ☆26 Updated 8 years ago
- Parallel GDB developed for debugging HPC code at Lawrence Livermore National Laboratory. ☆32 Updated 9 years ago
- Ravel MPI trace visualization tool ☆29 Updated 3 years ago