CoffeeBeforeArch / spring_2020_tutorial
"Hardware, Software, and Compilers! Oh My!" tutorial files
☆16Updated 5 years ago
Alternatives and similar repositories for spring_2020_tutorial:
Users who are interested in spring_2020_tutorial are comparing it to the libraries listed below.
- ☆43Updated 4 years ago
- ☆22Updated 2 years ago
- Examples for using SYCL on CUDA☆62Updated 2 months ago
- My notes on various HPC papers.☆22Updated 2 years ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆104Updated 7 years ago
- Emulating DMA Engines on GPUs for Performance and Portability☆40Updated 9 years ago
- ☆67Updated 11 years ago
- The ultimate memory bandwidth benchmark☆49Updated 3 months ago
- RAJA Performance Suite☆117Updated last week
- Serial and parallel implementations of matrix multiplication☆40Updated 4 years ago
- This is a tuned sparse matrix dense vector multiplication (SpMV) library☆21Updated 9 years ago
- SYCL Benchmark Suite☆64Updated 2 months ago
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆93Updated 3 years ago
- Little OpenMP Library☆160Updated 2 years ago
- Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc.)☆60Updated last month
- Parallel Tasking Library (PTL) - Lightweight C++11 multithreading tasking system featuring thread-pool, task-groups, and lock-free task q…☆46Updated 5 months ago
- ☆34Updated last year
- TLB Benchmarks☆33Updated 7 years ago
- Evaluating different memory managers for dynamic GPU memory☆25Updated 4 years ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆50Updated last month
- XSBench: The Monte Carlo Macroscopic Cross Section Lookup Benchmark☆80Updated last year
- SYCL Conformance Tests☆70Updated this week
- Tools to create performance and roofline plots from measured data☆58Updated 10 years ago
- This package includes the implementation for four sparse linear algebra kernels: Sparse-Matrix-Vector-Multiplication (SpMV), Sparse-Trian…☆26Updated 4 years ago
- ☆17Updated 3 years ago
- ☆56Updated last month
- Parallelized and vectorized SpMV on Intel Xeon Phi (Knights Landing, AVX512, KNL)☆25Updated last year
- ☆30Updated 2 years ago
- HPC Challenge Benchmark☆52Updated 2 years ago
- Generate simple index ranges in C++ and CUDA C++☆39Updated last year