ACANETS / dpcpp-tutorialLinks
Lectures and Labs for Data Parallel Computing and DPC++. Sponsored by Intel Corporation.
☆14Updated 3 years ago
Alternatives and similar repositories for dpcpp-tutorial
Users that are interested in dpcpp-tutorial are comparing it to the libraries listed below
Sorting:
- Utilities to measure read access times of caches, memory, and hardware prefetches for simple and fused operations☆84Updated last year
- The University of Bristol HPC Simulation Engine☆99Updated this week
- Updated C version of the Test Suite for Vectorising Compilers☆65Updated last year
- SST Architectural Simulation Components and Libraries☆99Updated last week
- SST Structural Simulation Toolkit Parallel Discrete Event Core and Services☆171Updated last week
- A list of benchmark suites used in the research related to compilers, program performance, scientific computations etc.☆55Updated last year
- The Splash-3 benchmark suite☆44Updated 2 years ago
- ☆60Updated 10 months ago
- This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc…☆65Updated 10 months ago
- A lightweight memory allocator for hardware-accelerated machine learning☆160Updated 5 months ago
- SYCL Reference Manual☆28Updated last year
- Measure instruction latency and throughput☆24Updated 2 weeks ago
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆25Updated 10 months ago
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆132Updated 7 months ago
- NAS Parallel Benchmarks 3.0 OpenMP C version☆52Updated 10 years ago
- gem5 configuration for intel's skylake micro-architecture☆51Updated 3 years ago
- Microarchitecture diagrams of several CPUs☆38Updated this week
- SYCL for Vitis: Experimental fusion of triSYCL with Intel SYCL oneAPI DPC++ up-streaming effort into Clang/LLVM☆120Updated 9 months ago
- Slides and exercises for persistent memory programming tutorial☆14Updated 2 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆30Updated 11 months ago
- Official page for 18-847C (Spring '22): Data Center Computing☆16Updated 3 years ago
- A collection of RISC-V Vector (RVV) benchmarks to help developers write portably performant RVV code☆125Updated last month
- ☆64Updated 6 years ago
- Collection of synchronization micro-benchmarks and traces from infrastructure applications☆46Updated last month
- The Sniper Multi-Core Simulator☆144Updated 9 months ago
- The CLooG Code Generator in the Polyhedral Model☆51Updated 2 years ago
- A Top-Down Profiler for GPU Applications☆20Updated last year
- Linux source code for ISCA 2020 paper "Enhancing and Exploiting Contiguity for Fast Memory Virtualization"☆18Updated 4 years ago
- Haystack is an analytical cache model that given a program computes the number of cache misses.☆46Updated 6 years ago
- A collection of performance analysis tools, recipes, handy scripts, microbenchmarks & more☆140Updated 2 months ago