ACANETS / dpcpp-tutorialLinks
Lectures and Labs for Data Parallel Computing and DPC++. Sponsored by Intel Corporation.
☆14Updated 3 years ago
Alternatives and similar repositories for dpcpp-tutorial
Users that are interested in dpcpp-tutorial are comparing it to the libraries listed below
Sorting:
- Utilities to measure read access times of caches, memory, and hardware prefetches for simple and fused operations☆85Updated 2 years ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆84Updated this week
- The University of Bristol HPC Simulation Engine☆99Updated 2 months ago
- Updated C version of the Test Suite for Vectorising Compilers☆69Updated last year
- SYCL for Vitis: Experimental fusion of triSYCL with Intel SYCL oneAPI DPC++ up-streaming effort into Clang/LLVM☆121Updated 11 months ago
- A lightweight memory allocator for hardware-accelerated machine learning☆172Updated last month
- SST Structural Simulation Toolkit Parallel Discrete Event Core and Services☆176Updated this week
- A list of benchmark suites used in the research related to compilers, program performance, scientific computations etc.☆58Updated 2 years ago
- ☆64Updated 6 years ago
- The Splash-3 benchmark suite☆44Updated 2 years ago
- ☆59Updated this week
- ☆85Updated last week
- Documentation of the RISC-V C API☆77Updated this week
- SYCL Open Source Specification☆138Updated this week
- Measure instruction latency and throughput☆25Updated 2 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆154Updated this week
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆136Updated 9 months ago
- This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc…☆67Updated last year
- ☆208Updated this week
- SYCL Reference Manual☆28Updated last year
- LMBench for ARC - based off of tarball from sourceforge, slightly modified for post-processing ease☆34Updated last year
- The MiBench testsuite, extended for use in general embedded environments☆13Updated 7 years ago
- Conversions to MLIR EmitC☆133Updated 10 months ago
- PTX-EMU is a simple emulator for CUDA program.☆37Updated 6 months ago
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆50Updated this week
- We solve the two challenges architects face when designing heterogeneous processors with cache coherent shared memory. First, we develop …☆20Updated 3 years ago
- Trying to figure various CPU things out☆87Updated last year
- Tutorial for LLVM Dev Conference 2019.☆15Updated 6 years ago
- Benchmark for measuring the performance of sparse and irregular memory access.☆80Updated 2 months ago
- NAS Parallel Benchmarks 3.0 OpenMP C version☆52Updated 10 years ago