SamAinsworth / reproduce-cgo2017-paperLinks
Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK.
☆38Updated 3 years ago
Alternatives and similar repositories for reproduce-cgo2017-paper
Users that are interested in reproduce-cgo2017-paper are comparing it to the libraries listed below
Sorting:
- CUDAAdvisor: a GPU profiling tool☆49Updated 6 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆30Updated 9 months ago
- ☆63Updated 6 years ago
- The Splash-3 benchmark suite☆44Updated 2 years ago
- ☆34Updated 3 years ago
- A Shared Memory Multithreaded Graph Benchmark Suite for Multicores☆36Updated 3 weeks ago
- Haystack is an analytical cache model that given a program computes the number of cache misses.☆46Updated 5 years ago
- ☆19Updated 2 years ago
- CERE: Codelet Extractor and REplayer☆40Updated last year
- This serves as a repository for reproducibility of the SC21 paper "In-Depth Analyses of Unified Virtual Memory System for GPU Accelerated…☆32Updated last year
- A framework that helps implementing swizzle GPU kernels☆42Updated 5 years ago
- Ocolos is the first online code layout optimization system for unmodified applications written in unmanaged languages.☆52Updated this week
- A Speculation-Aware Collaborative Dependence Analysis Framework☆28Updated 11 months ago
- Collaborative Parallelization Framework (CPF)☆32Updated last year
- A low-overhead tool to periodically collect system-wide hardware performance counters on Intel64 systems.☆32Updated 2 years ago
- Chai☆44Updated last year
- Race detector for NVIDIA GPUs, published in SOSP 2021.☆18Updated 4 months ago
- Pannotia v0.9 is a suite of OpenCL graph applications☆24Updated 7 years ago
- Thinking is hard - automate it☆19Updated 2 years ago
- a Pin tool for collecting microarchitecture-independent workload characteristics☆59Updated last year
- Characterizing and Modeling Non-Volatile Memory Systems [MICRO'20, TopPicks'21]☆33Updated 3 years ago
- TLB Benchmarks☆34Updated 7 years ago
- ☆30Updated 2 years ago
- ☆17Updated 3 years ago
- ☆52Updated 5 years ago
- Instanciate the Cache Aware Roofline Model on single socket and multisocket systems.☆27Updated 6 years ago
- ☆59Updated 8 months ago
- LonestarGPU: Irregular algorithms parallelized for GPUs☆35Updated 5 years ago
- Pointer-chasing memory benchmark (forked from Doug Pase's code).☆59Updated 11 years ago
- A GPU cache model for research purposes☆28Updated 11 years ago