ParCoreLab / ReuseTracker
A fast and accurate reuse distance analyzer for multi-threaded applications. It leverages existing hardware features in commodity CPUs.
☆20 Updated 2 years ago
Alternatives and similar repositories for ReuseTracker
Users who are interested in ReuseTracker are comparing it to the repositories listed below
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs ☆30 Updated last year
- Haystack is an analytical cache model that, given a program, computes the number of cache misses. ☆46 Updated 6 years ago
- Ocolos is the first online code layout optimization system for unmodified applications written in unmanaged languages. ☆53 Updated 3 months ago
- CUDAAdvisor: a GPU profiling tool ☆49 Updated 7 years ago
- Collaborative Parallelization Framework (CPF) ☆32 Updated 2 years ago
- Utilities to measure read access times of caches, memory, and hardware prefetches for simple and fused operations ☆84 Updated last year
- ☆64 Updated 6 years ago
- The Splash-3 benchmark suite ☆44 Updated 2 years ago
- Tools and experiments for 0sim. Simulate system software behavior on machines with terabytes of main memory from your desktop. ☆21 Updated 5 years ago
- Benchmarks for auto-vectorization and revectorization, including both hand-vectorized and scalar code ☆30 Updated 6 years ago
- User-space Page Management ☆108 Updated last year
- A GPU FP32 computation method with Tensor Cores. ☆21 Updated 2 years ago
- ☆31 Updated 2 years ago
- Performance Prediction Toolkit ☆52 Updated last week
- Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK. ☆41 Updated 3 years ago
- Unit benchmarks of CUDA event APIs. ☆17 Updated last year
- Updated C version of the Test Suite for Vectorising Compilers ☆65 Updated last year
- Source code for the paper "Profile Guided Optimization without Profiles: A Machine Learning Approach" ☆26 Updated 3 years ago
- Repo for OSDI 2023 paper: "Ship your Critical Section Not Your Data: Enabling Transparent Delegation with TCLocks" ☆21 Updated 10 months ago
- A compiler to automatically transform applications into disaggregated memory apps. ☆16 Updated last year
- This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc… ☆65 Updated 11 months ago
- Intel® Instrumentation and Tracing Technology (ITT) and Just-In-Time (JIT) APIs ☆121 Updated last month
- ☆40 Updated 3 years ago
- Race detector for NVIDIA GPUs, published in SOSP 2021. ☆18 Updated 7 months ago
- ☆60 Updated 11 months ago
- Linux source code for ISCA 2020 paper "Enhancing and Exploiting Contiguity for Fast Memory Virtualization" ☆18 Updated 4 years ago
- Intel® Data Mover Library (Intel® DML) ☆93 Updated 5 months ago
- Polyhedral Parallel Code Generation (source repository: http://repo.or.cz/ppcg.git) ☆127 Updated 3 years ago
- ☆49 Updated 11 months ago
- Slice-aware Memory Management - Exploiting NUCA Characteristic of LLC in Intel Processors ☆41 Updated 6 years ago