ParCoreLab / ReuseTracker
A fast and accurate reuse distance analyzer for multi-threaded applications. It leverages existing hardware features in commodity CPUs.
☆16Updated 2 years ago
Alternatives and similar repositories for ReuseTracker:
Users that are interested in ReuseTracker are comparing it to the libraries listed below
- Haystack is an analytical cache model that given a program computes the number of cache misses.☆46Updated 5 years ago
- CUDAAdvisor: a GPU profiling tool☆48Updated 6 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆28Updated 6 months ago
- A low-overhead tool to periodically collect system-wide hardware performance counters on Intel64 systems.☆32Updated 2 years ago
- ☆28Updated 2 years ago
- Tools and experiments for 0sim. Simulate system software behavior on machines with terabytes of main memory from your desktop.☆21Updated 4 years ago
- The Splash-3 benchmark suite☆43Updated last year
- ☆53Updated 5 years ago
- Collaborative Parallelization Framework (CPF)☆32Updated last year
- Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK.☆38Updated 3 years ago
- Repeated access to L2-containable loops to look for snoop filter conflicts on Intel Skylake Xeon processors.☆29Updated 6 years ago
- NumaMMA is a lightweight memory profiler for parallel applications☆27Updated 11 months ago
- ☆59Updated 5 months ago
- a Pin tool for collecting microarchitecture-independent workload characteristics☆60Updated last year
- ☆37Updated 3 years ago
- Performance Prediction Toolkit☆51Updated 3 months ago
- Ocolos is the first online code layout optimization system for unmodified applications written in unmanaged languages.☆52Updated last year
- Utilities to measure read access times of caches, memory, and hardware prefetches for simple and fused operations☆82Updated last year
- ☆34Updated 3 years ago
- Hopscotch: A benchmark suite for memory performance evaluation☆15Updated 2 years ago
- User-space Page Management☆108Updated 7 months ago
- Benchmarks for auto-vectorization and revectorization, including both hand-vectorized and scalar code☆28Updated 6 years ago
- A compiler to automatically transform applications into disaggregated memory apps.☆16Updated last year
- Cycle-level, trace-driven, parallel GPU simulator for NVIDIA Pascal.☆12Updated 10 months ago
- ☆40Updated 7 years ago
- CERE: Codelet Extractor and REplayer☆40Updated last year
- CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels☆32Updated 4 years ago
- This is the implementation of our research system Illuminator that was published in ASPLOS 2018 with the title "Making Huge Pages Actuall…☆11Updated 4 years ago
- Memory system characterization benchmarks using atomic operations☆14Updated 8 months ago
- A new memory mapping interface for efficient direct user-space access to byte-addressable storage, published in MICRO2022.☆15Updated 2 years ago