ParCoreLab / ReuseTracker
A fast and accurate reuse distance analyzer for multi-threaded applications. It leverages existing hardware features in commodity CPUs.
☆16Updated last year
Related projects: ⓘ
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆27Updated last year
- The Splash-3 benchmark suite☆40Updated last year
- Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK.☆35Updated 2 years ago
- CUDAAdvisor: a GPU profiling tool☆48Updated 6 years ago
- A low-overhead tool to periodically collect system-wide hardware performance counters on Intel64 systems.☆31Updated 2 years ago
- ☆44Updated 5 years ago
- Collaborative Parallelization Framework (CPF)☆31Updated last year
- ☆58Updated 2 years ago
- Haystack is an analytical cache model that given a program computes the number of cache misses.☆42Updated 5 years ago
- Pointer-chasing memory benchmark (forked from Doug Pase's code).☆57Updated 10 years ago
- ☆35Updated 2 years ago
- Benchmarks for auto-vectorization and revectorization, including both hand-vectorized and scalar code☆24Updated 5 years ago
- A Top-Down Profiler for GPU Applications☆13Updated 6 months ago
- ☆26Updated last year
- Utilities to measure read access times of caches, memory, and hardware prefetches for simple and fused operations☆72Updated 10 months ago
- Tools and experiments for 0sim. Simulate system software behavior on machines with terabytes of main memory from your desktop.☆20Updated 4 years ago
- Ocolos is the first online code layout optimization system for unmodified applications written in unmanaged languages.☆51Updated 9 months ago
- Source code for the paper "Profile Guided Optimization without Profiles: A Machine Learning Approach"☆23Updated 2 years ago
- NumaMMA is a lightweight memory profiler for parallel applications☆25Updated 5 months ago
- ☆40Updated 7 years ago
- Repeated access to L2-containable loops to look for snoop filter conflicts on Intel Skylake Xeon processors.☆29Updated 6 years ago
- CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels☆30Updated 3 years ago
- A GPU cache model for research purposes☆26Updated 10 years ago
- Performance Prediction Toolkit☆51Updated 2 years ago
- User-space Page Management☆102Updated last month
- This serves as a repository for reproducibility of the SC21 paper "In-Depth Analyses of Unified Virtual Memory System for GPU Accelerated…☆23Updated 11 months ago
- A Shared Memory Multithreaded Graph Benchmark Suite for Multicores☆34Updated 2 years ago
- a Pin tool for collecting microarchitecture-independent workload characteristics☆58Updated 7 months ago
- IBM Platform-Independent Software Analysis☆14Updated 6 years ago
- ☆24Updated 2 years ago