UniHD-CEG / cuda-memtrace
LLVM Plugin to Instrument Global Memory Accesses in CUDA Kernels
☆10Updated 4 years ago
Alternatives and similar repositories for cuda-memtrace:
Users that are interested in cuda-memtrace are comparing it to the libraries listed below
- ☆59Updated 4 months ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆28Updated 5 months ago
- ☆31Updated last year
- CUDAAdvisor: a GPU profiling tool☆48Updated 6 years ago
- Updated C version of the Test Suite for Vectorising Compilers☆55Updated 11 months ago
- Repeated access to L2-containable loops to look for snoop filter conflicts on Intel Skylake Xeon processors.☆29Updated 6 years ago
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code☆14Updated last year
- a Pin tool for collecting microarchitecture-independent workload characteristics☆60Updated last year
- PIN-tool to produce multi-threaded atomic memory traces☆36Updated 11 years ago
- ☆36Updated 2 months ago
- Measure instruction latency and throughput☆23Updated last week
- An LLVM pass to profile dynamic LLVM IR instructions and runtime values☆138Updated 4 years ago
- Tools to track memory accesses in applications and visualize the patterns to reveal opportunities for optimization.☆91Updated 9 years ago
- ☆35Updated 5 years ago
- HW interface for memory caches☆26Updated 4 years ago
- Race detector for NVIDIA GPUs, published in SOSP 2021.☆19Updated 8 months ago
- CERE: Codelet Extractor and REplayer☆40Updated last year
- Creating beautiful gem5 simulations☆47Updated 3 years ago
- GPUReplay, ASPLOS 2022☆33Updated 3 years ago
- Haystack is an analytical cache model that given a program computes the number of cache misses.☆45Updated 5 years ago
- ☆68Updated 4 years ago
- ☆34Updated 3 years ago
- ☆33Updated 2 years ago
- A Speculation-Aware Collaborative Dependence Analysis Framework☆28Updated 7 months ago
- NAS Parallel Benchmarks 3.0 OpenMP C version☆50Updated 10 years ago
- Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK.☆38Updated 3 years ago
- ☆52Updated 5 years ago
- ☆11Updated 4 years ago
- bogo for ASPLOS'19☆9Updated 5 years ago
- ☆28Updated 2 years ago