jdmccalpin / periodic-performance-counters
A low-overhead tool to periodically collect system-wide hardware performance counters on Intel64 systems.
☆31Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for periodic-performance-counters
- Repeated access to L2-containable loops to look for snoop filter conflicts on Intel Skylake Xeon processors.☆29Updated 6 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆27Updated 2 months ago
- Measure instruction latency and throughput☆22Updated 2 years ago
- ☆34Updated 2 years ago
- Hopscotch: A benchmark suite for memory performance evaluation☆15Updated 2 years ago
- ☆58Updated last month
- Instanciate the Cache Aware Roofline Model on single socket and multisocket systems.☆27Updated 5 years ago
- Utilities to measure read access times of caches, memory, and hardware prefetches for simple and fused operations☆74Updated last year
- The Splash-3 benchmark suite☆42Updated last year
- A Shared Memory Multithreaded Graph Benchmark Suite for Multicores☆34Updated 2 years ago
- Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK.☆38Updated 3 years ago
- Pointer-chasing memory benchmark (forked from Doug Pase's code).☆58Updated 10 years ago
- A collection of performance analysis tools, recipes, handy scripts, microbenchmarks & more☆117Updated this week
- An unofficial mirror of the core PARSEC 3.0 benchmark suite with patches to run on x86_64 Arch Linux and generalize builds.☆99Updated 2 years ago
- Creating beautiful gem5 simulations☆45Updated 3 years ago
- SST Architectural Simulation Components and Libraries☆92Updated this week
- A Benchmark Suite for Heterogeneous System Computation☆52Updated 3 weeks ago
- SST Structural Simulation Toolkit Parallel Discrete Event Core and Services☆132Updated this week
- tools to create performance and roofline plots from measured data☆58Updated 10 years ago
- ☆113Updated 3 months ago
- A BarrierPoint implementation: Automatically select representative regions of parallel applications☆14Updated 8 years ago
- Allows safer access to model specific registers (MSRs)☆92Updated 3 weeks ago
- NumaMMA is a lightweight memory profiler for parallel applications☆25Updated 7 months ago
- a Pin tool for collecting microarchitecture-independent workload characteristics☆59Updated 9 months ago
- ☆47Updated 5 years ago
- Chai☆42Updated 11 months ago
- Haystack is an analytical cache model that given a program computes the number of cache misses.☆42Updated 5 years ago
- PARSEC Benchmark http://parsec.cs.princeton.edu 3.0-beta-20150206 ported to Ubuntu 22.04 and with proper version control and SPLASH2 port…☆85Updated this week
- GPUDirect Async support for IB Verbs☆90Updated 2 years ago
- The ultimate memory bandwidth benchmark☆46Updated last year