Instanciate the Cache Aware Roofline Model on single socket and multisocket systems.
☆27Feb 22, 2019Updated 7 years ago
Alternatives and similar repositories for Locality-Aware-Roofline-Model
Users that are interested in Locality-Aware-Roofline-Model are comparing it to the libraries listed below
Sorting:
- Extended Roofline Model - LLVM source tree with additional libraries for the analysis of the dynamic execution in the interpreter☆17Jul 5, 2017Updated 8 years ago
- Process-based Asynchronous Progress Model for MPI Communication☆11Jan 24, 2021Updated 5 years ago
- IBM Platform-Independent Software Analysis☆14Mar 12, 2018Updated 7 years ago
- A fast and scalable x86-64 multicore simulator☆31Mar 16, 2021Updated 4 years ago
- Roofline prototype for Arm☆14Mar 25, 2024Updated last year
- Hardware performance counter tool for Windows OS☆18Sep 4, 2018Updated 7 years ago
- Virtualization Layer for the MPI Profiling Interface☆22Apr 30, 2022Updated 3 years ago
- code for examining determinism of performance counters☆21Mar 18, 2021Updated 4 years ago
- Wrapper library for model-specific registers. APIs cover RAPL, performance counters, clocks and turbo.☆52Feb 16, 2023Updated 3 years ago
- tools to create performance and roofline plots from measured data☆61Jun 10, 2014Updated 11 years ago
- This material contains content on how to profile and optimize simple Pytorch mnist code using NVIDIA Nsight Systems and Pytorch Profiler☆21Feb 5, 2026Updated last month
- A survey on architectural simulators focused on CPU caches.☆16Feb 8, 2020Updated 6 years ago
- A binary instrumentation tool to analyze load instructions in any off-the-shelf x86(-64) program. Described by Bera et al. in https://arx…☆24Jun 30, 2024Updated last year
- C++ double-to-string conversion benchmark☆27Mar 2, 2026Updated last week
- Program analysis tool based on software performance counters☆57May 13, 2021Updated 4 years ago
- This is the public repo for the MLPerf DeepCAM climate data segmentation proposal.☆16Sep 30, 2025Updated 5 months ago
- 📝 "Synthesizing Benchmarks for Predictive Modeling" (🥇 CGO'17 Best Paper)☆22Feb 10, 2023Updated 3 years ago
- ☆49Sep 5, 2020Updated 5 years ago
- The Splash-3 benchmark suite☆45Apr 24, 2023Updated 2 years ago
- a Pin tool for collecting microarchitecture-independent workload characteristics☆62Feb 12, 2024Updated 2 years ago
- The ultimate bandwidth benchmark☆61Dec 16, 2025Updated 2 months ago
- Forked from https://bitbucket.org/berkeleylab/cs-roofline-toolkit/src/master/☆25Jun 14, 2019Updated 6 years ago
- Allocation benchmarks☆31Jul 6, 2016Updated 9 years ago
- Memory System Microbenchmarks☆65Feb 9, 2023Updated 3 years ago
- NAS Parallel Benchmark Kernels in C/C++. The parallel versions are in FastFlow, TBB, and OpenMP.☆22Jul 29, 2021Updated 4 years ago
- COBAYN: Compiler Autotuning Framework Using Bayesian Networks☆20May 9, 2022Updated 3 years ago
- ARTICo³ - Dynamic and Partially Reconfigurable Architecture for Run-Time Adaptive, High Performance Embedded Computing☆12Sep 10, 2024Updated last year
- Interactive Visualization of Memory Access Samples☆23Sep 10, 2022Updated 3 years ago
- NumaMMA is a lightweight memory profiler for parallel applications☆30Jun 10, 2025Updated 8 months ago
- An easy to use tool to stream hardware performance counters data as CSV☆31May 7, 2024Updated last year
- Official BOLT Repository☆32Aug 16, 2024Updated last year
- A low-overhead tool to periodically collect system-wide hardware performance counters on Intel64 systems.☆32Aug 2, 2022Updated 3 years ago
- Artifact, reproducibility, and testing utilites for gem5☆23Jul 1, 2021Updated 4 years ago
- pmu event analysis package☆80Dec 11, 2025Updated 2 months ago
- Performance engineering for the rest of us.☆31Oct 3, 2025Updated 5 months ago
- Sparse Matrix Factorization (SMF) is a key component in many machine learning problems and there exist a verity a applications in real-w…☆11Jan 25, 2016Updated 10 years ago
- This place provide different SRAM cells netlist to be simulated with HSpice tool in sub-20nm FinFET technologies.☆12Dec 31, 2020Updated 5 years ago
- A mutation testing tool designed to work with large C++ (and C) codebases.☆13Oct 28, 2025Updated 4 months ago
- TreeFuser is a tool that perform traversals fusion for recursive tree traversals written in subset of the c++ language.☆11Aug 13, 2023Updated 2 years ago