Instantiate the Cache-Aware Roofline Model on single-socket and multi-socket systems.
☆27 · Feb 22, 2019 · Updated 7 years ago
Alternatives and similar repositories for Locality-Aware-Roofline-Model
Users that are interested in Locality-Aware-Roofline-Model are comparing it to the libraries listed below.
- Extended Roofline Model - LLVM source tree with additional libraries for the analysis of the dynamic execution in the interpreter ☆17 · Jul 5, 2017 · Updated 8 years ago
- tools to create performance and roofline plots from measured data ☆61 · Jun 10, 2014 · Updated 11 years ago
- Roofline prototype for Arm ☆14 · Mar 25, 2024 · Updated 2 years ago
- Process-based Asynchronous Progress Model for MPI Communication ☆11 · Jan 24, 2021 · Updated 5 years ago
- ARTICo³ - Dynamic and Partially Reconfigurable Architecture for Run-Time Adaptive, High Performance Embedded Computing ☆12 · Sep 10, 2024 · Updated last year
- Virtualization Layer for the MPI Profiling Interface ☆22 · Apr 30, 2022 · Updated 3 years ago
- IBM Platform-Independent Software Analysis ☆14 · Mar 12, 2018 · Updated 8 years ago
- Wrapper library for model-specific registers. APIs cover RAPL, performance counters, clocks and turbo. ☆52 · Feb 16, 2023 · Updated 3 years ago
- This is the public repo for the MLPerf DeepCAM climate data segmentation proposal. ☆16 · Sep 30, 2025 · Updated 6 months ago
- A fast and scalable x86-64 multicore simulator ☆31 · Mar 16, 2021 · Updated 5 years ago
- A low-overhead tool to periodically collect system-wide hardware performance counters on Intel64 systems. ☆32 · Aug 2, 2022 · Updated 3 years ago
- 📝 "Synthesizing Benchmarks for Predictive Modeling" (🥇 CGO'17 Best Paper) ☆22 · Feb 10, 2023 · Updated 3 years ago
- CMU 15-745 Spring 2014 ☆10 · Mar 7, 2014 · Updated 12 years ago
- LaTeX template for PhD thesis ☆11 · Jun 23, 2015 · Updated 10 years ago
- Interactive Visualization of Memory Access Samples ☆23 · Sep 10, 2022 · Updated 3 years ago
- Instrumentation framework to generate execution traces of the most used parallel runtimes. ☆67 · Updated this week
- Nanos6 is a runtime that implements the OmpSs-2 parallel programming model, developed by the System Tools and Advanced Runtimes (STAR) gr… ☆22 · Jun 6, 2025 · Updated 9 months ago
- This material contains content on how to profile and optimize simple Pytorch mnist code using NVIDIA Nsight Systems and Pytorch Profiler ☆21 · Feb 5, 2026 · Updated last month
- The ultimate bandwidth benchmark ☆62 · Updated this week
- Forked from https://bitbucket.org/berkeleylab/cs-roofline-toolkit/src/master/ ☆25 · Jun 14, 2019 · Updated 6 years ago
- NumaMMA is a lightweight memory profiler for parallel applications ☆30 · Jun 10, 2025 · Updated 9 months ago
- My i3 config and various scripts. Abandoned due to migration to awesome ☆15 · Aug 6, 2024 · Updated last year
- Program analysis tool based on software performance counters ☆56 · May 13, 2021 · Updated 4 years ago
- The mindful appliance builder ☆16 · Nov 24, 2025 · Updated 4 months ago
- Play with CRIU in vagrant, all automated. ☆10 · Oct 11, 2015 · Updated 10 years ago
- ☆49 · Sep 5, 2020 · Updated 5 years ago
- ☆14 · Jan 14, 2022 · Updated 4 years ago
- Code/instructions for various slides/demos I've given ☆12 · Oct 27, 2017 · Updated 8 years ago
- A graphics tracing and replay framework to explore system-level effects on heterogeneous CPU+GPU memory systems. ☆15 · Apr 16, 2018 · Updated 7 years ago
- Hypervisor from scratch in linux ☆13 · May 8, 2022 · Updated 3 years ago
- DOTS - Directed acyclic graph based Online Trajectory Simplification algorithm ☆10 · Aug 7, 2018 · Updated 7 years ago
- A binary instrumentation tool to analyze load instructions in any off-the-shelf x86(-64) program. Described by Bera et al. in https://arx… ☆24 · Jun 30, 2024 · Updated last year
- Memory System Microbenchmarks ☆64 · Feb 9, 2023 · Updated 3 years ago
- Hopscotch: A benchmark suite for memory performance evaluation ☆16 · Apr 8, 2025 · Updated 11 months ago
- A transfer learning-based random forest regression model ☆15 · Aug 25, 2018 · Updated 7 years ago
- HoCL (Higher Order dataflow Coordination Language) is a language for describing dataflow networks and generating tool-specific descriptio… ☆22 · Aug 13, 2021 · Updated 4 years ago
- Hardware performance counter tool for Windows OS ☆18 · Sep 4, 2018 · Updated 7 years ago
- ☆13 · Oct 25, 2024 · Updated last year
- NAS Parallel Benchmark Kernels in C/C++. The parallel versions are in FastFlow, TBB, and OpenMP. ☆22 · Jul 29, 2021 · Updated 4 years ago