cyanguwa / nersc-roofline
☆49 · Sep 5, 2020 · Updated 5 years ago
Alternatives and similar repositories for nersc-roofline
Users who are interested in nersc-roofline are comparing it to the repositories listed below
- A simple script to plot the Roofline model for given HW platforms and applications ☆10 · Aug 22, 2024 · Updated last year
- ☆12 · Aug 4, 2025 · Updated 6 months ago
- EPOCH Input System Version 2 ☆10 · Jun 5, 2020 · Updated 5 years ago
- OpenMP offload playground ☆10 · Nov 16, 2024 · Updated last year
- Forked from https://bitbucket.org/berkeleylab/cs-roofline-toolkit/src/master/ ☆25 · Jun 14, 2019 · Updated 6 years ago
- OpenMP Course at AUTh examples ☆14 · Dec 29, 2025 · Updated last month
- Reference implementation for the climate segmentation benchmark, based on the Exascale Deep Learning for Climate Analytics work ☆10 · May 6, 2020 · Updated 5 years ago
- ExaWorks SDK ☆11 · Feb 1, 2024 · Updated 2 years ago
- ☆11 · Apr 3, 2023 · Updated 2 years ago
- Evaluating different memory managers for dynamic GPU memory ☆26 · Dec 16, 2020 · Updated 5 years ago
- Simple, lightweight transformers in Fortran ☆17 · Nov 17, 2023 · Updated 2 years ago
- Arrow Matrix Decomposition - Communication-Efficient Distributed Sparse Matrix Multiplication ☆15 · Mar 25, 2024 · Updated last year
- ☆11 · Jun 29, 2021 · Updated 4 years ago
- Absinthe is an optimization framework to fuse and tile stencil codes in one shot ☆14 · Jul 17, 2019 · Updated 6 years ago
- Instantiate the Cache Aware Roofline Model on single socket and multisocket systems. ☆27 · Feb 22, 2019 · Updated 6 years ago
- The ultimate bandwidth benchmark ☆60 · Dec 16, 2025 · Updated 2 months ago
- General Purpose Graphics Processing Unit (GPGPU) IP Core ☆11 · Jul 4, 2014 · Updated 11 years ago
- ☆15 · Mar 14, 2022 · Updated 3 years ago
- An Attention Superoptimizer ☆22 · Jan 20, 2025 · Updated last year
- ☆17 · Apr 8, 2021 · Updated 4 years ago
- Automatically exported from code.google.com/p/patus ☆16 · Sep 3, 2015 · Updated 10 years ago
- OpenMM Metal Plugin ☆18 · Aug 20, 2024 · Updated last year
- ☆15 · Dec 16, 2021 · Updated 4 years ago
- SmartNIC ☆14 · Dec 13, 2018 · Updated 7 years ago
- hosted by HPC System Test Working Group collaboration ☆17 · Jan 13, 2026 · Updated last month
- ☆42 · Jun 3, 2024 · Updated last year
- OpenCL memory benchmark ☆15 · Dec 21, 2016 · Updated 9 years ago
- ☆14 · Sep 27, 2021 · Updated 4 years ago
- CAKE Library for constant-bandwidth matrix multiplication on CPUs ☆14 · Apr 6, 2024 · Updated last year
- gcc plugin to discover optimization passes used during compilation ☆20 · Feb 10, 2021 · Updated 5 years ago
- Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template… ☆366 · Jul 31, 2024 · Updated last year
- ☆80 · Jan 6, 2026 · Updated last month
- Scripts for building libraries with Cray's PE ☆21 · Aug 31, 2021 · Updated 4 years ago
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure. ☆19 · May 12, 2024 · Updated last year
- A highly efficient library for GEMM operations on Sunway TaihuLight ☆18 · Sep 7, 2020 · Updated 5 years ago
- Contains reference architecture scripts for running the OpenPiton regression using auto-scaling SLURM cluster. ☆24 · Dec 1, 2025 · Updated 2 months ago
- This is the public repo for the MLPerf DeepCAM climate data segmentation proposal. ☆16 · Sep 30, 2025 · Updated 4 months ago
- C++ Library for Object-oriented Programming with Structure of Arrays Layout ☆21 · May 5, 2018 · Updated 7 years ago
- Multiplication using AVX512 and AVX512IFMA instructions ☆23 · Nov 9, 2015 · Updated 10 years ago