ebugger / Empirical-Roofline-Toolkit
Forked from https://bitbucket.org/berkeleylab/cs-roofline-toolkit/src/master/
☆17Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for Empirical-Roofline-Toolkit
- ☆41Updated 4 years ago
- A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions.☆45Updated last month
- ☆20Updated 2 years ago
- Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite☆60Updated 6 years ago
- Benchmark for measuring the performance of sparse and irregular memory access.☆75Updated this week
- Measure instruction latency and throughput☆22Updated 2 years ago
- tools to create performance and roofline plots from measured data☆58Updated 10 years ago
- A Micro-benchmarking Tool for HPC Networks☆22Updated 3 weeks ago
- Chai☆42Updated 11 months ago
- Instanciate the Cache Aware Roofline Model on single socket and multisocket systems.☆27Updated 5 years ago
- SST Structural Simulation Toolkit Parallel Discrete Event Core and Services☆132Updated last week
- GPUDirect example☆57Updated 3 years ago
- Provides a set of benchmarks that can be used to measure the memory bandwidth performance of CPU's☆81Updated 7 months ago
- Implementation of TSM2L and TSM2R -- High-Performance Tall-and-Skinny Matrix-Matrix Multiplication Algorithms for CUDA☆31Updated 4 years ago
- ☆66Updated 4 years ago
- ☆33Updated 2 years ago
- Performance Prediction Toolkit☆51Updated 3 years ago
- C++/MPI proxies for distributed training of deep neural networks.☆12Updated 2 years ago
- A benchmarking suite for heterogeneous systems. The primary goal of this project is to improve and update aspects of existing benchmarkin…☆40Updated 8 months ago
- This serves as a repository for reproducibility of the SC21 paper "In-Depth Analyses of Unified Virtual Memory System for GPU Accelerated…☆27Updated last year
- CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels☆31Updated 3 years ago
- ☆37Updated 3 years ago
- CSR5-based SpMV on CPUs, GPUs and Xeon Phi☆95Updated 5 months ago
- SST Architectural Simulation Components and Libraries☆92Updated last week
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆43Updated last month
- Advanced Profiling and Analytics for AMD Hardware☆137Updated this week
- Dissecting NVIDIA GPU Architecture☆82Updated 2 years ago
- ☆58Updated last month
- NCCL Examples from Official NVIDIA NCCL Developer Guide.☆13Updated 6 years ago
- A LogGOPS (LogP, LogGP, LogGPS) Simulator and Simulation Framework☆11Updated 3 months ago