lightsighter / Weft
A Sound and Complete Verification Tool for Warp-Specialized GPU Kernels
☆19Updated 10 years ago
Alternatives and similar repositories for Weft
Users who are interested in Weft are comparing it to the repositories listed below:
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri…☆21Updated 2 years ago
- Loop Kernel Analysis and Performance Modeling Toolkit☆95Updated 6 months ago
- tools to create performance and roofline plots from measured data☆59Updated 11 years ago
- GPU Code optimizer for stencil computations. Refer to our IPDPS'19 paper for more details☆23Updated 5 years ago
- A tool for debugging and assessing floating point precision and reproducibility.☆85Updated 2 months ago
- The SparseX sparse kernel optimization library☆41Updated 6 years ago
- A unified framework across multiple programming platforms☆41Updated 3 months ago
- sparse matrix pre-processing library☆83Updated last year
- MPI benchmark to test and measure collective performance☆52Updated 4 years ago
- JUPITER Benchmark Suite☆20Updated 2 months ago
- Benchmark for measuring the performance of sparse and irregular memory access.☆79Updated last month
- A light-weight MPI profiler.☆97Updated last year
- XSBench: The Monte Carlo Macroscopic Cross Section Lookup Benchmark☆83Updated last year
- A BUDE virtual-screening benchmark, in many programming models☆29Updated 11 months ago
- Instantiate the Cache Aware Roofline Model on single socket and multisocket systems.☆27Updated 6 years ago
- A dynamic analysis tool to detect floating-point errors in HPC applications.☆36Updated 3 weeks ago
- The SHOC Benchmark Suite☆257Updated 3 years ago
- Julia ports of the Rodinia benchmark suite for heterogeneous computing infrastructures☆54Updated 2 years ago
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear …☆79Updated last month
- Multiple 1-stencil implementations using NVIDIA CUDA.☆13Updated 7 years ago
- A task benchmark☆43Updated last year
- Integrated Performance Monitoring for High Performance Computing☆90Updated 3 years ago
- High-performance, GPU-aware communication library☆86Updated 8 months ago
- Kernel Tuning Toolkit☆64Updated 2 months ago
- Chai☆45Updated last year
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆110Updated 2 years ago
- Training examples for SYCL☆49Updated last month
- RAJA Performance Suite☆123Updated this week
- The OpenDwarfs project provides a benchmark suite consisting of different computation/communication idioms, i.e., dwarfs, for state-of-ar…☆98Updated 6 years ago
- Online CUDA Occupancy Calculator☆80Updated 3 years ago