lightsighter / Weft
A Sound and Complete Verification Tool for Warp-Specialized GPU Kernels
☆19 · Updated 10 years ago
Alternatives and similar repositories for Weft
Users who are interested in Weft are comparing it to the libraries listed below.
- Loop Kernel Analysis and Performance Modeling Toolkit ☆95 · Updated 6 months ago
- Chai ☆45 · Updated last year
- Tools to create performance and roofline plots from measured data ☆59 · Updated 11 years ago
- CUDA Flux is a profiler for GPU applications which reports the basic block execution frequencies of compute kernels ☆32 · Updated 4 years ago
- LonestarGPU: Irregular algorithms parallelized for GPUs ☆38 · Updated 5 years ago
- Flexible GPGPU instrumentation ☆88 · Updated 6 years ago
- A task benchmark ☆44 · Updated last year
- The SHOC Benchmark Suite ☆257 · Updated last week
- XSBench: The Monte Carlo Macroscopic Cross Section Lookup Benchmark ☆83 · Updated last year
- A tool for debugging and assessing floating point precision and reproducibility. ☆88 · Updated 3 months ago
- ☆62 · Updated last year
- JUPITER Benchmark Suite ☆20 · Updated 2 months ago
- Benchmark for measuring the performance of sparse and irregular memory access. ☆79 · Updated last month
- Parallel Tensor Infrastructure (ParTI!) ☆30 · Updated 5 years ago
- Kernel Tuning Toolkit ☆65 · Updated last week
- Online CUDA Occupancy Calculator ☆80 · Updated 4 years ago
- ☆93 · Updated 8 years ago
- A unified framework across multiple programming platforms ☆41 · Updated 4 months ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth ☆107 · Updated 8 years ago
- TLB Benchmarks ☆34 · Updated 8 years ago
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri… ☆20 · Updated 2 years ago
- 🎃 GPU load-balancing library for regular and irregular computations. ☆62 · Updated last month
- A GPU algorithm for sparse matrix-matrix multiplication ☆72 · Updated 5 years ago
- CSR-based SpGEMM on nVidia and AMD GPUs ☆46 · Updated 9 years ago
- GPU Code optimizer for stencil computations. Refer to our IPDPS'19 paper for more details ☆23 · Updated 6 years ago
- A Benchmark Suite for Heterogeneous System Computation ☆54 · Updated 7 months ago
- The OpenDwarfs project provides a benchmark suite consisting of different computation/communication idioms, i.e., dwarfs, for state-of-ar… ☆98 · Updated 6 years ago
- BLAS implementation for Intel FPGA ☆77 · Updated 4 years ago
- A light-weight MPI profiler. ☆98 · Updated 2 weeks ago
- Development repository for the open earth compiler ☆80 · Updated 4 years ago