shoaibkamil / stencilprobe
Stencil Probe - a stencil microbenchmark
☆30Updated 12 years ago
Alternatives and similar repositories for stencilprobe:
Users that are interested in stencilprobe are comparing it to the libraries listed below
- CUDAAdvisor: a GPU profiling tool☆48Updated 6 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆28Updated 6 months ago
- Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019☆55Updated 2 years ago
- development repository for the open earth compiler☆79Updated 4 years ago
- ☆53Updated 5 years ago
- A GPU FP32 computation method with Tensor Cores.☆20Updated 2 years ago
- GPU Code optimizer for stencil computations. Refer to our IPDPS'19 paper for more details☆24Updated 5 years ago
- Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK.☆38Updated 3 years ago
- LonestarGPU: Irregular algorithms parallelized for GPUs☆34Updated 5 years ago
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri…☆20Updated 2 years ago
- A low-overhead tool to periodically collect system-wide hardware performance counters on Intel64 systems.☆33Updated 2 years ago
- XSBench: The Monte Carlo Macroscopic Cross Section Lookup Benchmark☆78Updated last year
- Evaluating different memory managers for dynamic GPU memory☆25Updated 4 years ago
- TLB Benchmarks☆33Updated 7 years ago
- The Splash-3 benchmark suite☆43Updated last year
- Instanciate the Cache Aware Roofline Model on single socket and multisocket systems.☆27Updated 6 years ago
- ☆34Updated 3 years ago
- Ocolos is the first online code layout optimization system for unmodified applications written in unmanaged languages.☆52Updated last year
- Multi-GPU dynamic scheduler using PGAS style cross-GPU communication☆28Updated last year
- An attempt at achieving the theoretical best memory bandwidth of my machine.☆53Updated 11 years ago
- Polyhedral Parallel Code Generation (source repository: http://repo.or.cz/ppcg.git)☆124Updated 2 years ago
- Pointer-chasing memory benchmark (forked from Doug Pase's code).☆59Updated 11 years ago
- Prototype of OpenSHMEM for NVIDIA GPUs, developed as part of DoE Design Forward☆23Updated 6 years ago
- Official BOLT Repository☆28Updated 7 months ago
- GPU Performance Advisor☆64Updated 2 years ago
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources☆110Updated 2 years ago
- Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite☆64Updated 6 years ago
- Chai☆43Updated last year
- A Synchronization-Free Algorithm for Parallel Sparse Triangular Solves (SpTRSV)☆21Updated 5 years ago
- A task benchmark☆41Updated 7 months ago