prg-titech / dynasoar
CUDA Dynamic Memory Allocator for SOA Data Layout
☆35Updated 3 years ago
Alternatives and similar repositories for dynasoar:
Users that are interested in dynasoar are comparing it to the libraries listed below
- LonestarGPU: Irregular algorithms parallelized for GPUs☆33Updated 5 years ago
- ☆48Updated 5 years ago
- mallocMC: Memory Allocator for Many Core Architectures☆53Updated last week
- Benchmark for measuring the performance of sparse and irregular memory access.☆76Updated last week
- Official BOLT Repository☆28Updated 5 months ago
- CUDAAdvisor: a GPU profiling tool☆48Updated 6 years ago
- Data Dependence Analyzer in the Polyhedral Model☆19Updated last year
- CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels☆31Updated 3 years ago
- Evaluating different memory managers for dynamic GPU memory☆24Updated 4 years ago
- 🎃 GPU load-balancing library for regular and irregular computations.☆59Updated 7 months ago
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri…☆20Updated 2 years ago
- ☆17Updated last year
- A library to benchmark CUDA code, similar to google benchmark.☆28Updated 3 years ago
- development repository for the open earth compiler☆79Updated 3 years ago
- Stencil Probe - a stencil microbenchmark☆30Updated 12 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆27Updated 4 months ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆49Updated last year
- OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub…☆56Updated last week
- A unified framework across multiple programming platforms☆35Updated 7 months ago
- Polyhedral Parallel Code Generation (source repository: http://repo.or.cz/ppcg.git)☆121Updated 2 years ago
- A dynamic analysis tool to detect floating-point errors in HPC applications.☆33Updated 2 years ago
- Absinthe is an optimization framework to fuse and tile stencil codes in one shot☆14Updated 5 years ago
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆48Updated this week
- Polyhedral Extraction Tool (source repository: http://repo.or.cz/w/pet.git)☆39Updated 2 years ago
- Library to plot integer sets and maps☆49Updated 8 years ago
- Loop Kernel Analysis and Performance Modeling Toolkit☆91Updated 4 months ago
- Generate simple index ranges in C++ and CUDA C++☆39Updated last year
- Prototype of OpenSHMEM for NVIDIA GPUs, developed as part of DoE Design Forward☆21Updated 6 years ago
- GPU Code optimizer for stencil computations. Refer to our IPDPS'19 paper for more details☆23Updated 5 years ago
- Data-Centric MLIR dialect☆40Updated last year