tue-es / bones
Research compiler based on algorithmic skeletons
☆21 Updated 10 years ago
Related projects
Alternatives and complementary repositories for bones
- A domain-specific language and compiler for image processing ☆76 Updated 3 years ago
- LonestarGPU: Irregular algorithms parallelized for GPUs ☆33 Updated 4 years ago
- ☆28 Updated this week
- mallocMC: Memory Allocator for Many Core Architectures ☆50 Updated this week
- A C/C++ task-based programming model for shared memory and distributed parallel computing. ☆71 Updated 4 years ago
- A GPU cache model for research purposes ☆26 Updated 11 years ago
- This package includes the implementation for Sparse-Matrix-Vector-Multiplication (SpMV) and Sparse-Matrix-Matrix-Multiplication (SpMM) fo… ☆10 Updated 4 years ago
- Stencil Probe - a stencil microbenchmark ☆29 Updated 11 years ago
- Heterogeneous Active Messages C++ library ☆21 Updated 5 years ago
- GPU Optimization and Memory Abstraction Framework ☆32 Updated 5 years ago
- Compiler for Fortran stencils using verified lifting ☆17 Updated 2 years ago
- ☆68 Updated 4 years ago
- Compute applications. ☆25 Updated 4 years ago
- Evaluating different memory managers for dynamic GPU memory ☆24 Updated 3 years ago
- A graphics tracing and replay framework to explore system-level effects on heterogeneous CPU+GPU memory systems. ☆14 Updated 6 years ago
- A Sound and Complete Verification Tool for Warp-Specialized GPU Kernels ☆18 Updated 9 years ago
- A fast and highly scalable GPU dynamic memory allocator ☆103 Updated 9 years ago
- Whippletree, a novel approach to scheduling dynamic, irregular workloads on the GPU ☆21 Updated 8 years ago
- Sparse matrix pre-processing library ☆81 Updated 6 months ago
- This repository contains components that will support percolation via OpenCL and CUDA ☆32 Updated 2 years ago
- A thin wrapper around MIOpen and cuDNN ☆38 Updated last year
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources ☆98 Updated last year
- A task benchmark ☆39 Updated 3 months ago
- Scalable GPU Kernel Fission/Fusion Transformation for Memory-Bound Kernels ☆13 Updated 9 years ago
- Codeplay project for contributions to the LLVM SYCL implementation ☆30 Updated 3 years ago
- StarPU Runtime system ☆16 Updated 14 years ago
- Library which simplifies host-GPU data transfer using userspace pagefault handling ☆15 Updated 12 years ago
- A Benchmark Suite for Heterogeneous System Computation ☆52 Updated last week
- Boost.org graph_parallel module ☆27 Updated 2 months ago
- Lock-free parallel disjoint set data structure (aka UNION-FIND) with path compression and union by rank ☆61 Updated 9 years ago