canonizer / libgpuvm
library which simplifies host-GPU data transfer using userspace pagefault handling
☆15Updated 12 years ago
Related projects ⓘ
Alternatives and complementary repositories for libgpuvm
- Intel Heterogeneous Research Compiler (iHRC)☆25Updated last year
- Library to program with streams, events, and to queue own functions into a stream.☆16Updated 4 months ago
- ☆75Updated last year
- OpenCL tool to detect buffer overflows in GPU kernels☆20Updated 5 years ago
- Tools for parsing, assembling, and disassembling HSAIL.☆70Updated 4 years ago
- code for examining determinism of performance counters☆21Updated 3 years ago
- A fast and highly scalable GPU dynamic memory allocator☆103Updated 9 years ago
- Vectorized intersections (research code)☆14Updated 7 years ago
- Extended Roofline Model - LLVM source tree with additional libraries for the analysis of the dynamic execution in the interpreter☆17Updated 7 years ago
- Enable Polyhedral JIT compilation☆9Updated 6 years ago
- Automatically exported from code.google.com/p/freeocl☆31Updated 6 years ago
- mallocMC: Memory Allocator for Many Core Architectures☆50Updated this week
- Mallacc: Accelerating Memory Allocation☆13Updated 6 years ago
- IMPORTANT NOTICE: This implementation is long outdated. The new libwfv will be released soon. Whole-Function Vectorization is an algorith…☆22Updated 12 years ago
- Compute applications.☆25Updated 4 years ago
- ☆68Updated 4 years ago
- DSL for stencils and image processing☆13Updated 8 years ago
- Mirror kept for legacy. Moved to https://github.com/llvm/llvm-project☆25Updated 5 years ago
- Allocation benchmarks☆30Updated 8 years ago
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources☆98Updated last year
- A domain-specific language and compiler for image processing☆76Updated 3 years ago
- Library with JIT (Just-in-time) compilation support to optimize performance of small and medium matrix multiplication☆12Updated 3 years ago
- nvptx-tools: a collection of tools for use with nvptx-none GCC toolchains.☆46Updated 2 months ago
- Sample programs for the LLVM PTX back-end☆34Updated 9 years ago
- A Sound and Complete Verification Tool for Warp-Specialized GPU Kernels☆18Updated 9 years ago
- Heterogeneous Active Messages C++ library☆21Updated 5 years ago
- NUMAPROF is a NUMA memory profliler based on Pintool to track your remote memory accesses.☆45Updated 4 months ago
- LonestarGPU: Irregular algorithms parallelized for GPUs☆33Updated 4 years ago
- ROCm - AMDGPU Compute Application Binary Interface☆40Updated 2 years ago
- This repository contains my experiments with compression-related algorithms☆35Updated 8 years ago