CharithYMendis / Helium
Helium: Lifting High-Performance Stencil Kernels from Stripped x86 Binaries to Halide DSL Code
☆46 Updated 9 years ago
Alternatives and similar repositories for Helium
Users who are interested in Helium are comparing it to the libraries listed below.
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group ☆77 Updated 4 years ago
- ☆75 Updated last year
- a heterogeneous multiGPU level-3 BLAS library ☆45 Updated 5 years ago
- CL Offline Compiler : Compile OpenCL kernels to HSAIL ☆50 Updated 8 years ago
- an assembler/compiler for AMD’s GCN (Generation Core Next Architecture) Assembly Language ☆41 Updated 2 years ago
- A domain-specific language and compiler for image processing ☆76 Updated 4 years ago
- Intel Heterogeneous Research Compiler (iHRC) ☆25 Updated 2 years ago
- A fast and highly scalable GPU dynamic memory allocator ☆104 Updated 10 years ago
- Scientific library for high-precision computations and research ☆49 Updated 7 years ago
- A portable high-level API with CUDA or OpenCL back-end ☆54 Updated 7 years ago
- A CUDA implementation of the PageRank Pipeline Benchmark ☆32 Updated 8 years ago
- GPUVerify: a Verifier for GPU Kernels ☆62 Updated 2 years ago
- Mirror kept for legacy. Moved to https://github.com/llvm/llvm-project ☆25 Updated 5 years ago
- Tools for parsing, assembling, and disassembling HSAIL. ☆73 Updated 5 years ago
- Enable Polyhedral JIT compilation ☆9 Updated 6 years ago
- A Lightweight Graph Processing Framework for Multi-GPUs ☆14 Updated 10 years ago
- MIOpenGEMM is now deprecated ☆62 Updated last year
- [deprecated] Reference Implementation of OpenSHMEM on GASNet (specification <= 1.3) ☆43 Updated 7 years ago
- Fork of magma to include more BLAS ☆28 Updated 8 years ago
- Computing Language Utility ☆72 Updated 8 years ago
- A kernel module to support SSD-to-GPU direct DMA ☆126 Updated 8 years ago
- Checks to verify the usage of the MPI API in C and C++ code, based on Clang’s Static Analyzer and Clang-Tidy. ☆38 Updated 10 months ago
- A framework that helps implementing swizzle GPU kernels ☆42 Updated 5 years ago
- Compute applications. ☆24 Updated 5 years ago
- Python bindings for libNVVM ☆37 Updated 11 years ago
- Base code and optimized code for the benchmarks used in the PolyMage paper published at ASPLOS 2015 ☆19 Updated 9 years ago
- ☆88 Updated 5 years ago
- Intel(R) Concurrent Collections for C++ ☆115 Updated 2 years ago
- CUDA and OpenMP implementations of C2R/R2C inplace transposition ☆46 Updated 10 years ago
- Flexible GPGPU instrumentation ☆87 Updated 5 years ago