CharithYMendis / Helium
Helium: Lifting High-Performance Stencil Kernels from Stripped x86 Binaries to Halide DSL Code
☆46 Updated 9 years ago
Alternatives and similar repositories for Helium
Users who are interested in Helium are comparing it to the libraries listed below.
- ☆75 Updated last year
- Intel Heterogeneous Research Compiler (iHRC) ☆25 Updated 2 years ago
- Enable Polyhedral JIT compilation ☆9 Updated 6 years ago
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group ☆77 Updated 4 years ago
- CL Offline Compiler : Compile OpenCL kernels to HSAIL ☆50 Updated 8 years ago
- Tools for parsing, assembling, and disassembling HSAIL. ☆71 Updated 5 years ago
- A domain-specific language and compiler for image processing ☆76 Updated 4 years ago
- Python bindings for libNVVM ☆37 Updated 11 years ago
- A Lightweight Graph Processing Framework for Multi-GPUs ☆14 Updated 10 years ago
- ☆32 Updated 7 years ago
- [deprecated] Reference Implementation of OpenSHMEM on GASNet (specification <= 1.3) ☆43 Updated 7 years ago
- a heterogeneous multiGPU level-3 BLAS library ☆45 Updated 5 years ago
- library which simplifies host-GPU data transfer using userspace pagefault handling ☆15 Updated 12 years ago
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources ☆111 Updated 3 weeks ago
- an assembler/compiler for AMD’s GCN (Graphics Core Next Architecture) Assembly Language ☆41 Updated 2 years ago
- Mirror kept for legacy. Moved to https://github.com/llvm/llvm-project ☆25 Updated 5 years ago
- A portable high-level API with CUDA or OpenCL back-end ☆54 Updated 7 years ago
- CNNs in Halide ☆23 Updated 9 years ago
- Scientific library for high-precision computations and research ☆49 Updated 7 years ago
- Extended Roofline Model - LLVM source tree with additional libraries for the analysis of the dynamic execution in the interpreter ☆17 Updated 7 years ago
- Some miscellaneous OpenSHMEM examples ☆21 Updated 2 years ago
- Benchmarking matrix multiplication implementations ☆98 Updated 8 years ago
- A fast and highly scalable GPU dynamic memory allocator ☆104 Updated 10 years ago
- GPUVerify: a Verifier for GPU Kernels ☆62 Updated 2 years ago
- npcomp - An aspirational MLIR based numpy compiler ☆51 Updated 4 years ago
- High Efficiency Convolution Kernel for Maxwell GPU Architecture ☆134 Updated 8 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL ☆135 Updated 8 years ago
- Data Parallel Python ☆207 Updated 12 years ago
- Mirror kept for legacy. Moved to https://github.com/llvm/llvm-project ☆87 Updated 5 years ago
- A compiler intermediate representation for image recognition and heterogeneous computing. ☆78 Updated 8 years ago