CharithYMendis / Helium
Helium: Lifting High-Performance Stencil Kernels from Stripped x86 Binaries to Halide DSL Code
☆45 · Updated 9 years ago
Alternatives and similar repositories for Helium:
Users who are interested in Helium are comparing it to the libraries listed below.
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group ☆76 · Updated 4 years ago
- a heterogeneous multiGPU level-3 BLAS library ☆45 · Updated 5 years ago
- Intel Heterogeneous Research Compiler (iHRC) ☆25 · Updated 2 years ago
- GPU Automatically Tuned Linear Algebra Software ☆28 · Updated 9 years ago
- A domain-specific language and compiler for image processing ☆76 · Updated 4 years ago
- Scientific library for high-precision computations and research ☆49 · Updated 7 years ago
- Fork of magma to include more BLAS ☆28 · Updated 8 years ago
- ☆75 · Updated last year
- A C++ expression -> x86 JIT ☆18 · Updated 8 years ago
- The cilkplus/llvm repo implements the Intel Cilk Plus language extensions to C and C++ in LLVM. ☆68 · Updated 9 years ago
- A lightweight C++ framework for vectorizing image-processing code ☆75 · Updated 8 years ago
- [deprecated] Reference Implementation of OpenSHMEM on GASNet (specification <= 1.3) ☆43 · Updated 7 years ago
- Behrooz File System (BFS) ☆54 · Updated 9 years ago
- Benchmarking matrix multiplication implementations ☆98 · Updated 8 years ago
- GPU Optimization and Memory Abstraction Framework ☆32 · Updated 5 years ago
- CL Offline Compiler : Compile OpenCL kernels to HSAIL ☆50 · Updated 7 years ago
- A fast and highly scalable GPU dynamic memory allocator ☆104 · Updated 10 years ago
- Enable Polyhedral JIT compilation ☆9 · Updated 6 years ago
- A portable high-level API with CUDA or OpenCL back-end ☆54 · Updated 7 years ago
- Python bindings for libNVVM ☆37 · Updated 10 years ago
- Facebook's CUDA extensions. ☆283 · Updated 6 years ago
- GCN ISA assembler tool for my GSoC project at Openwall ☆35 · Updated 9 years ago
- Mirror kept for legacy. Moved to https://github.com/llvm/llvm-project ☆25 · Updated 5 years ago
- Intel(R) Concurrent Collections for C++ ☆115 · Updated 2 years ago
- ☆32 · Updated 7 years ago
- finding set bits in large bitmaps ☆15 · Updated 9 years ago
- A Lightweight Graph Processing Framework for Multi-GPUs ☆14 · Updated 9 years ago
- A compiler intermediate representation for image recognition and heterogeneous computing. ☆78 · Updated 8 years ago
- Library wrapper and system-level tracing utilities ☆46 · Updated 8 years ago
- CUDA and OpenMP implementations of C2R/R2C inplace transposition ☆46 · Updated 10 years ago