CharithYMendis / HeliumLinks

Helium: Lifting High-Performance Stencil Kernels from Stripped x86 Binaries to Halide DSL Code

☆47

Alternatives and similar repositories for Helium

Users that are interested in Helium are comparing it to the libraries listed below

Sorting:

nvidia-compiler-sdk / nvvmir-samples
☆75Updated 2 years ago
nvidia-compiler-sdk / pynvvm
Python bindings for libNVVM
☆37Updated 11 years ago
kaigai / nvme-kmod
A kernel module to support SSD-to-GPU direct DMA
☆126Updated 8 years ago
Maratyszcza / FPplus
Scientific library for high-precision computations and research
☆49Updated 7 years ago
Xilinx / triSYCL
Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group
☆77Updated 4 years ago
attractivechaos / matmul
Benchmarking matrix multiplication implementations
☆100Updated 8 years ago
cilkplus / llvm
The cilkplus/llvm repo implements the Intel Cilk Plus language extensions to C and C++ in LLVM.
☆69Updated 9 years ago
maxhutch / magma
Fork of magma to include more BLAS
☆28Updated 8 years ago
wsmoses / Tapir-LLVM
Tapir extension to LLVM for optimizing Parallel Programs
☆135Updated 5 years ago
linnanwang / BLASX
a heterogeneous multiGPU level-3 BLAS library
☆45Updated 5 years ago
balidani / gcnasm
GCN ISA assembler tool for my GSoC project at Openwall
☆35Updated 9 years ago
PolyJIT / polli
Enable Polyhedral JIT compilation
☆9Updated 6 years ago
AnyDSL / anydsl
Meta project to quickly build dependencies
☆102Updated 2 months ago
ChrisCummins / paper-synthesizing-benchmarks
📝 "Synthesizing Benchmarks for Predictive Modeling" (🥇 CGO'17 Best Paper)
☆22Updated 2 years ago
bthies / streamit
The StreamIt compiler infrastructure.
☆71Updated 8 years ago
scotts / streamflow
Lock-free multithreaded memory allocation
☆106Updated 8 years ago
canonizer / halloc
A fast and highly scalable GPU dynamic memory allocator
☆104Updated 10 years ago
s-kanev / XIOSim
A detailed michroarchitectural x86 simulator
☆62Updated 8 years ago
biometrics / likely
A compiler intermediate representation for image recognition and heterogeneous computing.
☆78Updated 9 years ago
bshafiee / BFS
Behrooz File System (BFS)
☆54Updated 9 years ago
SunsetQuest / Asm4GCN
an assembler/compiler for AMD’s GCN (Generation Core Next Architecture) Assembly Language
☆41Updated 2 years ago
facebookarchive / thpp
TH++, C++ interface to the torch7 TH library
☆238Updated 7 years ago
icnc / icnc
Intel(R) Concurrent Collections for C++
☆116Updated 2 years ago
hughperkins / EasyCL
Easy to run kernels using OpenCL
☆185Updated 3 months ago
CNugteren / CLCudaAPI
A portable high-level API with CUDA or OpenCL back-end
☆54Updated 7 years ago
facebookarchive / fbcuda
Facebook's CUDA extensions.
☆285Updated 6 years ago
cjang / GATLAS
GPU Automatically Tuned Linear Algebra Software
☆28Updated 9 years ago
columbia / libtrack
Library wrapper and system-level tracing utilities
☆47Updated 8 years ago
art4711 / bmap_find
finding set bits in large bitmaps
☆15Updated 9 years ago
llvm-mirror / polly
Mirror kept for legacy. Moved to https://github.com/llvm/llvm-project
☆88Updated 5 years ago