abduld / libwb
☆88 Updated 5 years ago
Related projects
Alternatives and complementary repositories for libwb
- ☆75 Updated last year
- A fast and highly scalable GPU dynamic memory allocator ☆103 Updated 9 years ago
- Intel Heterogeneous Research Compiler (iHRC) ☆25 Updated last year
- Fork of magma to include more BLAS ☆28 Updated 7 years ago
- Resources to work offline on the assignments of the Heterogeneous Parallel Programming course from Coursera. ☆71 Updated 5 years ago
- TTC: A high-performance Compiler for Tensor Transpositions ☆20 Updated 7 years ago
- Full-speed Array of Structures access ☆160 Updated last year
- a heterogeneous multiGPU level-3 BLAS library ☆45 Updated 4 years ago
- A domain-specific language and compiler for image processing ☆76 Updated 3 years ago
- GPU Optimization and Memory Abstraction Framework ☆32 Updated 5 years ago
- an approximate compiler ☆37 Updated 4 years ago
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group ☆76 Updated 3 years ago
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources ☆98 Updated last year
- Kernel Fusion and Runtime Compilation Based on NNVM ☆69 Updated 7 years ago
- Documentation for StreamExecutor open source proposal ☆83 Updated 8 years ago
- GPU-specialized parameter server for GPU machine learning. ☆100 Updated 6 years ago
- Tools for parsing, assembling, and disassembling HSAIL. ☆70 Updated 4 years ago
- The StreamIt compiler infrastructure. ☆70 Updated 8 years ago
- Flexible GPGPU instrumentation ☆86 Updated 5 years ago
- library which simplifies host-GPU data transfer using userspace pagefault handling ☆15 Updated 12 years ago
- ☆32 Updated 7 years ago
- A NUMA-aware Graph-structured Analytics Framework ☆42 Updated 6 years ago
- Benchmarking matrix multiplication implementations ☆98 Updated 8 years ago
- Base code and optimized code for the benchmarks used in the PolyMage paper published at ASPLOS 2015 ☆18 Updated 8 years ago
- Intel(R) Machine Learning Scaling Library is a library providing an efficient implementation of communication patterns used in deep learn… ☆109 Updated last year
- CL Offline Compiler : Compile OpenCL kernels to HSAIL ☆49 Updated 7 years ago
- sparse matrix pre-processing library ☆81 Updated 6 months ago