abduld / libwbLinks
☆87Updated 6 years ago
Alternatives and similar repositories for libwb
Users that are interested in libwb are comparing it to the libraries listed below
Sorting:
- Facebook's CUDA extensions.☆284Updated 6 years ago
- ☆74Updated 2 years ago
- a heterogeneous multiGPU level-3 BLAS library☆46Updated 6 years ago
- Intel(R) Concurrent Collections for C++☆115Updated 3 years ago
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group☆78Updated 5 years ago
- CUDA Data Parallel Primitives Library☆438Updated 7 years ago
- A fast and highly scalable GPU dynamic memory allocator☆112Updated 10 years ago
- Full-speed Array of Structures access☆176Updated 2 years ago
- GraphMat graph analytics framework☆105Updated 3 years ago
- The StreamIt compiler infrastructure.☆71Updated 9 years ago
- A LaTeX paper skeleton for CS systems conference formats☆54Updated 6 years ago
- Grappa: scaling irregular applications on commodity clusters☆159Updated 8 years ago
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory☆300Updated 7 years ago
- Documentation for StreamExecutor open source proposal☆83Updated 9 years ago
- Benchmark for Co-running Single Applications on Integrated Architectures☆12Updated 9 years ago
- Parallel Algorithm Scheduling Library☆105Updated 8 years ago
- GPUfs - File system support for NVIDIA GPUs☆99Updated 7 years ago
- a software library containing Sparse functions written in OpenCL☆176Updated 5 years ago
- The OpenDwarfs project provides a benchmark suite consisting of different computation/communication idioms, i.e., dwarfs, for state-of-ar…☆100Updated 6 years ago
- Caffe deep learning framework - optimized for Xeon Phi☆14Updated 10 years ago
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources☆125Updated 9 months ago
- Intel(R) Machine Learning Scaling Library is a library providing an efficient implementation of communication patterns used in deep learn…☆108Updated 3 years ago
- An extensible framework for program autotuning☆427Updated last week
- Mimir is a new implementation of MapReduce over MPI. Mimir inherits the core principles of existing MapReduce frameworks, such as MR-MPI,…☆21Updated 7 years ago
- Tools and extensions for CUDA profiling☆67Updated 6 years ago
- Tapir extension to LLVM for optimizing Parallel Programs☆132Updated 5 years ago
- [deprecated] Reference Implementation of OpenSHMEM on GASNet (specification <= 1.3)☆43Updated 8 years ago
- OFI Programmer's Guide☆52Updated 3 years ago
- A Lightweight Graph Processing Framework for Multi-GPUs☆14Updated 10 years ago
- C++ implementation of concurrent Binary Search Trees☆72Updated 10 years ago