maps-gpu / MAPSLinks
GPU Optimization and Memory Abstraction Framework
☆32Updated 6 years ago
Alternatives and similar repositories for MAPS
Users that are interested in MAPS are comparing it to the libraries listed below
Sorting:
- TTC: A high-performance Compiler for Tensor Transpositions☆21Updated 8 years ago
- Concurrent CPU-GPU Programming using Task Models☆106Updated 6 years ago
- Flexible GPGPU instrumentation☆89Updated 6 years ago
- Full-speed Array of Structures access☆176Updated 2 years ago
- TLB Benchmarks☆35Updated 8 years ago
- CLTune: An automatic OpenCL & CUDA kernel tuner☆184Updated 3 years ago
- A Benchmark Suite for Heterogeneous System Computation☆55Updated 11 months ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆109Updated 8 years ago
- a heterogeneous multiGPU level-3 BLAS library☆46Updated 6 years ago
- LonestarGPU: Irregular algorithms parallelized for GPUs☆38Updated 6 years ago
- A library to benchmark CUDA code, similar to google benchmark.☆30Updated 4 years ago
- portDNN is a library implementing neural network algorithms written using SYCL☆113Updated last year
- GPUfs - File system support for NVIDIA GPUs☆99Updated 7 years ago
- Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It provi…☆68Updated last year
- The SHOC Benchmark Suite☆260Updated 4 months ago
- Third party assembler and GEMM library for NVIDIA Kepler GPU☆85Updated 6 years ago
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL.☆66Updated 6 years ago
- Evaluating different memory managers for dynamic GPU memory☆26Updated 5 years ago
- Fast integer division with divisor not known at compile time. To be used primarily in CUDA kernels.☆73Updated 10 years ago
- ☆74Updated 2 years ago
- ☆72Updated 5 years ago
- Mimir is a new implementation of MapReduce over MPI. Mimir inherits the core principles of existing MapReduce frameworks, such as MR-MPI,…☆21Updated 7 years ago
- The Berkeley Container Library☆126Updated last month
- A CUDA-based multi-GPU vertex-centric graph processing framework based on Warp Segmentation and Vertex Refinement techniques.☆12Updated 8 years ago
- A C/C++ task-based programming model for shared memory and distributed parallel computing.☆72Updated 5 years ago
- Multi-GPU Computing Benchmark Suite (CUDA)☆43Updated 8 years ago
- GPUnet is a native GPU networking layer that provides a socket abstraction over Infiniband to GPU programs for NVIDIA GPUs.☆117Updated 10 years ago
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆260Updated last year
- A fast and highly scalable GPU dynamic memory allocator☆112Updated 10 years ago
- A Distributed Multi-GPU System for Fast Graph Processing☆65Updated 7 years ago