alpaka-group / mallocMCLinks
mallocMC: Memory Allocator for Many Core Architectures
☆55Updated 3 weeks ago
Alternatives and similar repositories for mallocMC
Users that are interested in mallocMC are comparing it to the libraries listed below
Sorting:
- ☆70Updated 4 years ago
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆87Updated 2 weeks ago
- Reference implementation of the draft C++ GraphBLAS specification.☆33Updated 3 months ago
- Distributed ranges is a generalization of C++ ranges for distributed data structures.☆50Updated 3 weeks ago
- Collaborating on papers for the ISO C++ committee - public repo☆26Updated 9 months ago
- A Low-Level Abstraction of Memory Access☆86Updated last year
- SYCL Conformance Tests☆71Updated last week
- ☆17Updated 8 years ago
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL.☆66Updated 5 years ago
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆107Updated last week
- A fast and highly scalable GPU dynamic memory allocator☆104Updated 10 years ago
- A vectorizable multi-dimensional iterator for C++ using the Coroutines TS☆12Updated 2 years ago
- Autonomic Performance Environment for eXascale (APEX)☆48Updated 2 weeks ago
- SYCL Reference Manual☆28Updated last year
- SYCL Benchmark Suite☆64Updated 3 months ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆52Updated 2 months ago
- ☆31Updated 2 weeks ago
- Official BOLT Repository☆28Updated 9 months ago
- Library for length agnostic SIMD intrinsic support and the corresponding math operations☆20Updated 3 years ago
- Polymorphic multidimensional array view☆36Updated 4 years ago
- Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It provi…☆68Updated last year
- CUDA Dynamic Memory Allocator for SOA Data Layout☆35Updated 3 years ago
- A library to benchmark CUDA code, similar to google benchmark.☆28Updated 4 years ago
- Fast C header-only library for popcnt, pospopcnt, and set algebraic operations☆45Updated 5 years ago
- Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019☆55Updated 2 years ago
- Unit benchmarks of CUDA event APIs.☆17Updated last year
- Giddy - A lightweight GPU decompression library☆42Updated 5 years ago
- Codeplay project for contributions to the LLVM SYCL implementation☆30Updated 4 years ago
- Compute applications.☆24Updated 5 years ago
- Synchronous, single-threaded, library-only SYCL implementation for debugging and verification.☆35Updated last month