habanero-rice / hclib
A C/C++ task-based programming model for shared memory and distributed parallel computing.
☆71Updated 4 years ago
Alternatives and similar repositories for hclib
Users that are interested in hclib are comparing it to the libraries listed below
- Autonomic Performance Environment for eXascale (APEX)☆48Updated last month
- OpenSHMEM Application Programming Interface☆57Updated 7 months ago
- Scalable High-performance Algorithms and Data-structures☆131Updated 3 weeks ago
- Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It provi…☆68Updated last year
- Concurrent CPU-GPU Programming using Task Models☆103Updated 5 years ago
- C++ convenience classes to be used with CUDA code, for both the host and the kernel parts.☆55Updated 6 years ago
- [deprecated] Reference Implementation of OpenSHMEM on GASNet (specification <= 1.3)☆43Updated 7 years ago
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆107Updated last week
- ☆31Updated last month
- An OpenMP runtime implemented using HPX☆24Updated 2 years ago
- Compiler-agnostic metaprogramming library providing concepts, type operations and tuples for C++ and CUDA☆87Updated this week
- Profiling Taskflow Programs through Visualization☆50Updated 2 years ago
- Boost.org mpi module☆62Updated last month
- A proposal for a futures programming model for ISO C++☆22Updated 6 years ago
- SYCL-ML is a C++ library implementing classical machine learning algorithms using SYCL.☆66Updated 5 years ago
- UME::SIMD: a library for explicit SIMD vectorization.☆90Updated 7 years ago
- High-level C++ for Accelerator Clusters☆145Updated last week
- mallocMC: Memory Allocator for Many Core Architectures☆56Updated last month
- ☆17Updated 8 years ago
- ☆70Updated 5 years ago
- C++11 Work-Stealing Task Scheduler☆36Updated 5 years ago
- Simplified Interface to Complex Memory☆28Updated last year
- The Berkeley Container Library☆124Updated last year
- A High-performance Cluster Computing Engine☆146Updated 6 years ago
- A modern interface for implementing bulk-synchronous parallel programs.☆94Updated 2 years ago
- A fast and highly scalable GPU dynamic memory allocator☆104Updated 10 years ago
- A universal thread-safe memory pool.☆26Updated 6 years ago
- A Low-Level Abstraction of Memory Access☆86Updated last year
- Execution primitives for C++☆153Updated 5 years ago
- Non-blocking message passing (a C++14 MPI wrapper)☆18Updated 10 years ago