habanero-rice / hclib
A C/C++ task-based programming model for shared memory and distributed parallel computing.
☆71Updated 4 years ago
Alternatives and similar repositories for hclib:
Users that are interested in hclib are comparing it to the libraries listed below
- OpenSHMEM Application Programming Interface☆55Updated 5 months ago
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆107Updated this week
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL.☆66Updated 5 years ago
- Execution primitives for C++☆153Updated 4 years ago
- This repository contains components that will support percolation via OpenCL and CUDA☆32Updated 3 years ago
- A modern interface for implementing bulk-synchronous parallel programs.☆94Updated 2 years ago
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆87Updated 2 weeks ago
- [deprecated] Reference Implementation of OpenSHMEM on GASNet (specification <= 1.3)☆43Updated 7 years ago
- The Berkeley Container Library☆124Updated last year
- mallocMC: Memory Allocator for Many Core Architectures☆55Updated last month
- Non-blocking message passing (a C++14 MPI wrapper)☆18Updated 10 years ago
- Reference implementation of the draft C++ GraphBLAS specification.☆32Updated 2 months ago
- Distributed ranges is a generalization of C++ ranges for distributed data structures.☆50Updated last week
- Autonomic Performance Environment for eXascale (APEX)☆46Updated 3 weeks ago
- Boost.org mpi module☆62Updated 3 weeks ago
- Scalable High-performance Algorithms and Data-structures☆128Updated 3 months ago
- A universal thread-safe memory pool.☆26Updated 6 years ago
- Project ARES represents a joint effort between LANL and ORNL to introduce a common compiler representation and tool-chain for HPC applica…☆10Updated 8 years ago
- An OpenMP runtime implemented using HPX☆24Updated 2 years ago
- C++ convenience classes to be used with CUDA code, for both the host and the kerlel parts.☆55Updated 6 years ago
- ☆31Updated last month
- ☆17Updated 8 years ago
- A High-performance Cluster Computing Engine☆146Updated 6 years ago
- High-level C++ for Accelerator Clusters☆145Updated last week
- A fast and highly scalable GPU dynamic memory allocator☆104Updated 10 years ago
- C++11 Work-Stealing Task Scheduler☆36Updated 5 years ago
- Fast, shared, upgradeable, non-recursive and non-fair mutex☆30Updated 6 years ago
- Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It provi…☆66Updated last year
- A Low-Level Abstraction of Memory Access☆86Updated last year
- This repository contains material for HPX tutorials given by members of the STE||AR-Group☆34Updated last year