habanero-rice / hclib
A C/C++ task-based programming model for shared memory and distributed parallel computing.
☆71Updated 4 years ago
Alternatives and similar repositories for hclib:
Users that are interested in hclib are comparing it to the libraries listed below
- C++11 Work-Stealing Task Scheduler☆36Updated 5 years ago
- ☆17Updated 8 years ago
- mallocMC: Memory Allocator for Many Core Architectures☆55Updated 2 weeks ago
- High-level C++ for Accelerator Clusters☆146Updated 2 weeks ago
- Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It provi…☆66Updated last year
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆107Updated this week
- A modern interface for implementing bulk-synchronous parallel programs.☆94Updated 2 years ago
- A fast and highly scalable GPU dynamic memory allocator☆104Updated 10 years ago
- OpenSHMEM Application Programming Interface☆54Updated 5 months ago
- This repository contains components that will support percolation via OpenCL and CUDA☆32Updated 3 years ago
- An alternative to Boost.MPI for a user friendly C++ interface for MPI (MPICH).☆19Updated 7 years ago
- [deprecated] Reference Implementation of OpenSHMEM on GASNet (specification <= 1.3)☆43Updated 7 years ago
- Autonomic Performance Environment for eXascale (APEX)☆45Updated this week
- Fast, shared, upgradeable, non-recursive and non-fair mutex☆30Updated 6 years ago
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆86Updated 3 weeks ago
- An OpenMP runtime implemented using HPX☆23Updated 2 years ago
- ☆30Updated 2 weeks ago
- Persistent memory allocator for data-centric analytics☆54Updated 2 weeks ago
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL.☆66Updated 5 years ago
- The Berkeley Container Library☆124Updated last year
- A Low-Level Abstraction of Memory Access☆85Updated last year
- ☆69Updated 4 years ago
- C++ convenience classes to be used with CUDA code, for both the host and the kerlel parts.☆55Updated 6 years ago
- Scalable High-performance Algorithms and Data-structures☆128Updated 2 months ago
- C++ Summer Lecture Series 2016☆13Updated 8 years ago
- Concurrent CPU-GPU Programming using Task Models☆101Updated 5 years ago
- Execution primitives for C++☆153Updated 4 years ago
- A High-performance Cluster Computing Engine☆146Updated 5 years ago
- UME::SIMD A library for explicit simd vectorization.☆91Updated 7 years ago
- Profiling Taskflow Programs through Visualization☆50Updated 2 years ago