usnistgov / HTGS
The Hybrid Task Graph Scheduler API
☆40Updated last month
Alternatives and similar repositories for HTGS
Users who are interested in HTGS are comparing it to the libraries listed below.
- ☆31Updated last month
- GPU Optimization and Memory Abstraction Framework☆32Updated 5 years ago
- This repository contains components that will support percolation via OpenCL and CUDA☆32Updated 3 years ago
- Parallel Tasking Library (PTL) - Lightweight C++11 multithreading tasking system featuring thread-pool, task-groups, and lock-free task q…☆47Updated 7 months ago
- An alternative to Boost.MPI for a user friendly C++ interface for MPI (MPICH).☆19Updated 7 years ago
- A C/C++ task-based programming model for shared memory and distributed parallel computing.☆71Updated 4 years ago
- Global Memory and Threading runtime system☆24Updated last year
- Concurrent CPU-GPU Programming using Task Models☆103Updated 5 years ago
- TTC: A high-performance Compiler for Tensor Transpositions☆20Updated 7 years ago
- Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It provi…☆68Updated last year
- A C++ framework for data analytics pipelines☆26Updated 5 years ago
- GraphBLAS Template Library (GBTL): C++ graph algorithms and primitives using semiring algebra as defined at graphblas.org☆133Updated 2 years ago
- Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement, as ta…☆46Updated 3 years ago
- Simplified Interface to Complex Memory☆28Updated last year
- StarPU Runtime system☆16Updated 14 years ago
- CMake module collection☆30Updated 10 years ago
- OpenSHMEM Reference Implementation over UCX for Specification 1.4 and up☆36Updated 2 years ago
- C++ User interface for the Platform independent Library Alpaka☆38Updated 10 months ago
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL.☆66Updated 5 years ago
- C++ Header-Only Library for High-Performance Tensor-Vector Multiplication☆21Updated 6 months ago
- A Distributed Multi-GPU System for Fast Graph Processing☆65Updated 6 years ago
- Compiler-agnostic metaprogramming library providing concepts, type operations and tuples for C++ and CUDA☆87Updated this week
- mallocMC: Memory Allocator for Many Core Architectures☆56Updated last month
- A thread safe simple C++ wrapper for FFTW & MKL☆15Updated 3 years ago
- Kernel Tuning Toolkit☆60Updated last month
- Autonomic Performance Environment for eXascale (APEX)☆48Updated last month
- Experimental ranges for CUDA☆24Updated 6 years ago
- Library for the GPU-accelerated spatial indexing and processing of particles in 2D and 3D with OpenCL. Currently offers trees based on sp…☆27Updated 10 months ago
- A task benchmark☆43Updated 10 months ago
- ☆32Updated 4 years ago