usnistgov / HTGS
The Hybrid Task Graph Scheduler API
☆40Updated 3 years ago
Alternatives and similar repositories for HTGS:
Users that are interested in HTGS are comparing it to the libraries listed below
- GPU Optimization and Memory Abstraction Framework☆32Updated 5 years ago
- ☆28Updated 2 months ago
- StarPU Runtime system☆16Updated 14 years ago
- Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It provi…☆66Updated 11 months ago
- A C/C++ task-based programming model for shared memory and distributed parallel computing.☆71Updated 4 years ago
- Concurrent CPU-GPU Programming using Task Models☆100Updated 5 years ago
- Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement, as ta…☆45Updated 3 years ago
- An alternative to Boost.MPI for a user friendly C++ interface for MPI (MPICH).☆19Updated 6 years ago
- This repository contains components that will support percolation via OpenCL and CUDA☆31Updated 3 years ago
- DSL for stencils and image processing☆13Updated 8 years ago
- Global Memory and Threading runtime system☆23Updated 8 months ago
- Simplified Interface to Complex Memory☆27Updated last year
- A GPU-based LZSS compression algorithm, highly tuned for NVIDIA GPGPUs and for streaming data, leveraging the respective strengths of CPU…☆36Updated 9 years ago
- TTC: A high-performance Compiler for Tensor Transpositions☆20Updated 7 years ago
- C++ Header-Only Library for High-Performance Tensor-Vector Multiplication☆21Updated last month
- GPU Automatically Tuned Linear Algebra Software☆28Updated 9 years ago
- Parallel Tasking Library (PTL) - Lightweight C++11 mutilthreading tasking system featuring thread-pool, task-groups, and lock-free task q…☆43Updated 2 months ago
- Collection of full, mini, proxy, and benchmark apps.☆11Updated 4 years ago
- Persistent memory allocator for data-centric analytics☆54Updated this week
- CMake module collection☆30Updated 9 years ago
- Portable HPC Containers (C++)☆48Updated this week
- mallocMC: Memory Allocator for Many Core Architectures☆53Updated this week
- OpenSHMEM Reference Implementation over UCX for Specification 1.4 and up☆33Updated last year
- Experimental ranges for CUDA☆25Updated 5 years ago
- A task benchmark☆40Updated 5 months ago
- Evaluating different memory managers for dynamic GPU memory☆24Updated 4 years ago
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆82Updated this week
- Boost.org graph_parallel module☆28Updated last month
- Automatically exported from code.google.com/p/freeocl☆31Updated 7 years ago
- Artifacts for SOSP'19 paper Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions☆21Updated 2 years ago