Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement, as task graphs that are scheduled concurrently and asynchronously on both CPUs and GPUs.
☆47Oct 12, 2021Updated 4 years ago
Alternatives and similar repositories for DAGEE
Users that are interested in DAGEE are comparing it to the libraries listed below
Sorting:
- Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It provi…☆68Feb 15, 2024Updated 2 years ago
- Mini-applications that exclusively use the Kokkos programming model☆12Mar 21, 2023Updated 3 years ago
- OpenCL tool to detect buffer overflows in GPU kernels☆23Jan 7, 2019Updated 7 years ago
- Performance-portable C++ code for simulating elastic shear waves in an axisymmetric domain.☆13Jan 30, 2022Updated 4 years ago
- ☆19Jan 17, 2024Updated 2 years ago
- HPC Examples and Documentation for Julia☆14Feb 7, 2023Updated 3 years ago
- The PSC particle-in-cell code☆23Mar 13, 2026Updated last week
- Distributed Interactive Visualization and Exploration of large datasets☆15May 11, 2016Updated 9 years ago
- Vectorised data model base and helper classes.☆20Mar 5, 2026Updated 2 weeks ago
- MPI accelerator-integrated communication extensions☆40Apr 4, 2023Updated 2 years ago
- Anatomy of High-Performance GEMM with Online Fault Tolerance on GPUs☆13Apr 3, 2025Updated 11 months ago
- Sequential and parallel GEMM implementations with C interface + Benchmark.☆12May 24, 2016Updated 9 years ago
- A pseudo random number generator library written against the SYCL API.☆11Jun 11, 2019Updated 6 years ago
- The repository contains container recipes to build the entire stack of Xeus-Cling and Cling including cuda extension with just a few comm…☆10Dec 22, 2020Updated 5 years ago
- Intel® SHMEM - Device initiated shared memory based communication library☆32Nov 12, 2025Updated 4 months ago
- DARMA/magistrate => Serialization and checkpointing library☆12Jan 26, 2026Updated last month
- Automated bottleneck detection and solution orchestration☆19Feb 24, 2026Updated 3 weeks ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆146Mar 10, 2026Updated last week
- A GPU performance prediction toolkit for CUDA programs☆19Mar 25, 2019Updated 6 years ago
- Standard interface for collecting HPC run metadata☆16Nov 7, 2025Updated 4 months ago
- Exploring Machine Learning methods and workflows in a simplified weather model☆19Jun 6, 2024Updated last year
- Quasistatic plasma wakefield simulation code for GPUs in well under 1000 lines of code.☆13May 7, 2019Updated 6 years ago
- a small lightweight std::execution work-alike☆66Mar 26, 2025Updated 11 months ago
- Parallel Tasking Library (PTL) - Lightweight C++11 mutilthreading tasking system featuring thread-pool, task-groups, and lock-free task q…☆48Nov 14, 2024Updated last year
- Distributed View Extension for Kokkos☆50Dec 2, 2024Updated last year
- COCCL: Compression and precision co-aware collective communication library☆30Mar 16, 2025Updated last year
- List all available information about all SYCL devices and platforms☆15Sep 14, 2020Updated 5 years ago
- This is the core functions needed by the `tsmp` package. The low level and carefully checked mathematical functions are here. These are i…☆12Dec 16, 2025Updated 3 months ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆37Mar 5, 2026Updated 2 weeks ago
- CUDA Dynamic Memory Allocator for SOA Data Layout☆39Dec 29, 2021Updated 4 years ago
- ☆17Apr 8, 2021Updated 4 years ago
- Structured PIC proxy app based on Cabana☆15Jun 30, 2025Updated 8 months ago
- Comb is a communication performance benchmarking tool.☆26Feb 27, 2023Updated 3 years ago
- Reference implementations of MLPerf™ HPC training benchmarks☆50Feb 25, 2025Updated last year
- Paranoid Lua programming☆15Mar 4, 2024Updated 2 years ago
- SParse AcceleRation on Tensor Architecture☆18Apr 7, 2025Updated 11 months ago
- Dynamic execution environments for coupled, thread-heterogeneous MPI+X applications☆21Mar 3, 2025Updated last year
- Tensor library for machine learning☆26Updated this week
- improve the usage experience of std::simd (Parallelism TS 2)☆32Aug 22, 2025Updated 7 months ago