AMDResearch/DAGEE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AMDResearch/DAGEE)

AMDResearch / DAGEE

Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement, as task graphs that are scheduled concurrently and asynchronously on both CPUs and GPUs.

☆49

Alternatives and similar repositories for DAGEE

Users that are interested in DAGEE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ROCm / atmi
View on GitHub
Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It provi…
☆68Feb 15, 2024Updated 2 years ago
kokkos / kokkos-miniapps
View on GitHub
Mini-applications that exclusively use the Kokkos programming model
☆12Mar 21, 2023Updated 3 years ago
ROCm / clARMOR
View on GitHub
OpenCL tool to detect buffer overflows in GPU kernels
☆23Jan 7, 2019Updated 7 years ago
Pressio / SHAW
View on GitHub
Performance-portable C++ code for simulating elastic shear waves in an axisymmetric domain.
☆13Jan 30, 2022Updated 4 years ago
psc-code / psc
View on GitHub
The PSC particle-in-cell code
☆26Jun 4, 2026Updated last month
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
JBlaschke / HPC-Julia
View on GitHub
HPC Examples and Documentation for Julia
☆14Feb 7, 2023Updated 3 years ago
ROCm / roc-stdpar
View on GitHub
☆20Jan 17, 2024Updated 2 years ago
ComputationalRadiationPhysics / pyDive
View on GitHub
Distributed Interactive Visualization and Exploration of large datasets
☆15May 11, 2016Updated 10 years ago
acts-project / vecmem
View on GitHub
Vectorised data model base and helper classes.
☆20May 26, 2026Updated last month
BenjaminW3 / matmul
View on GitHub
Sequential and parallel GEMM implementations with C interface + Benchmark.
☆12May 24, 2016Updated 10 years ago
NVIDIA / mpi-acx
View on GitHub
MPI accelerator-integrated communication extensions
☆39Apr 4, 2023Updated 3 years ago
shixun404 / Fault-Tolerant-SGEMM-on-NVIDIA-GPUs
View on GitHub
Anatomy of High-Performance GEMM with Online Fault Tolerance on GPUs
☆14Apr 3, 2025Updated last year
ComputationalRadiationPhysics / xeus-cling-cuda-container
View on GitHub
The repository contains container recipes to build the entire stack of Xeus-Cling and Cling including cuda extension with just a few comm…
☆10Dec 22, 2020Updated 5 years ago
Wigner-GPU-Lab / SYCL-PRNG
View on GitHub
A pseudo random number generator library written against the SYCL API.
☆11Jun 11, 2019Updated 7 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
alpaka-group / vikunja
View on GitHub
Vikunja is a performance portable algorithm library that defines functions operating on ranges of elements for a variety of purposes . It…
☆16Oct 10, 2023Updated 2 years ago
DARMA-tasking / magistrate
View on GitHub
DARMA/magistrate => Serialization and checkpointing library
☆12Jan 26, 2026Updated 5 months ago
oneapi-src / ishmem
View on GitHub
Intel® SHMEM - Device initiated shared memory based communication library
☆33Nov 12, 2025Updated 8 months ago
ROCm / rocSHMEM
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-systems repo
☆146Updated this week
llnl / Adiak
View on GitHub
Standard interface for collecting HPC run metadata
☆16Nov 7, 2025Updated 8 months ago
ekondis / gpuroofperf-toolkit
View on GitHub
A GPU performance prediction toolkit for CUDA programs
☆18Mar 25, 2019Updated 7 years ago
lotov / lcode3d
View on GitHub
Quasistatic plasma wakefield simulation code for GPUs in well under 1000 lines of code.
☆13May 7, 2019Updated 7 years ago
ericniebler / ustdex
View on GitHub
a small lightweight std::execution work-alike
☆66May 15, 2026Updated 2 months ago
prg-titech / dynasoar
View on GitHub
CUDA Dynamic Memory Allocator for SOA Data Layout
☆39Dec 29, 2021Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
pnnl / HiParTI
View on GitHub
☆17Apr 8, 2021Updated 5 years ago
wdmapp / gtensor
View on GitHub
GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.
☆37Mar 5, 2026Updated 4 months ago
kokkos / kokkos-remote-spaces
View on GitHub
Distributed View Extension for Kokkos
☆53Dec 2, 2024Updated last year
illuhad / syclinfo
View on GitHub
List all available information about all SYCL devices and platforms
☆15Sep 14, 2020Updated 5 years ago
matrix-profile-foundation / matrixprofiler
View on GitHub
This is the core functions needed by the `tsmp` package. The low level and carefully checked mathematical functions are here. These are i…
☆12Jun 10, 2026Updated last month
jrmadsen / PTL
View on GitHub
Parallel Tasking Library (PTL) - Lightweight C++11 mutilthreading tasking system featuring thread-pool, task-groups, and lock-free task q…
☆47Nov 14, 2024Updated last year
HicrestLaboratory / SPARTA
View on GitHub
SParse AcceleRation on Tensor Architecture
☆18Apr 15, 2026Updated 3 months ago
ECP-copa / CabanaPIC
View on GitHub
Structured PIC proxy app based on Cabana
☆15Jun 30, 2025Updated last year
sandialabs / LAPIS
View on GitHub
An MLIR-based compiler targeting Kokkos and other programming models
☆17Jul 14, 2026Updated last week
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
mlcommons / hpc
View on GitHub
Reference implementations of MLPerf™ HPC training benchmarks
☆51Feb 25, 2025Updated last year
llnl / Comb
View on GitHub
Comb is a communication performance benchmarking tool.
☆25Feb 27, 2023Updated 3 years ago
hishamhm / safer
View on GitHub
Paranoid Lua programming
☆15Mar 4, 2024Updated 2 years ago
lanl / libquo
View on GitHub
Dynamic execution environments for coupled, thread-heterogeneous MPI+X applications
☆23Mar 3, 2025Updated last year
hishamhm / subprocess
View on GitHub
A port of the Python subprocess module to Lua
☆18Nov 1, 2018Updated 7 years ago
mattkretz / vir-simd
View on GitHub
improve the usage experience of std::simd (Parallelism TS 2)
☆33May 19, 2026Updated 2 months ago
ROCm / omnistat
View on GitHub
Scale-out system monitoring
☆25Updated this week