Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement as task graphs that are scheduled concurrently and asynchronously on both CPUs and GPUs.
☆48 · Oct 12, 2021 · Updated 4 years ago
Alternatives and similar repositories for DAGEE
Users that are interested in DAGEE are comparing it to the libraries listed below.
- Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It provi… ☆68 · Feb 15, 2024 · Updated 2 years ago
- Mini-applications that exclusively use the Kokkos programming model ☆12 · Mar 21, 2023 · Updated 3 years ago
- OpenCL tool to detect buffer overflows in GPU kernels ☆23 · Jan 7, 2019 · Updated 7 years ago
- Performance-portable C++ code for simulating elastic shear waves in an axisymmetric domain. ☆13 · Jan 30, 2022 · Updated 4 years ago
- ☆19 · Jan 17, 2024 · Updated 2 years ago
- HPC Examples and Documentation for Julia ☆14 · Feb 7, 2023 · Updated 3 years ago
- The PSC particle-in-cell code ☆25 · Apr 10, 2026 · Updated 3 weeks ago
- Distributed Interactive Visualization and Exploration of large datasets ☆15 · May 11, 2016 · Updated 9 years ago
- Vectorised data model base and helper classes. ☆20 · Updated this week
- MPI accelerator-integrated communication extensions ☆40 · Apr 4, 2023 · Updated 3 years ago
- Sequential and parallel GEMM implementations with C interface + Benchmark. ☆12 · May 24, 2016 · Updated 9 years ago
- The repository contains container recipes to build the entire stack of Xeus-Cling and Cling including cuda extension with just a few comm… ☆10 · Dec 22, 2020 · Updated 5 years ago
- A pseudo random number generator library written against the SYCL API. ☆11 · Jun 11, 2019 · Updated 6 years ago
- Intel® SHMEM - Device initiated shared memory based communication library ☆32 · Nov 12, 2025 · Updated 5 months ago
- Anatomy of High-Performance GEMM with Online Fault Tolerance on GPUs ☆14 · Apr 3, 2025 · Updated last year
- DARMA/magistrate => Serialization and checkpointing library ☆12 · Jan 26, 2026 · Updated 3 months ago
- Automated bottleneck detection and solution orchestration ☆21 · Feb 24, 2026 · Updated 2 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆145 · Apr 24, 2026 · Updated last week
- A GPU performance prediction toolkit for CUDA programs ☆19 · Mar 25, 2019 · Updated 7 years ago
- Standard interface for collecting HPC run metadata ☆16 · Nov 7, 2025 · Updated 5 months ago
- Exploring Machine Learning methods and workflows in a simplified weather model ☆19 · Jun 6, 2024 · Updated last year
- Quasistatic plasma wakefield simulation code for GPUs in well under 1000 lines of code. ☆13 · May 7, 2019 · Updated 6 years ago
- a small lightweight std::execution work-alike ☆66 · Mar 26, 2025 · Updated last year
- Parallel Tasking Library (PTL) - Lightweight C++11 multithreading tasking system featuring thread-pool, task-groups, and lock-free task q… ☆47 · Nov 14, 2024 · Updated last year
- Distributed View Extension for Kokkos ☆51 · Dec 2, 2024 · Updated last year
- COCCL: Compression and precision co-aware collective communication library ☆30 · Mar 16, 2025 · Updated last year
- List all available information about all SYCL devices and platforms ☆15 · Sep 14, 2020 · Updated 5 years ago
- This is the core functions needed by the `tsmp` package. The low level and carefully checked mathematical functions are here. These are i… ☆12 · Dec 16, 2025 · Updated 4 months ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development. ☆37 · Mar 5, 2026 · Updated last month
- ☆17 · Apr 8, 2021 · Updated 5 years ago
- CUDA Dynamic Memory Allocator for SOA Data Layout ☆39 · Dec 29, 2021 · Updated 4 years ago
- Structured PIC proxy app based on Cabana ☆15 · Jun 30, 2025 · Updated 10 months ago
- Comb is a communication performance benchmarking tool. ☆26 · Feb 27, 2023 · Updated 3 years ago
- Reference implementations of MLPerf™ HPC training benchmarks ☆51 · Feb 25, 2025 · Updated last year
- Paranoid Lua programming ☆15 · Mar 4, 2024 · Updated 2 years ago
- Dynamic execution environments for coupled, thread-heterogeneous MPI+X applications ☆21 · Mar 3, 2025 · Updated last year
- SParse AcceleRation on Tensor Architecture ☆18 · Apr 15, 2026 · Updated 2 weeks ago
- Tensor library for machine learning ☆28 · Apr 22, 2026 · Updated last week
- improve the usage experience of std::simd (Parallelism TS 2) ☆32 · Aug 22, 2025 · Updated 8 months ago