morpheusthewhite / parallel-dfs-dag
A parallel implementation of DFS for Directed Acyclic Graphs (https://research.nvidia.com/publication/parallel-depth-first-search-directed-acyclic-graphs)
☆50 · Updated 4 years ago
Alternatives and similar repositories for parallel-dfs-dag:
Users who are interested in parallel-dfs-dag are comparing it to the libraries listed below:
- Reference implementation of the draft C++ GraphBLAS specification. ☆30 · Updated last week
- pika is a C++ tasking library built on std::execution with fibers, CUDA, HIP, and MPI support. ☆68 · Updated this week
- Task graph-based asynchronous programming system using C++ coroutine ☆87 · Updated last year
- mallocMC: Memory Allocator for Many Core Architectures ☆54 · Updated last week
- ☆68 · Updated 4 years ago
- The Berkeley Container Library ☆124 · Updated last year
- Scheduling examples using C++20 coroutines ☆24 · Updated last year
- Lightweight recording and sampling of performance counters for specific code segments directly from your C++ application. ☆55 · Updated 2 weeks ago
- Fast, generalized, implementation of the Chase-Lev lock-free work-stealing deque for C++17 ☆143 · Updated 3 years ago
- a small lightweight std::execution work-alike ☆60 · Updated this week
- a CUDA implementation of a priority queue ☆83 · Updated 4 years ago
- The Fancy Named Parameters Library ☆30 · Updated 3 months ago
- C++ p2300 proposal in Rust ☆17 · Updated last year
- Compact SVO optimized vector for C++17 or higher ☆100 · Updated 9 months ago
- C++ Scattered Containers ☆67 · Updated 7 years ago
- C++20 Coroutines and io_uring ☆48 · Updated 2 years ago
- C++20 fiber implementation with similar interface to std::thread, header-only / x86_64 / Linux only / stackful / built-in scheduler / thr… ☆28 · Updated 5 months ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis. ☆50 · Updated last year
- This repository is a very simple C++ modules example. ☆23 · Updated last year
- Profiling Taskflow Programs through Visualization ☆49 · Updated last year
- EDSL for C++ to make just-in-time code generation easier ☆85 · Updated 2 years ago
- ☆56 · Updated 6 months ago
- Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement, as ta… ☆45 · Updated 3 years ago
- performance experiments for C++ exception handling ☆30 · Updated 3 years ago
- Lightweight structured concurrency for C++20 ☆109 · Updated 3 weeks ago
- A benchmark for cache efficient data structures. ☆30 · Updated 6 years ago
- Experimental ranges for CUDA ☆25 · Updated 6 years ago
- A modern interface for implementing bulk-synchronous parallel programs. ☆93 · Updated 2 years ago
- An efficient, composable design pattern for range processing ☆113 · Updated 3 years ago
- A lightweight lazy-loading wrapper written in Modern C++ ☆46 · Updated 7 years ago