morpheusthewhite / parallel-dfs-dagLinks
A parallel implementation of DFS for Directed Acyclic Graphs (https://research.nvidia.com/publication/parallel-depth-first-search-directed-acyclic-graphs)
☆50Updated 4 years ago
Alternatives and similar repositories for parallel-dfs-dag
Users that are interested in parallel-dfs-dag are comparing it to the libraries listed below
Sorting:
- a CUDA implementation of a priority queue☆84Updated 4 years ago
- Reference implementation of the draft C++ GraphBLAS specification.☆33Updated 3 months ago
- Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019☆55Updated 2 years ago
- Profiling Taskflow Programs through Visualization☆50Updated 2 years ago
- Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement, as ta…☆46Updated 3 years ago
- Lightweight recording and sampling of performance counters for specific code segments directly from your C++ application.☆65Updated last week
- ☆70Updated 4 years ago
- mallocMC: Memory Allocator for Many Core Architectures☆55Updated 3 weeks ago
- The Berkeley Container Library☆124Updated last year
- Task graph-based asynchronous programming system using C++ coroutine☆90Updated last year
- A C/C++ task-based programming model for shared memory and distributed parallel computing.☆71Updated 4 years ago
- C++ convenience classes to be used with CUDA code, for both the host and the kerlel parts.☆55Updated 6 years ago
- pika is a C++ tasking library built on std::execution with fibers, CUDA, HIP, and MPI support.☆73Updated this week
- SYCL Reference Manual☆28Updated last year
- EDSL for C++ to make just-in-time code generation easier☆86Updated 2 years ago
- Polymorphic memory resource for real-time applications.☆69Updated last year
- Scheduling examples using C++20 coroutines☆24Updated 2 years ago
- Seamless llvm-mca CMake integration☆27Updated 5 years ago
- ☆17Updated 8 years ago
- Generate simple index ranges in C++ and CUDA C++☆39Updated last year
- performance experiments for C++ exception handling☆30Updated 3 years ago
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆87Updated 3 weeks ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆52Updated 2 months ago
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL.☆66Updated 5 years ago
- Experimental patches to implement missing C++20 modules features for the clang/LLVM toolchain.☆23Updated 3 years ago
- High-level C++ for Accelerator Clusters☆145Updated this week
- A fast implementation of log() and exp()☆53Updated 2 years ago
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆107Updated last week
- ☆21Updated 7 months ago
- Unit benchmarks of CUDA event APIs.☆17Updated last year