morpheusthewhite / parallel-dfs-dagLinks
A parallel implementation of DFS for Directed Acyclic Graphs (https://research.nvidia.com/publication/parallel-depth-first-search-directed-acyclic-graphs)
☆50Updated 4 years ago
Alternatives and similar repositories for parallel-dfs-dag
Users that are interested in parallel-dfs-dag are comparing it to the libraries listed below
Sorting:
- A C/C++ task-based programming model for shared memory and distributed parallel computing.☆72Updated 5 years ago
- a CUDA implementation of a priority queue☆84Updated 5 years ago
- The Berkeley Container Library☆126Updated 2 months ago
- A C++ memory pool that is Boost-friendly and performance oriented (zero-malloc).☆23Updated 2 months ago
- Experimental patches to implement missing C++20 modules features for the clang/LLVM toolchain.☆23Updated 3 years ago
- Task graph-based asynchronous programming system using C++ coroutine☆100Updated last year
- Profiling Taskflow Programs through Visualization☆51Updated 2 years ago
- Scheduling examples using C++20 coroutines☆29Updated 2 years ago
- Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement, as ta…☆47Updated 4 years ago
- performance experiments for C++ exception handling☆32Updated 3 years ago
- Interchangeable AoS and SoA containers☆25Updated 3 years ago
- Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal, and Rust - all it takes to sum a lot of numbers fast!☆116Updated 6 months ago
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL.☆66Updated 6 years ago
- ☆72Updated 5 years ago
- pika is a C++ tasking library built on std::execution with fibers, CUDA, HIP, and MPI support.☆80Updated last week
- Concurrent CPU-GPU Programming using Task Models☆106Updated 6 years ago
- a small lightweight std::execution work-alike☆65Updated 10 months ago
- A Low-Level Abstraction of Memory Access☆93Updated last year
- Boost.org graph_parallel module☆33Updated 2 months ago
- Benchmarks for our memory allocator☆70Updated 2 years ago
- A High-performance Cluster Computing Engine☆148Updated 6 years ago
- A curated list of Modern C++ idioms☆25Updated 9 years ago
- Seamless llvm-mca CMake integration☆28Updated 5 years ago
- Modern library for bulk-synchronous parallel programs.☆94Updated 2 years ago
- Intel® Instrumentation and Tracing Technology (ITT) and Just-In-Time (JIT) APIs☆130Updated last week
- Header-only C++ library for low precision floating point type emulation.☆179Updated 6 years ago
- A header only structure of arrays container for C++☆43Updated 2 years ago
- Experimental ranges for CUDA☆25Updated 7 years ago
- C++20 Coroutines and io_uring☆52Updated 3 years ago
- Discrete-event simulation in C++20 using coroutines☆65Updated 6 months ago