GPU load-balancing library for regular and irregular computations.
★66 · Sep 9, 2025 · Updated 5 months ago
Alternatives and similar repositories for loops
Users who are interested in loops are comparing it to the libraries listed below.
- CUDA/C++ GPU graph analytics simplified. ★32 · Sep 19, 2022 · Updated 3 years ago
- mini is mini ★20 · Jan 19, 2020 · Updated 6 years ago
- ★18 · Jan 17, 2024 · Updated 2 years ago
- Runs a single CUDA/OpenCL kernel, taking its source from a file and arguments from the command line ★24 · Nov 25, 2025 · Updated 3 months ago
- ★14 · Apr 24, 2024 · Updated last year
- CUDA Dynamic Memory Allocator for SOA Data Layout ★38 · Dec 29, 2021 · Updated 4 years ago
- Generate simple index ranges in C++ and CUDA C++ ★39 · Jun 14, 2023 · Updated 2 years ago
- ★11 · Aug 8, 2021 · Updated 4 years ago
- Source code for the paper: Accelerating Dynamic Graph Analytics on GPUs ★30 · Jun 19, 2023 · Updated 2 years ago
- cuASR: CUDA Algebra for Semirings ★44 · Aug 22, 2022 · Updated 3 years ago
- LonestarGPU: Irregular algorithms parallelized for GPUs ★38 · Nov 11, 2019 · Updated 6 years ago
- Using C++ magic to capture CUDA kernels and tune them with Kernel Tuner ★21 · Sep 12, 2025 · Updated 5 months ago
- Open-source library for Graph Streaming. Solves the connected components problem using sub-linear space. Published in SIGMOD'22. ★10 · Nov 13, 2025 · Updated 3 months ago
- OpenMP offload playground ★10 · Nov 16, 2024 · Updated last year
- EPOCH Input System Version 2 ★10 · Jun 5, 2020 · Updated 5 years ago
- ★625 · Feb 20, 2026 · Updated last week
- Statistics on GPUs ★33 · Sep 8, 2025 · Updated 5 months ago
- ★23 · Feb 16, 2022 · Updated 4 years ago
- A C++-based implementation of the TeaLeaf heat conduction mini-app. This implementation of TeaLeaf replicates the functionality of the ref… ★25 · Aug 11, 2024 · Updated last year
- Goal: a website to automatically train and certify compiler researchers and developers ★10 · Nov 24, 2019 · Updated 6 years ago
- ★11 · Dec 23, 2019 · Updated 6 years ago
- ★12 · Aug 4, 2025 · Updated 6 months ago
- A 3D multi-material Arbitrary Lagrangian-Eulerian hydrocode ★15 · Mar 25, 2020 · Updated 5 years ago
- Continuum Dynamics Evaluation and Test Suite ★15 · Aug 29, 2017 · Updated 8 years ago
- ★11 · Apr 10, 2019 · Updated 6 years ago
- YAKL is A Kokkos Layer: A simple C++ framework for performance portability and Fortran code porting ★69 · Sep 9, 2025 · Updated 5 months ago
- Evaluating different memory managers for dynamic GPU memory ★26 · Dec 16, 2020 · Updated 5 years ago
- A vectorizable multi-dimensional iterator for C++ using the Coroutines TS ★12 · Jun 5, 2022 · Updated 3 years ago
- A pseudo random number generator library written against the SYCL API. ★11 · Jun 11, 2019 · Updated 6 years ago
- Exploring Machine Learning methods and workflows in a simplified weather model ★19 · Jun 6, 2024 · Updated last year
- GPUDirect Async implementation of HPGMG-FV CUDA ★11 · May 11, 2018 · Updated 7 years ago
- Fast SGEMM emulation on Tensor Cores ★17 · Feb 16, 2025 · Updated last year
- A BUDE virtual-screening benchmark, in many programming models ★30 · Oct 15, 2024 · Updated last year
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs. ★32 · Apr 2, 2025 · Updated 11 months ago
- Programmable CUDA/C++ GPU Graph Analytics ★1,067 · Feb 9, 2026 · Updated 3 weeks ago
- Kokkos+Eigen: Write fast, readable multi-platform code. ★15 · Updated this week
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code ★15 · Mar 19, 2023 · Updated 2 years ago
- Scale-out system monitoring ★20 · Feb 23, 2026 · Updated last week
- A Lightweight Graph Processing Framework for Multi-GPUs ★14 · Apr 15, 2015 · Updated 10 years ago