GPU load-balancing library for regular and irregular computations.
☆66 · Sep 9, 2025 · Updated 6 months ago
Alternatives and similar repositories for loops
Users that are interested in loops are comparing it to the libraries listed below.
- CUDA/C++ GPU graph analytics simplified. ☆32 · Sep 19, 2022 · Updated 3 years ago
- ☆14 · Apr 24, 2024 · Updated last year
- A vectorizable multi-dimensional iterator for C++ using the Coroutines TS ☆12 · Jun 5, 2022 · Updated 3 years ago
- Runs a single CUDA/OpenCL kernel, taking its source from a file and arguments from the command-line ☆24 · Mar 15, 2026 · Updated last week
- mini is mini ☆20 · Jan 19, 2020 · Updated 6 years ago
- CUDA Dynamic Memory Allocator for SOA Data Layout ☆39 · Dec 29, 2021 · Updated 4 years ago
- ☆19 · Jan 17, 2024 · Updated 2 years ago
- Source code for the paper: Accelerating Dynamic Graph Analytics on GPUs ☆30 · Jun 19, 2023 · Updated 2 years ago
- ☆627 · Mar 12, 2026 · Updated last week
- Open-source library for Graph Streaming. Solves the connected components problem using sub-linear space. Published in SIGMOD'22. ☆10 · Mar 12, 2026 · Updated last week
- Chapel HyperGraph Library (CHGL) - HPC-class Hypergraphs in Chapel ☆33 · Oct 29, 2020 · Updated 5 years ago
- LonestarGPU: Irregular algorithms parallelized for GPUs ☆38 · Nov 11, 2019 · Updated 6 years ago
- ☆11 · Aug 8, 2021 · Updated 4 years ago
- cuASR: CUDA Algebra for Semirings ☆45 · Aug 22, 2022 · Updated 3 years ago
- Efficient and High-quality Graph Coloring on the GPU ☆16 · Apr 3, 2022 · Updated 3 years ago
- Source code supporting the High Performance Graphics 2022 paper: Supporting Unified Shader Specialization by Co-opting C++ Features ☆14 · Jul 9, 2022 · Updated 3 years ago
- ☆18 · Oct 15, 2020 · Updated 5 years ago
- Multi-GPU dynamic scheduler using PGAS style cross-GPU communication ☆29 · Jul 23, 2023 · Updated 2 years ago
- Programmable CUDA/C++ GPU Graph Analytics ☆1,071 · Feb 28, 2026 · Updated 3 weeks ago
- Evaluating different memory managers for dynamic GPU memory ☆26 · Dec 16, 2020 · Updated 5 years ago
- GPUDirect Async implementation of HPGMG-FV CUDA ☆11 · May 11, 2018 · Updated 7 years ago
- A pattern-based algorithmic autotuner for graph processing on GPUs. ☆32 · Jun 25, 2025 · Updated 8 months ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs. ☆32 · Apr 2, 2025 · Updated 11 months ago
- Scale-out system monitoring ☆21 · Updated this week
- A Collection of Parallel Algorithms for Computational Geometry ☆12 · Mar 10, 2022 · Updated 4 years ago
- A Lightweight Graph Processing Framework for Multi-GPUs ☆14 · Apr 15, 2015 · Updated 10 years ago
- ESL-CGRA-simulator ☆16 · Updated this week
- Department of Energy Standard Utility Library ☆33 · Mar 16, 2026 · Updated last week
- Statistics on GPUs ☆33 · Sep 8, 2025 · Updated 6 months ago
- A C++based implementation of the TeaLeaf heat conduction mini-app. This implementation of TeaLeaf replicates the functionality of the ref… ☆25 · Aug 11, 2024 · Updated last year
- Using C++ magic to capture CUDA kernels and tune them with Kernel Tuner ☆21 · Sep 12, 2025 · Updated 6 months ago
- A reference implementation of std::simd, providing data parallel types in the C++ standard ☆14 · Mar 9, 2020 · Updated 6 years ago
- GenDP: A Dynamic Programming Framework for Genome Sequencing Analysis ☆17 · Jan 12, 2024 · Updated 2 years ago
- cross-platform modular neural network inference library, small and efficient ☆13 · May 15, 2023 · Updated 2 years ago
- Artifact for OSDI'21 GNNAdvisor: An Adaptive and Efficient Runtime System for GNN Acceleration on GPUs. ☆70 · Mar 2, 2023 · Updated 3 years ago
- Heron: Automatically Constrained High-Performance Library Generation for Deep Learning Accelerators ☆23 · Jan 30, 2024 · Updated 2 years ago
- SparseTIR: Sparse Tensor Compiler for Deep Learning ☆144 · Mar 31, 2023 · Updated 2 years ago
- Repository for artifact evaluation of ASPLOS 2023 paper "SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning" ☆25 · Feb 24, 2023 · Updated 3 years ago
- Goal: a website to automatically train and certify compiler researchers and developers ☆10 · Nov 24, 2019 · Updated 6 years ago