Generate simple index ranges in C++ and CUDA C++
☆39Jun 14, 2023Updated 2 years ago
Alternatives and similar repositories for ranger
Users that are interested in ranger are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Dec 23, 2019Updated 6 years ago
- Range-based for loops to iterate over a range of numbers or values☆34Nov 23, 2016Updated 9 years ago
- ☆27Updated this week
- Fast integer division with divisor not known at compile time. To be used primarily in CUDA kernels.☆73Nov 4, 2015Updated 10 years ago
- Runtime choosing of template specializations using compile-time lookup-tables. Compile all states of a template function, but execute the…☆26Dec 31, 2025Updated 2 months ago
- A library to benchmark CUDA code, similar to google benchmark.☆31Apr 18, 2021Updated 4 years ago
- RAPIDS Memory Manager☆686Updated this week
- Unit benchmarks of CUDA event APIs.☆17Apr 23, 2024Updated last year
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).☆569Sep 15, 2025Updated 6 months ago
- VTK-m Tutorial code samples☆10Dec 6, 2021Updated 4 years ago
- ☆27Dec 20, 2023Updated 2 years ago
- YAKL is A Kokkos Layer: A simple C++ framework for performance portability and Fortran code porting☆69Sep 9, 2025Updated 6 months ago
- Cuda matrix computation library that is specified for small matrix operation (3x3, 4x4, 1x3, 1x4, etc.). Including buffer☆18Mar 8, 2024Updated 2 years ago
- ☆627Mar 12, 2026Updated last week
- ☆23Feb 16, 2022Updated 4 years ago
- 🎃 GPU load-balancing library for regular and irregular computations.☆66Sep 9, 2025Updated 6 months ago
- Multiple 1-stencil implementations using nvidia cuda.☆12Dec 2, 2017Updated 8 years ago
- Full-speed Array of Structures access☆177Apr 25, 2023Updated 2 years ago
- ☆16Jul 28, 2021Updated 4 years ago
- An MPI+Kokkos library for logically rectilinear grids☆16Jun 3, 2020Updated 5 years ago
- ☆11Aug 8, 2021Updated 4 years ago
- BaryTree is a library for fast computation of N-body interactions on multiple GPUs, based on barycentric Lagrange and Hermite polynomial …☆12Oct 1, 2021Updated 4 years ago
- A Low-Level Abstraction of Memory Access☆92Feb 29, 2024Updated 2 years ago
- ☆11Aug 2, 2025Updated 7 months ago
- Tutorials of Extending and importing TVM with CMAKE Include dependency.☆15Oct 11, 2024Updated last year
- High-level C++ for Accelerator Clusters☆155Mar 12, 2026Updated last week
- GUI application for DoxyPress☆25Jan 27, 2026Updated last month
- Example codes demonstrating the use of various XSDK packages in combination.☆19Jun 1, 2023Updated 2 years ago
- Easier, quicker command-line CUDA profiling☆53Sep 17, 2024Updated last year
- ☆53Feb 24, 2026Updated last month
- Microbenchmarks showing relative performance of different Python functions/patterns.☆13Oct 3, 2025Updated 5 months ago
- Implementation of AMD HIP for CPUs☆22Jun 16, 2020Updated 5 years ago
- MPI accelerator-integrated communication extensions☆40Apr 4, 2023Updated 2 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆124Mar 12, 2026Updated last week
- Mini-applications that exclusively use the Kokkos programming model☆12Mar 21, 2023Updated 3 years ago
- ☆44Updated this week
- RAPIDS - combined conda package & integration tests for all of RAPIDS libraries☆17Mar 17, 2026Updated last week
- A unit-testing framework for CMake functions☆15Jul 27, 2025Updated 7 months ago
- a small lightweight std::execution work-alike☆66Mar 26, 2025Updated 11 months ago