[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl
☆5,000Feb 8, 2024Updated 2 years ago
Alternatives and similar repositories for thrust
Users that are interested in thrust are comparing it to the libraries listed below
Sorting:
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl☆1,821Oct 9, 2023Updated 2 years ago
- [ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl☆2,308Feb 7, 2024Updated 2 years ago
- CUDA Core Compute Libraries☆2,217Updated this week
- Patterns and behaviors for GPU computing☆1,766Jan 17, 2026Updated 2 months ago
- ArrayFire: a general purpose GPU library.☆4,868Mar 7, 2026Updated last week
- CUDA Templates and Python DSLs for High-Performance Linear Algebra☆9,442Updated this week
- oneAPI Threading Building Blocks (oneTBB)☆6,582Updated this week
- stdgpu: Efficient STL-like Data Structures on the GPU☆1,255Updated this week
- A General-purpose Task-parallel Programming System using Modern C++☆11,768Mar 11, 2026Updated last week
- A C++ GPU Computing Library for OpenCL☆1,648Mar 11, 2026Updated last week
- The C++ Standard Library for Parallelism and Concurrency☆2,803Updated this week
- ☆626Mar 12, 2026Updated last week
- Samples for CUDA Developers which demonstrates features in CUDA Toolkit☆8,953Jan 6, 2026Updated 2 months ago
- Abseil Common Libraries (C++)☆17,120Updated this week
- A modern formatting library☆23,328Updated this week
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).☆569Sep 15, 2025Updated 6 months ago
- Thin, unified, C++-flavored wrappers for the CUDA APIs☆876Feb 16, 2026Updated last month
- Range library for C++14/17/20, basis for C++20's std::ranges☆4,355Mar 23, 2025Updated 11 months ago
- A microbenchmark support library☆10,070Updated this week
- Seamless operability between C++11 and Python☆17,757Mar 7, 2026Updated last week
- CUSP : A C++ Templated Sparse Matrix Library☆419Mar 11, 2026Updated last week
- A modern, C++-native, test framework for unit-tests, TDD and BDD - using C++14, C++17 and later (C++11 support is in v2.x branch, and C++…☆20,247Mar 11, 2026Updated last week
- An efficient C++20 GPU numerical computing library with Python-like syntax☆1,406Updated this week
- C++ tensors with broadcasting and lazy computing☆3,712Jan 29, 2026Updated last month
- Source code examples from the Parallel Forall Blog☆1,321Sep 23, 2025Updated 5 months ago
- a language for fast, portable data-parallel computation☆6,601Updated this week
- CUDA Data Parallel Primitives Library☆438Nov 9, 2018Updated 7 years ago
- An open-source C++ library developed and used at Facebook.☆30,280Mar 12, 2026Updated last week
- C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE, WebAssembly, VSX, RISC-…☆2,642Mar 11, 2026Updated last week
- Fast C++ logging library.☆28,471Updated this week
- HIP: C++ Heterogeneous-Compute Interface for Portability☆4,345Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction☆2,475Mar 12, 2026Updated last week
- Simple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.☆347Apr 14, 2022Updated 3 years ago
- A fast multi-producer, multi-consumer lock-free concurrent queue for C++11☆12,125Feb 14, 2026Updated last month
- oneAPI DPC++ Library (oneDPL) https://software.intel.com/content/www/us/en/develop/tools/oneapi/components/dpc-library.html☆764Updated this week
- CUDA Library Samples☆2,346Updated this week
- RAPIDS Memory Manager☆685Updated this week
- Open Source Parallel STL implementation☆530Jan 26, 2024Updated 2 years ago
- Programmable CUDA/C++ GPU Graph Analytics☆1,069Feb 28, 2026Updated 2 weeks ago