fynv / ThrustRTC
CUDA tool set for non-C++ languages that provides functionality similar to Thrust, with NVRTC at its core.
☆59 · Updated 2 years ago
Alternatives and similar repositories for ThrustRTC:
Users who are interested in ThrustRTC are comparing it to the libraries listed below:
- Generate simple index ranges in C++ and CUDA C++ ☆39 · Updated last year
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development. ☆35 · Updated 4 months ago
- DLA-Future ☆69 · Updated this week
- Next generation library for iterative sparse solvers for ROCm platform ☆79 · Updated this week
- Examples for using SYCL on CUDA ☆60 · Updated 3 weeks ago
- Distributed Communication-Optimal LU-factorization Algorithm ☆12 · Updated 3 years ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH) ☆105 · Updated last year
- Intel Data Parallel C++ (and SYCL 2020) Tutorial. ☆93 · Updated 3 years ago
- High-performance, GPU-aware communication library ☆84 · Updated 3 weeks ago
- BGHT: High-performance static GPU hash tables. ☆57 · Updated 4 months ago
- A pseudo random number generator library written against the SYCL API. ☆12 · Updated 5 years ago
- ROCm Thrust - run Thrust dependent software on AMD GPUs ☆104 · Updated this week
- Subset of BLAS routines optimized for NVIDIA GPUs ☆67 · Updated last year
- Multi-dimensional C++ arrays which store objects in a Struct-of-Arrays (SoA) memory layout for efficient vectorization and zero address g… ☆36 · Updated 4 years ago
- portFFT is a library implementing Fast Fourier Transforms using SYCL ☆16 · Updated 3 weeks ago
- MATAR is a C++ software library to allow developers to easily create and use dense and sparse data representations that are also portable… ☆26 · Updated this week
- Full-speed Array of Structures access ☆164 · Updated last year
- ☆58 · Updated 5 months ago
- Distributed View Extension for Kokkos ☆43 · Updated last month
- WIP · CUDA compatibility for Blaze · https://bitbucket.org/blaze-lib/blaze ☆17 · Updated 5 years ago
- Distributed-memory, arbitrary-precision, dense and sparse-direct linear algebra, conic optimization, and lattice reduction ☆65 · Updated 3 months ago
- Benchmark of expression templates libraries ☆40 · Updated 4 years ago
- A C++ allocator based on cudaMallocManaged ☆23 · Updated 6 years ago
- Autonomic Performance Environment for eXascale (APEX) ☆42 · Updated 2 weeks ago
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda ☆83 · Updated this week
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support ☆49 · Updated last week
- Interoperability examples for OpenACC. ☆49 · Updated 4 years ago
- ☆23 · Updated 2 years ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis. ☆49 · Updated last year
- MiniAMR Adaptive Mesh Refinement (AMR) Mini-App ☆33 · Updated 2 months ago