mgopshtein / cudacppLinks

C++ convenience classes to be used with CUDA code, for both the host and the kerlel parts.

☆55

Alternatives and similar repositories for cudacpp

Users that are interested in cudacpp are comparing it to the libraries listed below

Sorting:

agency-library / agency
Execution primitives for C++
☆153Updated 5 years ago
celerity / celerity-runtime
High-level C++ for Accelerator Clusters
☆146Updated 3 weeks ago
root-project / veccore
C++ Library for Portable SIMD Vectorization
☆84Updated 8 months ago
alpaka-group / llama
A Low-Level Abstraction of Memory Access
☆86Updated last year
habanero-rice / hclib
A C/C++ task-based programming model for shared memory and distributed parallel computing.
☆72Updated 5 years ago
brycelelbach / mditerator
A vectorizable multi-dimensional iterator for C++ using the Coroutines TS
☆12Updated 3 years ago
edanor / umesimd
UME::SIMD A library for explicit simd vectorization.
☆91Updated 7 years ago
jrmadsen / PTL
Parallel Tasking Library (PTL) - Lightweight C++11 mutilthreading tasking system featuring thread-pool, task-groups, and lock-free task q…
☆48Updated 8 months ago
kokkos / array_ref
Polymorphic multidimensional array view
☆36Updated 5 years ago
STEllAR-GROUP / libflatarray
Multi-dimensional C++ arrays which store objects in a Struct-of-Arrays (SoA) memory layout for efficient vectorization and zero address g…
☆36Updated 4 years ago
codeplaysoftware / SYCL-ML
SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL.
☆66Updated 5 years ago
pdziepak / ranges-gpu
Experimental ranges for CUDA
☆24Updated 6 years ago
wichtounet / etl
Blazing-fast Expression Templates Library (ETL) with GPU support, in C++
☆228Updated 2 months ago
d36u9 / async
async is a tiny C++ header-only high-performance library for async calls handled by a thread-pool, which is built on top of an unbounded …
☆31Updated 4 years ago
diatomic / diy
data-parallel out-of-core library
☆50Updated 2 weeks ago
taskflow / tfprof
Profiling Taskflow Programs through Visualization
☆50Updated 2 years ago
oprecomp / FloatX
Header-only C++ library for low precision floating point type emulation.
☆175Updated 5 years ago
ashvardanian / ParallelReductionsBenchmark
Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal, and Rust - all it takes to sum a lot of numbers fast!
☆103Updated last week
pacxx / pacxx-llvm
Programming Accelerators with C++ (PACXX)
☆57Updated 7 years ago
eyalroz / cuda-kat
CUDA kernel author's tools
☆112Updated 3 years ago
ProGTX / sycl-gtx
Implementation of the SYCL specification.
☆65Updated last year
vectorclass / add-on
Add-on packages for Vector class library
☆75Updated last year
jeffhammond / dpcpp-tutorial
Intel Data Parallel C++ (and SYCL 2020) Tutorial.
☆94Updated 3 years ago
harrism / hemi
Simple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.
☆348Updated 3 years ago
klalumiere / NiceMPI
An alternative to Boost.MPI for a user friendly C++ interface for MPI (MPICH).
☆19Updated 7 years ago
STEllAR-GROUP / blaze_cuda
WIP · CUDA compatibility for Blaze · https://bitbucket.org/blaze-lib/blaze
☆19Updated 5 years ago
moroneyt / ctla
Compile-time linear algebra in C++
☆56Updated 7 years ago
Heteroflow / Heteroflow
Concurrent CPU-GPU Programming using Task Models
☆103Updated 5 years ago
berkeley-container-library / bcl
The Berkeley Container Library
☆124Updated 2 years ago
google / dimsum
Portable C++ SIMD library
☆173Updated 5 years ago