roostaiyan / CudaSharedPtrLinks

Shared Pointer for Cuda Device Pointers and Cuda Streams, Smart Wrapper to Allocate and Deallocate Cuda Device Buffer.

☆0

Alternatives and similar repositories for CudaSharedPtr

Users that are interested in CudaSharedPtr are comparing it to the libraries listed below

Sorting:

robertmaynard / code-samples
Source code examples from the Parallel Forall Blog
☆96Updated 6 years ago
CUDA-Tutorial / CodeSamples
Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming"
☆91Updated last year
llohse / libnpy
C++ library for reading and writing of numpy's .npy files
☆414Updated 10 months ago
ptheywood / cuda-cmake-github-actions
☆59Updated 11 months ago
GPMueller / eigen-cuda
MWE for using the Eigen library in CUDA kernels
☆119Updated 2 years ago
PatWie / cuda-design-patterns
Some CUDA design patterns and a bit of template magic for CUDA
☆156Updated 2 years ago
eyalroz / cuda-kat
CUDA kernel author's tools
☆113Updated 3 years ago
ingowald / cudaKDTree
☆269Updated last month
ashvardanian / ParallelReductionsBenchmark
Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal, and Rust - all it takes to sum a lot of numbers fast!
☆104Updated 2 weeks ago
NVIDIA / cuCollections
☆566Updated this week
eyalroz / cuda-api-wrappers
Thin, unified, C++-flavored wrappers for the CUDA APIs
☆855Updated last week
codeplaysoftware / SYCL-For-CUDA-Examples
Examples for using SYCL on CUDA
☆62Updated last month
crosetto / cupq
a CUDA implementation of a priority queue
☆83Updated 4 years ago
Rookfighter / least-squares-cpp
A single header-only C++ library for least squares fitting.
☆107Updated 2 years ago
alugowski / fast_matrix_market
Fast and full-featured Matrix Market I/O library for C++, Python, and R
☆81Updated last year
johnarobinson77 / KdTreeGPU
A gpu based implementation of a K-D Tree Builder
☆114Updated 5 years ago
DavidAce / h5pp
A C++17 interface for HDF5
☆97Updated 4 months ago
harrism / ranger
Generate simple index ranges in C++ and CUDA C++
☆39Updated 2 years ago
Apress / pro-TBB
Source Code for `Today’s TBB: C++ Parallel Programming with Threading Building Blocks, Second Edition' by Michael Voss and James Reinder…
☆193Updated 2 months ago
jrmadsen / PTL
Parallel Tasking Library (PTL) - Lightweight C++11 mutilthreading tasking system featuring thread-pool, task-groups, and lock-free task q…
☆48Updated 8 months ago
aminnj / cpptqdm
(unofficial) tqdm-like single header c++ pretty progress bar
☆222Updated last year
stotko / stdgpu
stdgpu: Efficient STL-like Data Structures on the GPU
☆1,234Updated 3 months ago
smrfeld / cmake_cpp_pybind11_tutorial
Tutorial for wrapping C++ library into Python using pybind11 and CMake
☆147Updated last year
cusplibrary / cusplibrary
CUSP : A C++ Templated Sparse Matrix Library
☆415Updated last week
andrewssobral / dtt
A C++ header-only for data transfer between linear algebra libraries (Eigen, Armadillo, OpenCV, ArrayFire, LibTorch).
☆81Updated last year
shamanDevel / cuMat
An expression template based linear algebra library running completely on the GPU using CUDA
☆25Updated 4 years ago
jeffhammond / dpcpp-tutorial
Intel Data Parallel C++ (and SYCL 2020) Tutorial.
☆94Updated 3 years ago
sekelle / cornerstone-octree
Local and distributed octrees based on Morton codes with halo discovery and exchange with a 3D collision detection algorithm
☆44Updated 3 weeks ago
arborx / ArborX
Performance-portable geometric search library
☆207Updated 2 weeks ago
owensgroup / BGHT
BGHT: High-performance static GPU hash tables.
☆70Updated last month