roostaiyan / CudaSharedPtr
Shared pointer for CUDA device pointers and CUDA streams: a smart wrapper that allocates and deallocates CUDA device buffers.
☆26 Updated last year
Related projects
Alternatives and complementary repositories for CudaSharedPtr
- Source code examples from the Parallel Forall Blog ☆94 Updated 5 years ago
- MWE for using the Eigen library in CUDA kernels ☆117 Updated 2 years ago
- Some CUDA design patterns and a bit of template magic for CUDA ☆146 Updated last year
- CUDA kernel author's tools ☆109 Updated 2 years ago
- Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming" ☆82 Updated last year
- Sample code for a sparse Cholesky solver using the cuSPARSE and cuSOLVER libraries ☆18 Updated 4 years ago
- a CUDA implementation of a priority queue ☆81 Updated 4 years ago
- ☆59 Updated last year
- Intel Data Parallel C++ (and SYCL 2020) Tutorial. ☆92 Updated 2 years ago
- A C++ header-only library for data transfer between linear algebra libraries (Eigen, Armadillo, OpenCV, ArrayFire, LibTorch). ☆80 Updated 6 months ago
- Example of how to use CUDA with CMake >= 3.8 ☆69 Updated last year
- ☆56 Updated 2 months ago
- Examples for using SYCL on CUDA ☆60 Updated 2 weeks ago
- Fastest Random Distribution Generator for Eigen ☆92 Updated 2 months ago
- DLA-Future ☆65 Updated this week
- Reference Implementation for stdBLAS ☆128 Updated 3 weeks ago
- A C++17 interface for HDF5 ☆91 Updated 5 months ago
- SuiteSparse: a suite of sparse matrix packages by @DrTimothyAldenDavis et al. with native CMake support ☆52 Updated 4 months ago
- PLEASE SEE THE OFFICIAL REPOSITORY. THIS IS NOT MAINTAINED ANYMORE. ☆93 Updated 4 years ago
- A single header-only C++ library for least squares fitting. ☆100 Updated last year
- Generate simple index ranges in C++ and CUDA C++ ☆39 Updated last year
- A GPU-based implementation of a K-D Tree Builder ☆96 Updated 5 years ago
- Parallel Tasking Library (PTL) - Lightweight C++11 multithreading tasking system featuring thread-pool, task-groups, and lock-free task q… ☆43 Updated last week
- Fast and full-featured Matrix Market I/O library for C++, Python, and R ☆75 Updated 3 months ago
- Tutorial for wrapping C++ library into Python using pybind11 and CMake ☆132 Updated 10 months ago
- Subset of BLAS routines optimized for NVIDIA GPUs ☆65 Updated last year
- Combined array and automatic differentiation library in C++ ☆165 Updated 8 months ago
- Offload Eigen operations to GPUs ☆18 Updated 2 years ago
- A C++17 message passing library based on MPI ☆168 Updated 9 months ago
- BGHT: High-performance static GPU hash tables. ☆55 Updated 2 months ago