pkestene / pybind11-cuda
☆23Updated last year
Alternatives and similar repositories for pybind11-cuda
Users who are interested in pybind11-cuda are comparing it to the libraries listed below.
- Template for GPU accelerated python libraries☆50Updated 2 years ago
- Tutorial for wrapping a C++ library into Python using pybind11 and CMake☆148Updated last year
- An example combining scikit-build and pybind11☆136Updated last week
- Some CUDA design patterns and a bit of template magic for CUDA☆156Updated 2 years ago
- An expression template based linear algebra library running completely on the GPU using CUDA☆25Updated 4 years ago
- How to use CUDA with Python numpy☆40Updated 7 years ago
- Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming"☆93Updated 2 years ago
- ☆59Updated 3 weeks ago
- A C++ header-only library for data transfer between linear algebra libraries (Eigen, Armadillo, OpenCV, ArrayFire, LibTorch).☆82Updated last year
- Algorithms implemented in CUDA + resources about GPGPU☆58Updated 3 years ago
- CUDA kernel author's tools☆113Updated 3 years ago
- Source code examples from the Parallel Forall Blog☆96Updated 6 years ago
- Pybind11 tool for making docstrings from C++ comments☆43Updated last year
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆51Updated last week
- Easier, quicker command-line CUDA profiling☆31Updated last year
- Python package for creating interpolating splines (for position and rotation)☆78Updated this week
- A nanobind example project☆112Updated last week
- Template for starting CUDA/C++ project using CMake with Github Action for CI☆31Updated 3 months ago
- C++ library for reading and writing of numpy's .npy files☆417Updated last year
- Learn OpenMP examples step by step☆97Updated 9 months ago
- Example of wrapping CGAL Delaunay triangulations and mesh refinement using pybind11☆43Updated 6 years ago
- Fast and full-featured Matrix Market I/O library for C++, Python, and R☆82Updated last year
- CUDA Template Functions☆20Updated 10 months ago
- Local and distributed octrees based on Morton codes with halo discovery and exchange with a 3D collision detection algorithm☆44Updated 3 months ago
- A set of hands-on tutorials for CUDA programming☆240Updated last year
- Runs a single CUDA/OpenCL kernel, taking its source from a file and arguments from the command-line☆24Updated last week
- Skeletonide is a parallel implementation of the Zhang-Suen morphological thinning algorithm written in Halide-lang. Use it for fast skeletoni…☆14Updated 5 years ago
- A minimal CMake-based project skeleton for developing a CUDA application☆17Updated last year
- CUDA tool set for non-C++ languages that provides functionality similar to Thrust, with NVRTC at its core.☆59Updated 3 years ago
- Exploring using stdpar and Cython☆34Updated 4 years ago