torstem / demo-cuda-pybind11
How to use CUDA with Python numpy
☆37Updated 7 years ago
Alternatives and similar repositories for demo-cuda-pybind11:
Users that are interested in demo-cuda-pybind11 are comparing it to the libraries listed below
- Template for GPU accelerated python libraries☆45Updated last year
- Implementation of ConjugateGradients method using C and Nvidia CUDA☆48Updated 2 years ago
- Example of wrapping CGAL Delaunay triangulations and mesh refinement using pybind11☆43Updated 5 years ago
- Example of using pybind11 with numpy and publishing to PyPI and conda-forge☆25Updated last week
- Template for starting CUDA/C++ project using CMake with Github Action for CI☆29Updated last year
- GPU-Accelerated multigrid solver for Poisson's equation in 2D☆21Updated 3 years ago
- ☆42Updated 6 years ago
- Akinasan team(秋名山车队)'s code base for the 0th Taichi Hackathon.☆18Updated 2 years ago
- A nanobind example project☆95Updated last week
- ☆21Updated 8 months ago
- Conjugate Gradient solver written in CUDA☆29Updated 5 years ago
- CUDA implementation of exclusive prefix sum via Blelloch's algorithm☆26Updated 7 years ago
- MWE for using the Eigen library in CUDA kernels☆118Updated 2 years ago
- ☆23Updated 5 years ago
- An open source library for the GPU-implementation of L-BFGS-B algorithm☆122Updated last month
- A Connected Component Labelling algorithm implemented in CUDA☆46Updated 3 years ago
- ☆61Updated last year
- Volume Render a Datacube☆48Updated 3 months ago
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU"☆28Updated 4 years ago
- BGHT: High-performance static GPU hash tables.☆57Updated 4 months ago
- Conjugate Gradient for Least Squares in CUDA☆51Updated 9 years ago
- MATLAB Code for Parameters of Floating-Point Arithmetics☆9Updated 2 years ago
- Generate simple index ranges in C++ and CUDA C++☆39Updated last year
- 一个尝试固液耦合的沙盒玩具☆10Updated 2 years ago
- Scattered data interpolation with multilevel B-Splines☆75Updated 3 months ago
- CUDA tool set for non-C++ languages that provides similar functionality like Thrust, with NVRTC at its core.☆59Updated 2 years ago
- This is a c++ port initially performed by Luis Ibanez of the LSQR library of Chris Paige and Michael Saunders. The same methodology was a…☆22Updated 4 years ago
- ☆30Updated 7 years ago
- Some CUDA design patterns and a bit of template magic for CUDA☆147Updated last year
- Skeletonide is a parallel implementation of Zhang-Suen morphological thinning algorithm written in Halide-lang. Use it for fast skeletoni…☆13Updated 4 years ago