pkestene / pybind11-cuda
☆20Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for pybind11-cuda
- Template for GPU accelerated python libraries☆45Updated last year
- How to use CUDA with Python numpy☆37Updated 6 years ago
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆27Updated this week
- Python package for creating interpolating splines (for position and rotation)☆60Updated 3 months ago
- A nanobind example project☆91Updated last week
- Template for starting CUDA/C++ project using CMake with Github Action for CI☆29Updated last year
- This repository contains examples CUDA usage in Cython code.☆20Updated 3 years ago
- resources pour le cours d'introduction à la programmation des GPUs du mastère spécialisé HPC-AI☆22Updated 10 months ago
- CUDA kernel author's tools☆109Updated 2 years ago
- Skeletonide is a parallel implementation of Zhang-Suen morphological thinning algorithm written in Halide-lang. Use it for fast skeletoni…☆12Updated 4 years ago
- Tutorial for wrapping C++ library into Python using pybind11 and CMake☆132Updated 10 months ago
- Some CUDA design patterns and a bit of template magic for CUDA☆146Updated last year
- Create Wheel from CMake projects☆20Updated this week
- Examples from Programming in Parallel with CUDA☆108Updated last year
- CUDA Template Functions☆18Updated 3 months ago
- Exploring using stdpar and Cython☆32Updated 4 years ago
- Example of wrapping CGAL Delaunay triangulations and mesh refinement using pybind11☆43Updated 5 years ago
- Direct solver for sparse SPD matrices for nonlinear optimization. Implements supernodal Cholesky decomposition algorithm, and supports GP…☆85Updated last year
- CUDA tool set for non-C++ languages that provides similar functionality like Thrust, with NVRTC at its core.☆59Updated 2 years ago
- A simple, but fast, triangular solver☆17Updated 3 years ago
- Simple GPU rendering of scientific data with Pytorch, Jax, CuPy, and Warp backends.☆21Updated last year
- Pybind11 tool for making docstrings from C++ comments☆39Updated 6 months ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆43Updated 10 months ago
- CuPy Benchmark☆12Updated 5 years ago
- ☆56Updated 2 months ago
- ☆35Updated last week
- Linnea is an experimental tool for the automatic generation of optimized code for linear algebra problems.☆66Updated 2 years ago
- A GPU performance prediction toolkit for CUDA programs☆16Updated 5 years ago
- Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming"☆82Updated last year
- Fast and full-featured Matrix Market I/O library for C++, Python, and R☆75Updated 3 months ago