pkestene / pybind11-cuda
☆22Updated 10 months ago
Alternatives and similar repositories for pybind11-cuda:
Users that are interested in pybind11-cuda are comparing it to the libraries listed below
- Template for GPU accelerated python libraries☆47Updated last year
- Fast and full-featured Matrix Market I/O library for C++, Python, and R☆77Updated 7 months ago
- How to use CUDA with Python numpy☆38Updated 7 years ago
- A minimal cmake based project skeleton for developping a CUDA application☆15Updated last year
- Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming"☆89Updated last year
- ☆58Updated 6 months ago
- Some CUDA design patterns and a bit of template magic for CUDA☆149Updated last year
- vectorization of the kd-tree data structure and search algorithm☆40Updated 7 years ago
- Massively parallel DBSCAN algorithm implemented in CUDA along with a KD-Tree for searching neighbors.☆12Updated 4 years ago
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆39Updated this week
- An expression template based linear algebra library running completely on the GPU using CUDA☆25Updated 3 years ago
- A nanobind example project☆101Updated 3 weeks ago
- CUDA kernel author's tools☆110Updated 2 years ago
- Tutorial for wrapping C++ library into Python using pybind11 and CMake☆142Updated last year
- GPU accelerated multigrid library for Python☆56Updated 6 months ago
- This repository contains examples CUDA usage in Cython code.☆23Updated 3 years ago
- Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceler…☆28Updated 8 months ago
- An implementation of parallel exclusive scan in CUDA☆62Updated 7 years ago
- An easily integrable Cholesky solver on CPU and GPU☆233Updated 3 months ago
- Abstractions of memory, allocator, vector, tuple, shared_ptr, unique_ptr, bitset, variant and string working on both CPU and GPU☆30Updated this week
- CUDA Template Functions☆19Updated 3 months ago
- CUDA tool set for non-C++ languages that provides similar functionality like Thrust, with NVRTC at its core.☆59Updated 2 years ago
- High-Performance Computing: CPU Instructions, GPU OpenCL & CUDA, etc.☆14Updated 10 months ago
- Example to build PyTorch CUDA extension using CMake (with pybind11 and scikit-build)☆11Updated 4 years ago
- resources pour le cours d'introduction à la programmation des GPUs du mastère spécialisé HPC-AI☆22Updated last year
- Pybind11 tool for making docstrings from C++ comments☆40Updated 11 months ago
- Large-scale nonlinear least-squares optimization library for both sparse and dense problems☆25Updated 4 months ago
- Exploring using stdpar and Cython☆33Updated 4 years ago
- Example of wrapping CGAL Delaunay triangulations and mesh refinement using pybind11☆43Updated 5 years ago
- Loop Nest - Linear algebra compiler and code generator.☆22Updated 2 years ago