oKatanaaa / CudaCythonSamples
This repository contains examples of CUDA usage in Cython code.
☆25 Updated 3 years ago
Alternatives and similar repositories for CudaCythonSamples
Users who are interested in CudaCythonSamples are comparing it to the libraries listed below.
- NVIDIA Math Libraries for the Python Ecosystem ☆338 Updated last month
- GPU accelerated multigrid library for Python ☆62 Updated 10 months ago
- Algebraic Multigrid Solvers in Python ☆614 Updated 3 months ago
- Orthogonal polynomials in all shapes and sizes. ☆185 Updated last year
- Zero-copy MPI communication of JAX arrays, for turbo-charged HPC applications in Python ☆489 Updated 2 weeks ago
- Numerical integration in arbitrary dimensions on the GPU using PyTorch / TF / JAX ☆206 Updated last week
- Exploring using stdpar and Cython ☆34 Updated 4 years ago
- The CUDA target for Numba ☆163 Updated this week
- Python wrapper for the sparse QR decomposition in SuiteSparseQR. ☆36 Updated 6 months ago
- An example combining scikit-build and pybind11 ☆134 Updated last week
- A suite of benchmarks for CPU and GPU performance of the most popular high-performance libraries for Python ☆328 Updated 10 months ago
- An Online Deep Learning Interface for HPC programs on NVIDIA GPUs ☆169 Updated 3 weeks ago
- Example python (numpy) -- CUDA installable package with a C-extension library ☆143 Updated 5 years ago
- Sparse matrix tools extending scipy.sparse, but with incompatible licenses ☆174 Updated 3 months ago
- GPU/TPU accelerated nonlinear least-squares curve fitting using JAX ☆58 Updated 2 years ago
- ⚡️Optimizing einsum functions in NumPy, Tensorflow, Dask, and more with contraction order optimization. ☆935 Updated last month
- NumPy and SciPy on Multi-Node Multi-GPU systems ☆922 Updated this week
- A header-only C++ library for sketching in randomized linear algebra ☆92 Updated 3 weeks ago
- A JAX-based research framework for differentiable and parallelizable acoustic simulations, on CPU, GPUs and TPUs ☆174 Updated 10 months ago
- OpenMP for Python in Numba ☆118 Updated 3 months ago
- XLB: Accelerated Lattice Boltzmann (XLB) for Physics-based ML ☆379 Updated 3 weeks ago
- Nonuniform fast Fourier transforms of types 1 and 2, in 1D, 2D, and 3D, on the GPU ☆89 Updated last year
- Legate Sparse is a Legate library that aims to provide a distributed and accelerated drop-in replacement for the scipy.sparse library on … ☆23 Updated last week
- S2FFT: Differentiable and accelerated spherical transforms ☆198 Updated this week
- Extending JAX with custom C++ and CUDA code ☆398 Updated 11 months ago
- PyTorch implementation of Levenberg-Marquardt training algorithm ☆72 Updated 4 months ago
- Data Parallel Extension for Numba ☆82 Updated 8 months ago
- Example to build PyTorch CUDA extension using CMake (with pybind11 and scikit-build) ☆11 Updated 5 years ago
- Template for GPU accelerated python libraries ☆49 Updated last year
- Python interface to the Intel MKL Pardiso library to solve large sparse linear systems of equations ☆142 Updated last year