oKatanaaa / CudaCythonSamplesLinks
This repository contains examples CUDA usage in Cython code.
☆25Updated 4 years ago
Alternatives and similar repositories for CudaCythonSamples
Users that are interested in CudaCythonSamples are comparing it to the libraries listed below
Sorting:
- NVIDIA Math Libraries for the Python Ecosystem☆537Updated 2 weeks ago
- Template for GPU accelerated python libraries☆50Updated 2 years ago
- Zero-copy MPI communication of JAX arrays, for turbo-charged HPC applications in Python☆498Updated 2 weeks ago
- The CUDA target for Numba☆222Updated this week
- Exploring using stdpar and Cython☆34Updated 5 years ago
- Extending JAX with custom C++ and CUDA code☆401Updated last year
- Numerical integration in arbitrary dimensions on the GPU using PyTorch / TF / JAX☆211Updated 4 months ago
- Orthogonal polynomials in all shapes and sizes.☆187Updated last year
- S2FFT: Differentiable and accelerated spherical transforms☆208Updated this week
- ⚡️Optimizing einsum functions in NumPy, Tensorflow, Dask, and more with contraction order optimization.☆961Updated 5 months ago
- An example combining scikit-build and pybind11☆141Updated last week
- Newton and Quasi-Newton optimization with PyTorch☆368Updated 6 months ago
- Volume Render a Datacube☆51Updated 6 months ago
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆52Updated this week
- An Online Deep Learning Interface for HPC programs on NVIDIA GPUs☆176Updated this week
- PyTorch implementation of Levenberg-Marquardt training algorithm☆87Updated last week
- Sparse matrix tools extending scipy.sparse, but with incompatible licenses☆182Updated 7 months ago
- Python wrapper for the sparse QR decomposition in SuiteSparseQR.☆38Updated 9 months ago
- Example to build PyTorch CUDA extension using CMake (with pybind11 and scikit-build)☆12Updated 5 years ago
- GPU accelerated multigrid library for Python☆66Updated last year
- Example python (numpy) -- CUDA installable package with a C-extension library☆143Updated 6 years ago
- A suite of benchmarks for CPU and GPU performance of the most popular high-performance libraries for Python☆333Updated last year
- NumPy and SciPy on Multi-Node Multi-GPU systems☆954Updated last week
- A JAX-based research framework for differentiable and parallelizable acoustic simulations, on CPU, GPUs and TPUs☆187Updated last year
- Fusing Taichi into PyTorch☆145Updated 2 years ago
- XLB: Accelerated Lattice Boltzmann (XLB) for Physics-based ML☆423Updated 2 weeks ago
- Differentiable scientific computing library☆156Updated last year
- Wraps PyTorch code in a JIT-compatible way for JAX. Supports automatically defining gradients for reverse-mode AutoDiff.☆58Updated 4 months ago
- How to use CUDA with Python numpy☆39Updated 8 years ago
- Tutorial for wrapping C++ library into Python using pybind11 and CMake☆150Updated last year