NVIDIA / nvmath-pythonLinks
NVIDIA Math Libraries for the Python Ecosystem
☆343Updated last month
Alternatives and similar repositories for nvmath-python
Users that are interested in nvmath-python are comparing it to the libraries listed below
Sorting:
- The CUDA target for Numba☆181Updated this week
- NVIDIA curated collection of educational resources related to general purpose GPU programming.☆660Updated last week
- The Foundation for All Legate Libraries☆222Updated this week
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆50Updated last week
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou…☆443Updated this week
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆291Updated 2 weeks ago
- NumPy and SciPy on Multi-Node Multi-GPU systems☆927Updated this week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆349Updated this week
- An Online Deep Learning Interface for HPC programs on NVIDIA GPUs☆170Updated this week
- Data Parallel Extension for NumPy☆109Updated this week
- Kernel Tuner☆357Updated last week
- Data Parallel Extension for Numba☆82Updated 9 months ago
- Python SYCL bindings and SYCL-based Python Array API library☆116Updated this week
- A plugin for Jupyter Notebook to run CUDA C/C++ code☆239Updated 11 months ago
- CUDA Kernel Benchmarking Library☆706Updated last week
- LLM training in simple, raw C/CUDA☆104Updated last year
- Legate Sparse is a Legate library that aims to provide a distributed and accelerated drop-in replacement for the scipy.sparse library on …☆23Updated 3 weeks ago
- JAX-Toolbox☆331Updated this week
- Evaluating Large Language Models for CUDA Code Generation ComputeEval is a framework designed to generate and evaluate CUDA code from Lar…☆60Updated 2 months ago
- A hands-on introduction to tuning GPU kernels using Kernel Tuner https://github.com/KernelTuner/kernel_tuner/☆32Updated 4 months ago
- Nvidia contributed CUDA tutorial for Numba☆256Updated 3 years ago
- Zero-copy MPI communication of JAX arrays, for turbo-charged HPC applications in Python☆490Updated last week
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial☆293Updated 2 months ago
- RAPIDS Memory Manager☆612Updated last week
- ☆73Updated last week
- KvikIO - High Performance File IO☆223Updated this week
- High-Performance SGEMM on CUDA devices☆97Updated 7 months ago
- NVIDIA tools guide☆144Updated 7 months ago
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster☆777Updated 6 months ago
- ☆50Updated 3 months ago