NVIDIA / nvmath-pythonLinks
NVIDIA Math Libraries for the Python Ecosystem
☆512Updated last month
Alternatives and similar repositories for nvmath-python
Users that are interested in nvmath-python are comparing it to the libraries listed below
Sorting:
- The CUDA target for Numba☆197Updated this week
- NVIDIA curated collection of educational resources related to general purpose GPU programming.☆735Updated last week
- The Foundation for All Legate Libraries☆228Updated this week
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆51Updated last week
- NumPy and SciPy on Multi-Node Multi-GPU systems☆934Updated this week
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆301Updated last week
- Data Parallel Extension for NumPy☆115Updated this week
- An efficient C++17 GPU numerical computing library with Python-like syntax☆1,356Updated this week
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou…☆456Updated 2 weeks ago
- Data Parallel Extension for Numba☆84Updated 2 weeks ago
- CUDA Kernel Benchmarking Library☆736Updated last week
- Kernel Tuner☆368Updated last week
- Python SYCL bindings and SYCL-based Python Array API library☆117Updated this week
- JAX-Toolbox☆348Updated this week
- RAPIDS Memory Manager☆642Updated this week
- LLM training in simple, raw C/CUDA☆105Updated last year
- Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.☆377Updated this week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆357Updated this week
- High-Performance SGEMM on CUDA devices☆107Updated 8 months ago
- A plugin for Jupyter Notebook to run CUDA C/C++ code☆246Updated last year
- Legate Sparse is a Legate library that aims to provide a distributed and accelerated drop-in replacement for the scipy.sparse library on …☆23Updated last week
- ☆589Updated this week
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster☆818Updated 2 weeks ago
- This repository contains examples CUDA usage in Cython code.☆25Updated 4 years ago
- An Online Deep Learning Interface for HPC programs on NVIDIA GPUs☆173Updated last week
- Evaluating Large Language Models for CUDA Code Generation ComputeEval is a framework designed to generate and evaluate CUDA code from Lar…☆67Updated last week
- Zero-copy MPI communication of JAX arrays, for turbo-charged HPC applications in Python☆496Updated 3 weeks ago
- KvikIO - High Performance File IO☆226Updated last week
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)☆415Updated this week
- CUDA Core Compute Libraries☆1,964Updated this week