nv-legate / cupynumericLinks

NumPy and SciPy on Multi-Node Multi-GPU systems

☆921

Alternatives and similar repositories for cupynumeric

Users that are interested in cupynumeric are comparing it to the libraries listed below

Sorting:

nv-legate / legate
The Foundation for All Legate Libraries
☆219Updated this week
NVIDIA / nvmath-python
NVIDIA Math Libraries for the Python Ecosystem
☆338Updated 3 weeks ago
dmlc / dlpack
common in-memory tensor structure
☆1,044Updated last month
dionhaefner / pyhpc-benchmarks
A suite of benchmarks for CPU and GPU performance of the most popular high-performance libraries for Python
☆328Updated 9 months ago
dgasmith / opt_einsum
⚡️Optimizing einsum functions in NumPy, Tensorflow, Dask, and more with contraction order optimization.
☆935Updated last month
rapidsai / rmm
RAPIDS Memory Manager
☆603Updated this week
mpi4py / mpi4py
Python bindings for MPI
☆869Updated last week
NVIDIA / MatX
An efficient C++17 GPU numerical computing library with Python-like syntax
☆1,341Updated this week
pytorch / torchdynamo
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
☆1,056Updated last year
rapidsai / dask-cuda
Utilities for Dask and CUDA interactions
☆311Updated this week
NVIDIA / NVTX
The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou…
☆427Updated 2 weeks ago
NVIDIA / numba-cuda
The CUDA target for Numba
☆158Updated last week
KernelTuner / kernel_tuner
Kernel Tuner
☆356Updated last week
NVIDIA / multi-gpu-programming-models
Examples demonstrating available options to program multiple GPUs in a single node or a cluster
☆765Updated 5 months ago
NVIDIA / AMGX
Distributed multigrid linear solver library on GPU
☆581Updated 6 months ago
inducer / pycuda
CUDA integration for Python, plus shiny features
☆1,972Updated last month
NVIDIA / accelerated-computing-hub
NVIDIA curated collection of educational resources related to general purpose GPU programming.
☆611Updated 3 weeks ago
spcl / dace
DaCe - Data Centric Parallel Programming
☆544Updated this week
NVIDIA / cuCollections
☆561Updated this week
rapidsai / raft
RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-a…
☆921Updated this week
inducer / loopy
A code generator for array-based code on CPUs and GPUs
☆609Updated last week
NVIDIA / PyProf
A GPU performance profiling tool for PyTorch models
☆503Updated 4 years ago
pytorch / kineto
A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
☆845Updated this week
NVIDIA / nvbench
CUDA Kernel Benchmarking Library
☆692Updated this week
NVIDIA / cub
[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl
☆1,765Updated last year
gpuopenanalytics / pynvml
Provide Python access to the NVML library for GPU diagnostics
☆242Updated 8 months ago
scikit-hep / awkward
Manipulate JSON-like data with NumPy-like idioms.
☆897Updated this week
IntelPython / dpctl
Python SYCL bindings and SYCL-based Python Array API library
☆116Updated this week
dfm / extending-jax
Extending JAX with custom C++ and CUDA code
☆399Updated 11 months ago
pytorch / torchx
TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…
☆378Updated last week