NVIDIA / numba-cuda
The CUDA target for Numba
☆73Updated this week
Alternatives and similar repositories for numba-cuda:
Users that are interested in numba-cuda are comparing it to the libraries listed below
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆39Updated this week
- Data Parallel Extension for Numba☆79Updated 4 months ago
- NVIDIA Math Libraries for the Python Ecosystem☆248Updated last week
- Data Parallel Extension for NumPy☆104Updated this week
- Deploy Dask using MPI4Py☆52Updated 2 weeks ago
- Analyze graph/hierarchical performance data using pandas dataframes☆113Updated last month
- KvikIO - High Performance File IO☆195Updated this week
- NPBench - A Benchmarking Suite for High-Performance NumPy☆80Updated this week
- POC work on MLIR backend☆53Updated 7 months ago
- Collection of scripts to build PyTorch and the domain libraries from source.☆10Updated this week
- Python bindings for OpenSHMEM☆15Updated 2 weeks ago
- OpenMP for Python in Numba☆96Updated last month
- Exploring using stdpar and Cython☆33Updated 4 years ago
- Python SYCL bindings and SYCL-based Python Array API library☆110Updated this week
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆51Updated 3 weeks ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆110Updated 2 months ago
- Creates performance portable libraries with embedded source representations.☆24Updated 3 months ago
- Numba @jit compatible wrappers for MPI C API tested on Linux, macOS and Windows☆41Updated last week
- Python bindings for UCX☆126Updated last week
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data☆31Updated 4 months ago
- A hands-on introduction to tuning GPU kernels using Kernel Tuner https://github.com/KernelTuner/kernel_tuner/☆30Updated 6 months ago
- Performance portable parallel programming in Python.☆108Updated 5 months ago
- The Foundation for All Legate Libraries☆206Updated this week
- An Aspiring Drop-In Replacement for Pandas at Scale☆75Updated 3 years ago
- Legate Sparse is a Legate library that aims to provide a distributed and accelerated drop-in replacement for the scipy.sparse library on …☆19Updated this week
- RFC document, tooling and other content related to the array API standard☆230Updated 3 weeks ago
- YAKL is A Kokkos Layer: A simple C++ framework for performance portability and Fortran code porting☆62Updated last week
- CPU and GPU tutorial examples☆13Updated 3 weeks ago
- Parallel NumPy seamlessly speeds up NumPy for large arrays (64K+ elements) with no change required to existing code.☆61Updated 4 years ago
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial☆243Updated 3 months ago