NVIDIA / numbast
Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.
☆31Updated last month
Alternatives and similar repositories for numbast:
Users that are interested in numbast are comparing it to the libraries listed below
- The CUDA target for Numba☆41Updated last week
- Exploring using stdpar and Cython☆33Updated 4 years ago
- Generate simple index ranges in C++ and CUDA C++☆39Updated last year
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆103Updated last week
- Data Parallel Extension for Numba☆78Updated 2 months ago
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆48Updated last week
- Deploy Dask using MPI4Py☆52Updated 3 months ago
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆116Updated last month
- DLA-Future☆69Updated this week
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data☆30Updated 2 months ago
- Python bindings for OpenSHMEM☆15Updated last month
- Analyze graph/hierarchical performance data using pandas dataframes☆109Updated 2 months ago
- Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceler…☆27Updated 6 months ago
- ☆33Updated last month
- A project and machine deployment model using Spack☆26Updated this week
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆35Updated 3 months ago
- Vectorised data model base and helper classes.☆20Updated last week
- Astrophysics program simulating the evolution of star systems based on the fast multipole method on adaptive Octrees☆48Updated last month
- NVIDIA Performance Libraries: Sample code☆20Updated this week
- NPBench - A Benchmarking Suite for High-Performance NumPy☆76Updated 2 months ago
- NVIDIA Math Libraries for the Python Ecosystem☆220Updated last month
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆45Updated 3 months ago
- A nanobind example project☆95Updated last week
- ☆19Updated this week
- LAPACK++ is a C++ wrapper around CPU and GPU LAPACK and LAPACK-like linear algebra libraries, developed as part of the SLATE project.☆55Updated last week
- Scientific algorithms implemented on top of the x-stack (xtensor, xsimd ...)☆9Updated 5 years ago
- Data Parallel Extension for NumPy☆101Updated this week
- Distributed View Extension for Kokkos☆43Updated last month
- NVIDIA curated collection of educational resources related to general purpose GPU programming.☆171Updated this week
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆106Updated this week