NVIDIA / numbast
Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.
☆36Updated last week
Alternatives and similar repositories for numbast:
Users that are interested in numbast are comparing it to the libraries listed below
- Generate simple index ranges in C++ and CUDA C++☆39Updated last year
- The CUDA target for Numba☆73Updated this week
- Exploring using stdpar and Cython☆33Updated 4 years ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆110Updated 2 months ago
- Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceler…☆28Updated 8 months ago
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆51Updated 2 weeks ago
- DLA-Future☆70Updated last week
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆106Updated last week
- Analyze graph/hierarchical performance data using pandas dataframes☆113Updated last month
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆84Updated last week
- Astrophysics program simulating the evolution of star systems based on the fast multipole method on adaptive Octrees☆51Updated 2 weeks ago
- AMD’s C++ library for accelerating tensor primitives☆38Updated last week
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆36Updated 6 months ago
- Python bindings for OpenSHMEM☆15Updated 2 weeks ago
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆121Updated 2 months ago
- Autonomic Performance Environment for eXascale (APEX)☆44Updated this week
- Next generation library for iterative sparse solvers for ROCm platform☆78Updated last week
- ☆20Updated this week
- Cooperative Primitives for CUDA C++ Kernel Authors. This repository contains CUB PRs from Q4 2019 until Q4 2020.☆22Updated 4 years ago
- A project and machine deployment model using Spack☆26Updated 3 weeks ago
- Benchmark of expression templates libraries☆40Updated 4 years ago
- A Low-Level Abstraction of Memory Access☆85Updated last year
- C++ User interface for the Platform independent Library Alpaka☆38Updated 7 months ago
- CS infrastructure components for HPC applications☆168Updated this week
- hipFFT is a FFT marshalling library.☆58Updated last week
- ☆37Updated last month
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri…☆20Updated 2 years ago
- Deploy Dask using MPI4Py☆52Updated 2 weeks ago
- Distributed View Extension for Kokkos☆45Updated 3 months ago
- ☆31Updated last week