NVIDIA / numbast
Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.
☆51 Updated last week
Alternatives and similar repositories for numbast
Users who are interested in numbast are comparing it to the libraries listed below.
- The CUDA target for Numba ☆206 Updated this week
- The Foundation for All Legate Libraries ☆229 Updated this week
- Data Parallel Extension for Numba ☆84 Updated last month
- Data Parallel Extension for NumPy ☆116 Updated this week
- ☆80 Updated last week
- Analyze graph/hierarchical performance data using pandas dataframes ☆115 Updated this week
- Exploring using stdpar and Cython ☆34 Updated 4 years ago
- NPBench - A Benchmarking Suite for High-Performance NumPy ☆89 Updated 3 weeks ago
- Generate simple index ranges in C++ and CUDA C++ ☆39 Updated 2 years ago
- Python SYCL bindings and SYCL-based Python Array API library ☆117 Updated this week
- Python bindings for OpenSHMEM ☆24 Updated last month
- NVIDIA Math Libraries for the Python Ecosystem ☆522 Updated last month
- ☆46 Updated last week
- Next generation library for iterative sparse solvers for ROCm platform ☆89 Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆115 Updated this week
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems. ☆61 Updated 2 weeks ago
- LLM training in simple, raw C/CUDA ☆107 Updated last year
- C++ HPC Tutorial materials ☆55 Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools ☆135 Updated last week
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targeting current and upcoming high-performance computing (HPC) sy… ☆130 Updated 5 months ago
- A task benchmark ☆44 Updated last year
- A unified framework across multiple programming platforms ☆41 Updated 4 months ago
- Cooperative Primitives for CUDA C++ Kernel Authors. This repository contains CUB PRs from Q4 2019 until Q4 2020. ☆22 Updated 5 years ago
- AMD’s C++ library for accelerating tensor primitives ☆46 Updated 2 weeks ago
- Programmable JIT Compilation and Optimization for C/C++ using LLVM ☆31 Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆69 Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆124 Updated this week
- Subset of BLAS routines optimized for NVIDIA GPUs ☆73 Updated 2 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆83 Updated 2 weeks ago
- Compiler-agnostic metaprogramming library providing concepts, type operations and tuples for C++ and CUDA ☆92 Updated this week