NVIDIA / numba-cudaLinks
The CUDA target for Numba
☆251Updated this week
Alternatives and similar repositories for numba-cuda
Users that are interested in numba-cuda are comparing it to the libraries listed below
Sorting:
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆57Updated this week
- NVIDIA Math Libraries for the Python Ecosystem☆544Updated 3 weeks ago
- The Foundation for All Legate Libraries☆235Updated this week
- Data Parallel Extension for Numba☆90Updated 4 months ago
- KvikIO - High Performance File IO☆240Updated this week
- Data Parallel Extension for NumPy☆121Updated last week
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆329Updated this week
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou…☆507Updated last week
- Python SYCL bindings and SYCL-based Python Array API library☆121Updated this week
- ☆53Updated this week
- Kernel Tuner☆383Updated this week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆380Updated this week
- RAPIDS Memory Manager☆681Updated this week
- No-GIL Python environment featuring NVIDIA Deep Learning libraries.☆70Updated 9 months ago
- LLM training in simple, raw C/CUDA☆112Updated last year
- NPBench - A Benchmarking Suite for High-Performance NumPy☆91Updated last week
- GitHub Action to install CUDA☆199Updated last month
- ☆55Updated 2 months ago
- ☆74Updated this week
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial☆350Updated 2 months ago
- POC work on MLIR backend☆61Updated last year
- HIP Python Low-level Bindings☆33Updated 3 months ago
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆67Updated 3 weeks ago
- High-Performance FP32 GEMM on CUDA devices☆117Updated last year
- ☆103Updated last week
- CUDA Kernel Benchmarking Library☆809Updated this week
- OpenMP for Python in Numba☆151Updated this week
- Evaluating Large Language Models for CUDA Code Generation ComputeEval is a framework designed to generate and evaluate CUDA code from Lar…☆96Updated last month
- We aim to redefine Data Parallel libraries portabiliy, performance, programability and maintainability, by using C++ standard features, i…☆47Updated this week
- ☆137Updated 3 months ago