Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.
☆57 · Mar 16, 2026 · Updated last week
Alternatives and similar repositories for numbast
Users that are interested in numbast are comparing it to the libraries listed below.
- The CUDA target for Numba ☆263 · Mar 16, 2026 · Updated last week
- Exploring using stdpar and Cython ☆34 · Nov 19, 2020 · Updated 5 years ago
- NVIDIA Math Libraries for the Python Ecosystem ☆555 · Mar 11, 2026 · Updated last week
- ☆27 · Dec 20, 2023 · Updated 2 years ago
- Source code supporting the High Performance Graphics 2022 paper: Supporting Unified Shader Specialization by Co-opting C++ Features ☆14 · Jul 9, 2022 · Updated 3 years ago
- A Single .py File Sympy Extension to Generate Eigen C++ Code from the Symbols. ☆12 · Dec 17, 2025 · Updated 3 months ago
- ☆19 · Mar 16, 2026 · Updated last week
- Unified Incremental Potential Contact Framework Documentation ☆13 · Updated this week
- ☆16 · Feb 26, 2026 · Updated 3 weeks ago
- A reference implementation of std::simd, providing data parallel types in the C++ standard ☆14 · Mar 9, 2020 · Updated 6 years ago
- Repository for participants of the "Containers for HPC" training ☆11 · Feb 11, 2026 · Updated last month
- OptiX SDK headers, everything needed to build & run OptiX applications. SDK samples not included. ☆43 · Dec 12, 2025 · Updated 3 months ago
- Automatic differentiation for Triton Kernels ☆29 · Aug 12, 2025 · Updated 7 months ago
- A Python rule engine powered by numba ☆18 · Oct 26, 2025 · Updated 4 months ago
- A high performance and friendly GPU LBVH implementation. ☆41 · Oct 23, 2025 · Updated 5 months ago
- Template for starting CUDA/C++ project using CMake with Github Action for CI ☆31 · Jun 25, 2025 · Updated 8 months ago
- gQuery: Fast CPU and GPU-Accelerated Geometry Queries ☆20 · Apr 14, 2025 · Updated 11 months ago
- Ship correct and fast LLM kernels to PyTorch ☆145 · Jan 14, 2026 · Updated 2 months ago
- NVIDIA curated collection of educational resources related to general purpose GPU programming. ☆1,348 · Mar 15, 2026 · Updated last week
- The repository contains container recipes to build the entire stack of Xeus-Cling and Cling including cuda extension with just a few comm… ☆10 · Dec 22, 2020 · Updated 5 years ago
- Julia package to read MatrixMarket file format ☆32 · Jul 4, 2024 · Updated last year
- GPU accelerated multigrid library for Python ☆69 · Sep 24, 2024 · Updated last year
- SuperGlue -- A C++ Library for Data-Dependency Driven Task Parallelism ☆36 · Sep 28, 2015 · Updated 10 years ago
- Microbenchmarks showing relative performance of different Python functions/patterns. ☆13 · Oct 3, 2025 · Updated 5 months ago
- ☆11 · Mar 5, 2026 · Updated 2 weeks ago
- A simple BVH data structure. ☆23 · Nov 12, 2025 · Updated 4 months ago
- A warp-oriented dynamic hash table for GPUs ☆76 · Jan 19, 2024 · Updated 2 years ago
- Celltree data structure for searching for points, lines, boxes, and cells (convex polygons) in a two dimensional unstructured mesh. ☆13 · Mar 4, 2026 · Updated 2 weeks ago
- ☆18 · Nov 11, 2025 · Updated 4 months ago
- A dynamic GPU memory allocator, suitable for warp synchronized scenarios. ☆11 · Aug 20, 2019 · Updated 6 years ago
- Pytorch routines for (Ker)nel (Mac)hines ☆11 · Oct 10, 2025 · Updated 5 months ago
- ☆17 · Sep 1, 2025 · Updated 6 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆165 · Updated this week
- Scripts for building Singularity images ☆10 · Mar 26, 2019 · Updated 6 years ago
- 🎃 GPU load-balancing library for regular and irregular computations. ☆66 · Sep 9, 2025 · Updated 6 months ago
- CUDA Python: Performance meets Productivity ☆3,193 · Updated this week
- Open-source library for Graph Streaming. Solves the connected components problem using sub-linear space. Published in SIGMOD'22. ☆10 · Mar 12, 2026 · Updated last week
- LaTeX file checking tools ☆49 · Mar 13, 2026 · Updated last week
- NCAR HPC Docs Repository ☆14 · Updated this week