The CUDA target for Numba
☆287Jun 13, 2026Updated 2 weeks ago
Alternatives and similar repositories for numba-cuda
Users that are interested in numba-cuda are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆62Updated this week
- NVIDIA Math Libraries for the Python Ecosystem☆589May 28, 2026Updated last month
- CUDA Python: Performance meets Productivity☆3,295Updated this week
- Worked example of the process from Python source to CUDA kernel execution with Numba☆45Sep 11, 2024Updated last year
- ☆65Apr 26, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Benchmark tests supporting the TiledCUDA library.☆19Nov 19, 2024Updated last year
- CUDA Core Compute Libraries☆2,395Updated this week
- MPI-Rockstar: a Hybrid MPI and OpenMP Parallel Implementation of the Rockstar Halo finder☆15Apr 27, 2026Updated 2 months ago
- Asynchronous I/O for HDF5☆24Feb 10, 2026Updated 4 months ago
- ☆46Jun 15, 2026Updated last week
- An experimental communicating attention kernel based on DeepEP.☆34Jul 29, 2025Updated 11 months ago
- Manipulating ragged arrays in an Array API compliant way.☆48Jun 22, 2026Updated last week
- Performance portable parallel programming in Python backed by Kokkos☆127Jun 15, 2026Updated 2 weeks ago
- Luthier, a GPU binary instrumentation tool for AMD GPUs☆27Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.☆112Jun 28, 2025Updated last year
- NumPy and SciPy on Multi-Node Multi-GPU systems☆978Updated this week
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou…☆540Jun 19, 2026Updated last week
- Histograms with task scheduling.☆25Updated this week
- ☆73Jun 11, 2026Updated 2 weeks ago
- DeeperGEMM: crazy optimized version☆86May 5, 2025Updated last year
- Offline as of 2026-03-13☆14Mar 13, 2026Updated 3 months ago
- ☆651Updated this week
- HIP backend patch for Numba, the NumPy aware dynamic Python compiler using LLVM.☆21May 11, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling☆28Jun 20, 2026Updated last week
- Numba compatible SCFG (Structured Control Flow Graphs) utilities.☆29Updated this week
- An MPI ABI compatibility layer☆34Jun 17, 2026Updated last week
- Is the GIL seeing someone else? How's about repetitively calling and seeing how long it takes to answer?☆16Jan 7, 2026Updated 5 months ago
- ☆52May 19, 2025Updated last year
- Implementation of AMD HIP for CPUs☆22Jun 16, 2020Updated 6 years ago
- CUDA Kernel Benchmarking Library☆878Jun 22, 2026Updated last week
- The C++ Standard Library for your entire system.☆28May 29, 2026Updated last month
- An Aspiring Drop-In Replacement for Pandas at Scale☆74Oct 19, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- General purpose, language-agnostic Continuous Benchmarking (CB) framework☆35Apr 15, 2020Updated 6 years ago
- Tools and libraries for writing Kokkos-enabled HPC C++ in E3SM ecosystem☆22Jun 18, 2026Updated last week
- The Foundation for All Legate Libraries☆240Updated this week
- NVIDIA curated collection of educational resources related to general purpose GPU programming.☆1,787Updated this week
- GPU移植のための実装例(直接法に基づくN体計算)☆17Apr 11, 2026Updated 2 months ago
- Department of Energy Standard Utility Library☆33Updated this week
- An MPI wrapper for the pytorch tensor library that is automatically differentiable☆10Mar 27, 2023Updated 3 years ago