gmarkall / life-of-a-numba-kernel
Worked example of the process from Python source to CUDA kernel execution with Numba
☆37 Updated 6 months ago
Alternatives and similar repositories for life-of-a-numba-kernel:
Users who are interested in life-of-a-numba-kernel are comparing it to the repositories listed below.
- An Aspiring Drop-In Replacement for Pandas at Scale ☆75 Updated 3 years ago
- A library that translates Python and NumPy to optimized distributed systems code. ☆132 Updated 2 years ago
- Automatically insert nvtx ranges to PyTorch models ☆17 Updated 3 years ago
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings. ☆39 Updated this week
- Cooperative Primitives for CUDA C++ Kernel Authors. This repository contains CUB PRs from Q4 2019 until Q4 2020. ☆22 Updated 4 years ago
- Collection of scripts to build PyTorch and the domain libraries from source. ☆10 Updated this week
- NPBench - A Benchmarking Suite for High-Performance NumPy ☆80 Updated this week
- Codebase associated with the PyTorch compiler tutorial ☆46 Updated 5 years ago
- pytest plugin for a better developer experience when working with the PyTorch test suite ☆44 Updated 3 years ago
- Exploring using stdpar and Cython ☆33 Updated 4 years ago
- ☆51 Updated 7 months ago
- A library for syntactically rewriting Python programs, pronounced (sinner). ☆70 Updated 3 years ago
- ArrayViews: creating specific views to array storage objects ☆17 Updated 6 years ago
- Analyze graph/hierarchical performance data using pandas dataframes ☆113 Updated last month
- The CUDA target for Numba ☆73 Updated this week
- The Foundation for All Legate Libraries ☆206 Updated this week
- A task benchmark ☆41 Updated 7 months ago
- DLPack for Tensorflow ☆36 Updated 4 years ago
- Einsum optimization using opt_einsum and PyTorch FX graph rewriting ☆20 Updated 3 years ago
- ☆16 Updated 2 years ago
- A benchmark to measure performance of popular Gradient boosting algorithms against popular ML datasets. ☆38 Updated 2 years ago
- POC work on MLIR backend ☆53 Updated 7 months ago
- Example python package with pybind11 cpp extension ☆57 Updated 4 years ago
- RFC document, tooling and other content related to the array API standard ☆230 Updated 3 weeks ago
- Test suite for Python array API standard compliance ☆67 Updated this week
- Material for the SC22 Deep Learning at Scale Tutorial ☆40 Updated last year
- ☆15 Updated 5 months ago
- Data and tooling to compare the API surfaces of various array libraries. ☆54 Updated last month
- Scientific algorithms implemented on top of the x-stack (xtensor, xsimd ...) ☆9 Updated 5 years ago
- Python bindings for UCX ☆126 Updated last week