dionhaefner / pyhpc-benchmarks
A suite of benchmarks for CPU and GPU performance of the most popular high-performance libraries for Python
☆315 Updated 4 months ago
Alternatives and similar repositories for pyhpc-benchmarks:
Users who are interested in pyhpc-benchmarks are comparing it to the libraries listed below.
- RFC document, tooling and other content related to the array API standard ☆226 Updated last week
- Zero-copy MPI communication of JAX arrays, for turbo-charged HPC applications in Python ☆460 Updated 3 weeks ago
- The Foundation for All Legate Libraries ☆203 Updated last week
- An Aspiring Drop-In Replacement for Pandas at Scale ☆75 Updated 3 years ago
- Extending JAX with custom C++ and CUDA code ☆385 Updated 5 months ago
- Python bindings for UCX ☆124 Updated this week
- Automatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX ☆220 Updated 4 years ago
- Parallel Programming with Python and Charm++ ☆294 Updated this week
- Nvidia contributed CUDA tutorial for Numba ☆241 Updated 2 years ago
- Documentation: ☆119 Updated last year
- A tensor-aware point-to-point communication primitive for machine learning ☆253 Updated 2 years ago
- An Aspiring Drop-In Replacement for NumPy at Scale ☆824 Updated last week
- Python helpers to limit the number of threads used in native libraries that handle their own internal threadpool (BLAS and OpenMP impleme… ☆359 Updated 7 months ago
- Mathematical operations for JAX pytrees ☆198 Updated 2 months ago
- Example Numba implementations of functions ☆171 Updated 2 years ago
- ⚡️Optimizing einsum functions in NumPy, Tensorflow, Dask, and more with contraction order optimization. ☆881 Updated 4 months ago
- Machine Learning for HPC Workflows ☆126 Updated 3 months ago
- A code generator for array-based code on CPUs and GPUs ☆597 Updated this week
- Sparse multi-dimensional arrays for the PyData ecosystem ☆614 Updated this week
- A library that translates Python and NumPy to optimized distributed systems code. ☆132 Updated 2 years ago
- Utilities for Dask and CUDA interactions ☆297 Updated this week
- NPBench - A Benchmarking Suite for High-Performance NumPy ☆77 Updated this week
- Example python (numpy) -- CUDA installable package with a C-extension library ☆142 Updated 5 years ago
- Parallel NumPy seamlessly speeds up NumPy for large arrays (64K+ elements) with no change required to existing code. ☆60 Updated 4 years ago
- Turn SymPy expressions into trainable JAX expressions. ☆328 Updated 3 weeks ago
- Concise deep learning for JAX ☆184 Updated 4 years ago
- common in-memory tensor structure ☆938 Updated last week
- A modular system for machinable research code ☆35 Updated last month
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm ☆198 Updated 2 months ago
- KvikIO - High Performance File IO ☆181 Updated this week