IntelPython / numba-dpex
Data Parallel Extension for Numba
☆77Updated this week
Related projects ⓘ
Alternatives and complementary repositories for numba-dpex
- Python SYCL bindings and SYCL-based Python Array API library☆102Updated this week
- Data Parallel Extension for NumPy☆99Updated this week
- Benchmark suite to evaluate Data Parallel Extensions for Python☆17Updated 2 months ago
- ☆36Updated this week
- Analyze graph/hierarchical performance data using pandas dataframes☆107Updated last month
- POC work on MLIR backend☆50Updated 3 months ago
- Deploy Dask using MPI4Py☆52Updated last month
- oneAPI Technical Advisory Board (TAB) Meeting Notes☆72Updated 9 months ago
- NPBench - A Benchmarking Suite for High-Performance NumPy☆73Updated this week
- The Foundation for All Legate Libraries☆193Updated last week
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆27Updated this week
- Experimental plugin for scikit-learn to be able to run (some estimators) on Intel GPUs via numba-dpex.☆15Updated 8 months ago
- OpenMP for Python in Numba☆78Updated this week
- ROCm SPARSE marshalling library☆69Updated this week
- NVIDIA Math Libraries for the Python Ecosystem☆207Updated this week
- Next generation LAPACK implementation for ROCm platform☆95Updated this week
- An implementation of BLAS using the SYCL open standard.☆259Updated 3 weeks ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆93Updated 3 weeks ago
- Sample configuration files for using oneAPI in CI systems☆93Updated this week
- Python bindings for OpenSHMEM☆14Updated this week
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆35Updated 2 months ago
- An Aspiring Drop-In Replacement for Pandas at Scale☆74Updated 3 years ago
- ☆233Updated this week
- oneAPI Level Zero Conformance & Performance test content☆47Updated this week
- KvikIO - High Performance File IO☆159Updated this week
- Next generation FFT implementation for ROCm☆177Updated this week
- ☆71Updated this week
- Python bindings for UCX☆121Updated this week
- Creates performance portable libraries with embedded source representations.☆21Updated 4 months ago
- A tracing infrastructure for heterogeneous computing applications.☆25Updated last week