nv-legate / legateLinks
The Foundation for All Legate Libraries
☆232Updated this week
Alternatives and similar repositories for legate
Users that are interested in legate are comparing it to the libraries listed below
Sorting:
- The CUDA target for Numba☆210Updated this week
- An Aspiring Drop-In Replacement for Pandas at Scale☆74Updated 4 years ago
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆52Updated this week
- Python SYCL bindings and SYCL-based Python Array API library☆117Updated last week
- Python bindings for UCX☆140Updated last month
- Data Parallel Extension for NumPy☆118Updated this week
- RAPIDS Memory Manager☆656Updated this week
- KvikIO - High Performance File IO☆231Updated this week
- NumPy and SciPy on Multi-Node Multi-GPU systems☆948Updated this week
- Analyze graph/hierarchical performance data using pandas dataframes☆115Updated 3 weeks ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆360Updated this week
- A suite of benchmarks for CPU and GPU performance of the most popular high-performance libraries for Python☆333Updated last year
- Data Parallel Extension for Numba☆86Updated last month
- Legate Sparse is a Legate library that aims to provide a distributed and accelerated drop-in replacement for the scipy.sparse library on …☆23Updated last month
- Kernel Tuner☆372Updated this week
- NVIDIA Math Libraries for the Python Ecosystem☆532Updated 2 months ago
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou…☆466Updated 2 weeks ago
- Utilities for Dask and CUDA interactions☆316Updated this week
- NPBench - A Benchmarking Suite for High-Performance NumPy☆89Updated last month
- Machine Learning for HPC Workflows☆142Updated 3 weeks ago
- A code generator for array-based code on CPUs and GPUs☆616Updated last week
- RFC document, tooling and other content related to the array API standard☆260Updated 2 months ago
- RAPIDS GPU-BDB☆108Updated last year
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm☆211Updated 2 weeks ago
- Subset of BLAS routines optimized for NVIDIA GPUs☆73Updated 2 years ago
- Samples demonstrating how to use the Compute Sanitizer Tools and Public API☆89Updated 2 years ago
- ☆51Updated 2 weeks ago
- ☆596Updated 2 weeks ago
- Provide Python access to the NVML library for GPU diagnostics☆249Updated 2 months ago
- Worked example of the process from Python source to CUDA kernel execution with Numba☆42Updated last year