nv-legate / legate
The Foundation for All Legate Libraries
☆216Updated this week
Alternatives and similar repositories for legate:
Users that are interested in legate are comparing it to the libraries listed below
- An Aspiring Drop-In Replacement for Pandas at Scale☆75Updated 3 years ago
- Python bindings for UCX☆134Updated this week
- KvikIO - High Performance File IO☆206Updated this week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆323Updated this week
- RAPIDS Memory Manager☆575Updated this week
- The CUDA target for Numba☆110Updated this week
- A tensor-aware point-to-point communication primitive for machine learning☆257Updated 2 years ago
- ☆537Updated this week
- oneAPI Collective Communications Library (oneCCL)☆232Updated last week
- An Aspiring Drop-In Replacement for NumPy at Scale☆887Updated this week
- NVIDIA Math Libraries for the Python Ecosystem☆308Updated last month
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou…☆381Updated this week
- A suite of benchmarks for CPU and GPU performance of the most popular high-performance libraries for Python☆324Updated 6 months ago
- Data Parallel Extension for NumPy☆108Updated this week
- NPBench - A Benchmarking Suite for High-Performance NumPy☆80Updated last week
- RAPIDS GPU-BDB☆108Updated last year
- Samples demonstrating how to use the Compute Sanitizer Tools and Public API☆81Updated last year
- Python SYCL bindings and SYCL-based Python Array API library☆110Updated this week
- RFC document, tooling and other content related to the array API standard☆235Updated last month
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆44Updated this week
- Analyze graph/hierarchical performance data using pandas dataframes☆113Updated 2 months ago
- A Data-Centric Compiler for Machine Learning☆82Updated last year
- A Library for fast Hash Tables on GPUs☆115Updated 2 years ago
- Legate Sparse is a Legate library that aims to provide a distributed and accelerated drop-in replacement for the scipy.sparse library on …☆20Updated 2 weeks ago
- CUDA Kernel Benchmarking Library☆629Updated this week
- Reference implementations of MLPerf™ HPC training benchmarks☆47Updated 2 months ago
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster☆697Updated 2 months ago
- Home for OctoML PyTorch Profiler☆112Updated 2 years ago
- ROCm BLAS marshalling library☆140Updated this week
- Kernel Tuner☆331Updated this week