The Foundation for All Legate Libraries
☆238Mar 18, 2026Updated this week
Alternatives and similar repositories for legate
Users that are interested in legate are comparing it to the libraries listed below
Sorting:
- NumPy and SciPy on Multi-Node Multi-GPU systems☆967Updated this week
- An Aspiring Drop-In Replacement for Pandas at Scale☆74Oct 19, 2021Updated 4 years ago
- Legate Hello World Pedagogical Library☆10Apr 5, 2023Updated 2 years ago
- Legate Sparse is a Legate library that aims to provide a distributed and accelerated drop-in replacement for the scipy.sparse library on …☆24Feb 14, 2026Updated last month
- GBM implementation on Legate☆14Jan 28, 2026Updated last month
- Microbenchmarks showing relative performance of different Python functions/patterns.☆13Oct 3, 2025Updated 5 months ago
- ☆11Jul 13, 2022Updated 3 years ago
- Contains the xSDK community policies. The master branch is the latest accepted version of the policies and will be applied to future xSDK…☆11Jun 14, 2024Updated last year
- best CPU/GPU sparse solver for large sparse matrices☆21Oct 5, 2021Updated 4 years ago
- Tools and libraries for writing Kokkos-enabled HPC C++ in E3SM ecosystem☆20Updated this week
- An Attention Superoptimizer☆22Jan 20, 2025Updated last year
- [ARCHIVED] cuDF [alpha] - RAPIDS Merge of GoAi into cuDF☆35Oct 28, 2018Updated 7 years ago
- Unified Collective Communication Library☆296Mar 12, 2026Updated last week
- cuNumeric.jl wraps the cuPyNumeric C++ API providing a simple array programming interface that executes code on distributed clusters.☆19Mar 3, 2026Updated 2 weeks ago
- CUDA Kernel Benchmarking Library☆831Updated this week
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆31Apr 2, 2025Updated 11 months ago
- Example codes demonstrating the use of various XSDK packages in combination.☆19Jun 1, 2023Updated 2 years ago
- [ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl☆2,308Feb 7, 2024Updated 2 years ago
- Implementation of AMD HIP for CPUs☆22Jun 16, 2020Updated 5 years ago
- The Exascale Computing Project Software Technologies Capability Assessment Report - Public Version☆21Aug 18, 2022Updated 3 years ago
- Standard interface for collecting HPC run metadata☆16Nov 7, 2025Updated 4 months ago
- List all available information about all SYCL devices and platforms☆15Sep 14, 2020Updated 5 years ago
- Distributed SDDMM Kernel☆12Jul 8, 2022Updated 3 years ago
- MPI accelerator-integrated communication extensions☆40Apr 4, 2023Updated 2 years ago
- A task benchmark☆44Aug 5, 2024Updated last year
- DaCe - Data Centric Parallel Programming☆579Mar 13, 2026Updated last week
- A tracing JIT compiler for PyTorch☆13Dec 11, 2021Updated 4 years ago
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster☆874Sep 26, 2025Updated 5 months ago
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).☆569Sep 15, 2025Updated 6 months ago
- Code examples for "Under the hood of calling C/C++ from Python"☆13Sep 16, 2020Updated 5 years ago
- A simple yet powerful tool to turn traditional container/OS images into unprivileged sandboxes.☆913Feb 18, 2026Updated last month
- Utilities for Dask and CUDA interactions☆319Updated this week
- Python bindings for UCX☆139Sep 18, 2025Updated 6 months ago
- Collection of scripts to build PyTorch and the domain libraries from source.☆14Feb 4, 2026Updated last month
- Abstraction Library for Parallel Kernel Acceleration☆408Mar 13, 2026Updated last week
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl☆1,821Oct 9, 2023Updated 2 years ago
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm☆213Mar 3, 2026Updated 2 weeks ago
- LaunchMON is a software infrastructure that enables HPC run-time tools to co-locate tool daemons with a parallel job. Its API allows a to…☆13Feb 11, 2026Updated last month
- ☆22Aug 28, 2020Updated 5 years ago