nv-legate / legate-sparseLinks
Legate Sparse is a Legate library that aims to provide a distributed and accelerated drop-in replacement for the scipy.sparse library on top of the Legate runtime
☆23Updated this week
Alternatives and similar repositories for legate-sparse
Users that are interested in legate-sparse are comparing it to the libraries listed below
Sorting:
- ☆60Updated last month
- The Foundation for All Legate Libraries☆218Updated this week
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆47Updated last week
- The CUDA target for Numba☆142Updated this week
- ☆75Updated 4 months ago
- ALCF Computational Performance Workshop☆37Updated 2 years ago
- NPBench - A Benchmarking Suite for High-Performance NumPy☆83Updated last month
- Round matrix elements to lower precision in MATLAB☆37Updated 3 years ago
- GPU accelerated multigrid library for Python☆60Updated 9 months ago
- Next generation library for iterative sparse solvers for ROCm platform☆81Updated last week
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm☆206Updated last month
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆120Updated last month
- Material for the SC22 Deep Learning at Scale Tutorial☆41Updated last year
- Intermediate MPI lesson☆28Updated 2 years ago
- ☆29Updated 3 weeks ago
- resources pour le cours d'introduction à la programmation des GPUs du mastère spécialisé HPC-AI☆21Updated last year
- A header-only C++ library for sketching in randomized linear algebra☆91Updated 3 months ago
- A searchable Python interface to the SuiteSparse Matrix Collection☆49Updated 3 years ago
- A web interface for the SuiteSparse Matrix Collection, formerly known as the University of Florida Sparse Matrix Collection☆24Updated 3 weeks ago
- GEMMul8 (GEMMulate): GEMM emulation using int8 matrix engines based on the Ozaki Scheme II☆19Updated last week
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri…☆20Updated 2 years ago
- Data Parallel Extension for Numba☆81Updated 7 months ago
- FP64 equivalent GEMM via Int8 Tensor Cores using the Ozaki scheme☆75Updated 3 months ago
- MagmaDNN: a simple deep learning framework in c++☆49Updated 4 years ago
- MATLAB Code for Parameters of Floating-Point Arithmetics☆8Updated 3 years ago
- Subset of BLAS routines optimized for NVIDIA GPUs☆69Updated 2 years ago
- Machine Learning for HPC Workflows☆136Updated last week
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear …☆78Updated last month
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆56Updated 2 months ago
- An Adaptive Pencil Decomposition Library for NVIDIA GPUs☆63Updated last week