Michalos88 / Randomized_SVD_in_CUDA
FAST Randomized SVD on a GPU with CUDA 🏎️
☆11Updated 5 years ago
Alternatives and similar repositories for Randomized_SVD_in_CUDA:
Users that are interested in Randomized_SVD_in_CUDA are comparing it to the libraries listed below
- Benchmarking OpenBLAS on the Apple M1☆18Updated 4 years ago
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆44Updated last week
- Next generation library for iterative sparse solvers for ROCm platform☆79Updated this week
- cuASR: CUDA Algebra for Semirings☆35Updated 2 years ago
- NPBench - A Benchmarking Suite for High-Performance NumPy☆80Updated 2 weeks ago
- Experimental plugin for scikit-learn to be able to run (some estimators) on Intel GPUs via numba-dpex.☆16Updated last year
- No-GIL Python environment featuring NVIDIA Deep Learning libraries.☆57Updated last week
- ☆16Updated 7 months ago
- Distributed Communication-Optimal LU-factorization Algorithm☆12Updated 3 years ago
- A tracing infrastructure for heterogeneous computing applications.☆31Updated this week
- ☆23Updated 2 weeks ago
- Exploring using stdpar and Cython☆33Updated 4 years ago
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆51Updated last week
- CUDA Templates for Linear Algebra Subroutines☆20Updated this week
- Linnea is an experimental tool for the automatic generation of optimized code for linear algebra problems.☆68Updated 3 years ago
- FFTX Project☆24Updated 4 months ago
- ROCm SPARSE marshalling library☆67Updated this week
- Get started with your NVIDIA Arm HPC Developers Kit!☆33Updated 2 years ago
- Reference implementation of the draft C++ GraphBLAS specification.☆32Updated 2 months ago
- Round matrix elements to lower precision in MATLAB☆36Updated 2 years ago
- A tracing JIT compiler for PyTorch☆13Updated 3 years ago
- Random number library that generate pseudo-random and quasi-random numbers.☆26Updated last week
- Benchmarks to capture important workloads.☆31Updated 2 months ago
- MPI accelerator-integrated communication extensions☆33Updated 2 years ago
- MGARD: MultiGrid Adaptive Reduction of Data☆40Updated last month
- MLIR tools and dialect for GraphBLAS☆18Updated 3 years ago
- ☆13Updated 5 years ago
- Legate Sparse is a Legate library that aims to provide a distributed and accelerated drop-in replacement for the scipy.sparse library on …☆20Updated last week
- A task benchmark☆41Updated 8 months ago
- AMD’s C++ library for accelerating tensor primitives☆39Updated last week