gptune / GPTune
☆74Updated 3 months ago
Alternatives and similar repositories for GPTune
Users who are interested in GPTune are comparing it to the libraries listed below.
- ☆57Updated 2 weeks ago
- A searchable Python interface to the SuiteSparse Matrix Collection☆48Updated 3 years ago
- HiCMA: Hierarchical Computations on Manycore Architectures☆30Updated 2 years ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targeting current and upcoming high-performance computing (HPC) sy…☆119Updated last week
- Round matrix elements to lower precision in MATLAB☆37Updated 2 years ago
- XBraid Parallel-in-Time Solvers☆78Updated 3 weeks ago
- Library of GPU-resident linear solvers☆63Updated this week
- H2 Matrix Package☆30Updated last year
- Tensor Contraction Code Generator☆37Updated 7 years ago
- Automating the Design of Multigrid Methods with Evolutionary Program Synthesis☆12Updated 3 months ago
- Next generation library for iterative sparse solvers for ROCm platform☆81Updated this week
- Structured Matrix Package (LBNL)☆173Updated 2 weeks ago
- Distributed Communication-Optimal LU-factorization Algorithm☆12Updated 3 years ago
- ☆29Updated 2 weeks ago
- Fast gradient evaluation in C++ based on Expression Templates.☆96Updated 3 weeks ago
- ☆15Updated 4 years ago
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆41Updated last year
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri…☆20Updated 2 years ago
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear …☆75Updated this week
- ytopt: machine-learning-based autotuning and hyperparameter optimization framework using Bayesian Optimization☆48Updated last week
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆108Updated 2 years ago
- PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core …☆59Updated 2 weeks ago
- Highly Efficient FFT for Exascale☆38Updated last year
- MagmaDNN: a simple deep learning framework in C++☆49Updated 4 years ago
- GPU accelerated multigrid library for Python☆59Updated 8 months ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆35Updated last month
- RAJA Performance Suite☆117Updated last week
- An implementation of the 1. Parallel, 2. Streaming, 3. Randomized SVD using MPI4Py☆61Updated 4 years ago
- Resources for the introductory GPU programming course of the HPC-AI specialized master's program☆22Updated last year
- Error-Free Transformations as building blocks for compensated algorithms☆15Updated 2 years ago