gptune / GPTune
☆68 Updated last week
Alternatives and similar repositories for GPTune:
Users who are interested in GPTune are comparing it to the libraries listed below.
- HiCMA: Hierarchical Computations on Manycore Architectures ☆30 Updated last year
- ☆15 Updated 3 years ago
- H2 Matrix Package ☆26 Updated last year
- Library of GPU-resident linear solvers ☆60 Updated this week
- ytopt: machine-learning-based autotuning and hyperparameter optimization framework using Bayesian Optimization ☆47 Updated this week
- Round matrix elements to lower precision in MATLAB ☆36 Updated 2 years ago
- Tensor Contraction Code Generator ☆36 Updated 7 years ago
- A searchable Python interface to the SuiteSparse Matrix Collection ☆43 Updated 2 years ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targeting current and upcoming high-performance computing (HPC) sy… ☆106 Updated last month
- Automating the Design of Multigrid Methods with Evolutionary Program Synthesis ☆13 Updated this week
- Next generation library for iterative sparse solvers for ROCm platform ☆78 Updated this week
- XBraid Parallel-in-Time Solvers ☆74 Updated 5 months ago
- An implementation of the 1. Parallel, 2. Streaming, 3. Randomized SVD using MPI4Py ☆58 Updated 3 years ago
- H2Opus: a performance-oriented library for hierarchical matrices ☆13 Updated 2 years ago
- Distributed Communication-Optimal LU-factorization Algorithm ☆12 Updated 3 years ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH) ☆105 Updated last year
- Fast gradient evaluation in C++ based on Expression Templates. ☆94 Updated last month
- ☆20 Updated 2 months ago
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037 ☆39 Updated last year
- Tensor decomposition with arbitrary expressions: inner, outer, elementwise operators; nonlinear transformations; and more. ☆58 Updated 2 years ago
- H2Lib public repository ☆53 Updated 2 years ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development. ☆35 Updated 4 months ago
- Zoltan Dynamic Load Balancing and Graph Algorithm Toolkit -- Distribution site ☆34 Updated last year
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data ☆31 Updated 3 months ago
- PyMGRIT is a package for the Multigrid-Reduction-in-Time (MGRIT) algorithm in Python. ☆18 Updated 2 years ago
- Linnea is an experimental tool for the automatic generation of optimized code for linear algebra problems. ☆68 Updated 3 years ago
- DBCSR: Distributed Block Compressed Sparse Row matrix library ☆139 Updated this week
- RAJA Performance Suite ☆118 Updated this week
- PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core … ☆53 Updated this week
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear … ☆70 Updated 2 months ago