☆80Jan 6, 2026Updated last month
Alternatives and similar repositories for GPTune
Users that are interested in GPTune are comparing it to the libraries listed below
Sorting:
- A fast shared & distributed memory task-based runtime in C++☆28Mar 9, 2021Updated 4 years ago
- YAKL is A Kokkos Layer: A simple C++ framework for performance portability and Fortran code porting☆69Sep 9, 2025Updated 5 months ago
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code☆15Mar 19, 2023Updated 2 years ago
- GPU-friendly Small Non-Linear Solvers (SNLS)☆17Oct 24, 2025Updated 4 months ago
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆67Dec 10, 2025Updated 2 months ago
- ☆17Dec 8, 2023Updated 2 years ago
- A simple, but fast, triangular solver☆18Mar 22, 2021Updated 4 years ago
- Responses to 2021 RFI on Stewardship of Software for Scientific and High-Performance Computing☆16Jan 20, 2022Updated 4 years ago
- Pavilion is a Python 3 (3.5+) based framework for running and analyzing tests targeting HPC systems.☆46Feb 25, 2026Updated last week
- ☆18Jan 17, 2024Updated 2 years ago
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data☆34Jan 30, 2026Updated last month
- ☆18Jul 11, 2023Updated 2 years ago
- Very-Low Overhead Checkpointing System☆59Aug 5, 2025Updated 7 months ago
- Asynchronous I/O for HDF5☆24Feb 10, 2026Updated 3 weeks ago
- EPOCH Input System Version 2☆10Jun 5, 2020Updated 5 years ago
- The SOLAR blackbox optimization problem☆16Sep 24, 2025Updated 5 months ago
- OpenMP offload playground☆10Nov 16, 2024Updated last year
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆84Feb 26, 2026Updated last week
- Next generation library for iterative sparse solvers for ROCm platform☆94Feb 23, 2026Updated last week
- A flyweight in situ visualization and analysis runtime for multi-physics HPC simulations☆235Updated this week
- A shared-memory FFT for the Kokkos ecosystem☆48Updated this week
- ☆26Aug 14, 2025Updated 6 months ago
- UniSparse: An Intermediate Language for General Sparse Format Customization (OOPSLA'24)☆33Nov 12, 2024Updated last year
- An MPI wrapper for the pytorch tensor library that is automatically differentiable☆10Mar 27, 2023Updated 2 years ago
- Contains the xSDK community policies. The master branch is the latest accepted version of the policies and will be applied to future xSDK…☆11Jun 14, 2024Updated last year
- ☆14Sep 7, 2023Updated 2 years ago
- Simple Fortran parallel IO benchmark for teaching and benchmarking purposes☆11Nov 25, 2025Updated 3 months ago
- Distributed Communication-Optimal LU-factorization Algorithm☆12Aug 1, 2021Updated 4 years ago
- Distributed SDDMM Kernel☆12Jul 8, 2022Updated 3 years ago
- Modified Shepard Algorithm for Interpolation of Scattered Multivariate Data☆11May 28, 2022Updated 3 years ago
- ExaWorks SDK☆11Feb 1, 2024Updated 2 years ago
- ☆12Aug 4, 2025Updated 7 months ago
- This fork of SWIG creates Fortran wrapper code from C++ headers.☆47Aug 22, 2023Updated 2 years ago
- The parGeMSLR is an MPI-based sparse linear system solution/preconditioning package implementation with C++.☆25Aug 21, 2025Updated 6 months ago
- ☆49Sep 5, 2020Updated 5 years ago
- ytopt: machine-learning-based autotuning and hyperparameter optimization framework using Bayesian Optimization☆49Feb 25, 2026Updated last week
- ☆17Nov 26, 2023Updated 2 years ago
- A PyTorch native platform for training generative AI models☆15Nov 18, 2025Updated 3 months ago
- Parallel element agglomeration algebraic multigrid upscaling and solvers.☆16Jul 25, 2025Updated 7 months ago