☆80 · Jan 6, 2026 · Updated 2 months ago
Alternatives and similar repositories for GPTune
Users that are interested in GPTune are comparing it to the libraries listed below.
- GPU-friendly Small Non-Linear Solvers (SNLS) ☆17 · Updated this week
- A simple, but fast, triangular solver ☆18 · Mar 22, 2021 · Updated 5 years ago
- A fast shared & distributed memory task-based runtime in C++ ☆28 · Mar 9, 2021 · Updated 5 years ago
- YAKL is A Kokkos Layer: A simple C++ framework for performance portability and Fortran code porting ☆69 · Sep 9, 2025 · Updated 6 months ago
- Responses to 2021 RFI on Stewardship of Software for Scientific and High-Performance Computing ☆16 · Jan 20, 2022 · Updated 4 years ago
- Parallel element agglomeration algebraic multigrid upscaling and solvers. ☆16 · Jul 25, 2025 · Updated 8 months ago
- The SOLAR blackbox optimization problem ☆16 · Sep 24, 2025 · Updated 6 months ago
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data ☆35 · Mar 6, 2026 · Updated 2 weeks ago
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code ☆15 · Mar 19, 2023 · Updated 3 years ago
- Modified Shepard Algorithm for Interpolation of Scattered Multivariate Data ☆11 · May 28, 2022 · Updated 3 years ago
- R Interface to CVODE/CVODES/IDA functions in the SUNDIALS ODE solving C library ☆11 · Jun 11, 2025 · Updated 9 months ago
- ☆17 · Nov 26, 2023 · Updated 2 years ago
- Pavilion is a Python 3 (3.6+) based framework for running and analyzing tests targeting HPC systems. ☆46 · Updated this week
- Next generation library for iterative sparse solvers for ROCm platform ☆96 · Mar 18, 2026 · Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆83 · Mar 17, 2026 · Updated last week
- High Performance Linpack for Next-Generation AMD HPC Accelerators ☆68 · Dec 10, 2025 · Updated 3 months ago
- Instructions and templates for SC authors ☆17 · Aug 22, 2021 · Updated 4 years ago
- A GPU benchmark suite for autotuners ☆19 · Feb 20, 2024 · Updated 2 years ago
- The parGeMSLR is an MPI-based sparse linear system solution/preconditioning package implementation with C++. ☆25 · Aug 21, 2025 · Updated 7 months ago
- Automates using spack to build and deploy software ☆30 · Mar 13, 2026 · Updated last week
- ☆17 · Dec 8, 2023 · Updated 2 years ago
- UniSparse: An Intermediate Language for General Sparse Format Customization (OOPSLA'24) ☆33 · Nov 12, 2024 · Updated last year
- 2020 Collegeville Workshop on Scientific Software - Developer Productivity ☆12 · Mar 1, 2022 · Updated 4 years ago
- ☆19 · Jan 17, 2024 · Updated 2 years ago
- ☆26 · Mar 4, 2026 · Updated 3 weeks ago
- Contains the xSDK community policies. The master branch is the latest accepted version of the policies and will be applied to future xSDK… ☆11 · Jun 14, 2024 · Updated last year
- Modular Expression Language for Ordinary Differential Equation Editing ☆12 · Nov 10, 2021 · Updated 4 years ago
- Sandia National Laboratories' Albany multiphysics code ☆321 · Updated this week
- pLiner is a framework that helps programmers identify locations in the source of numerical code that are highly affected by compiler opti… ☆17 · Oct 27, 2023 · Updated 2 years ago
- ExaWorks SDK ☆11 · Feb 1, 2024 · Updated 2 years ago
- Using C++ magic to capture CUDA kernels and tune them with Kernel Tuner ☆21 · Sep 12, 2025 · Updated 6 months ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs. ☆32 · Apr 2, 2025 · Updated 11 months ago
- ☆12 · Aug 4, 2025 · Updated 7 months ago
- Distributed Communication-Optimal LU-factorization Algorithm ☆12 · Aug 1, 2021 · Updated 4 years ago
- PyTorch block-diagonal ODE CUDA solver, designed for gradient-based optimization ☆16 · Apr 27, 2020 · Updated 5 years ago
- Randomized algorithms for numerical linear algebra in Julia ☆21 · Mar 14, 2023 · Updated 3 years ago
- EPOCH Input System Version 2 ☆10 · Jun 5, 2020 · Updated 5 years ago
- Very-Low Overhead Checkpointing System ☆59 · Aug 5, 2025 · Updated 7 months ago
- Reference implementation of the draft C++ GraphBLAS specification. ☆32 · Feb 19, 2025 · Updated last year