☆80Apr 7, 2026Updated 3 weeks ago
Alternatives and similar repositories for GPTune
Users that are interested in GPTune are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- GPU-friendly Small Non-Linear Solvers (SNLS)☆17Mar 19, 2026Updated last month
- A simple, but fast, triangular solver☆18Mar 22, 2021Updated 5 years ago
- A fast shared & distributed memory task-based runtime in C++☆28Mar 9, 2021Updated 5 years ago
- YAKL is A Kokkos Layer: A simple C++ framework for performance portability and Fortran code porting☆71Apr 23, 2026Updated last week
- Responses to 2021 RFI on Stewardship of Software for Scientific and High-Performance Computing☆16Jan 20, 2022Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Parallel element agglomeration algebraic multigrid upscaling and solvers.☆16Jul 25, 2025Updated 9 months ago
- The SOLAR blackbox optimization problem☆16Mar 26, 2026Updated last month
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data☆35Mar 6, 2026Updated last month
- Modified Shepard Algorithm for Interpolation of Scattered Multivariate Data☆11May 28, 2022Updated 3 years ago
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code☆16Mar 19, 2023Updated 3 years ago
- R Interface to CVODE/CVODES/IDA functions in the SUNDIALS ODE solving C library☆11Apr 19, 2026Updated 2 weeks ago
- ☆17Nov 26, 2023Updated 2 years ago
- Pavilion is a Python 3 (3.6+) based framework for running and analyzing tests targeting HPC systems.☆46Updated this week
- Next generation library for iterative sparse solvers for ROCm platform☆97Apr 23, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆83Updated this week
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆71Apr 21, 2026Updated 2 weeks ago
- Instructions and templates for SC authors☆17Aug 22, 2021Updated 4 years ago
- A GPU benchmark suite for autotuners☆19Feb 20, 2024Updated 2 years ago
- The parGeMSLR is an MPI-based sparse linear system solution/preconditioning package implementation with C++.☆26Aug 21, 2025Updated 8 months ago
- Automates using spack to build and deploy software☆30Mar 13, 2026Updated last month
- ☆17Dec 8, 2023Updated 2 years ago
- UniSparse: An Intermediate Language for General Sparse Format Customization (OOPSLA'24)☆33Nov 12, 2024Updated last year
- 2020 Collegeville Workshop on Scientific Software - Developer Productivity☆12Mar 1, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆19Jan 17, 2024Updated 2 years ago
- Official repo for "The impact of internal variability on benchmarking deep learning climate emulators" in JAMES25 (public)☆22Sep 29, 2025Updated 7 months ago
- Contains the xSDK community policies. The master branch is the latest accepted version of the policies and will be applied to future xSDK…☆11Jun 14, 2024Updated last year
- ☆26Mar 4, 2026Updated 2 months ago
- Modular Expression Language for Ordinary Differential Equation Editing☆12Nov 10, 2021Updated 4 years ago
- Sandia National Laboratories' Albany multiphysics code☆325Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆135Apr 22, 2026Updated last week
- Thompson and Shampine's DDE_SOLVER, a Fortran library for delay differential equations.☆12Apr 8, 2026Updated 3 weeks ago
- pLiner is a framework that helps programmers identify locations in the source of numerical code that are highly affected by compiler opti…☆17Oct 27, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ExaWorks SDK☆11Feb 1, 2024Updated 2 years ago
- Using C++ magic to capture CUDA kernels and tune them with Kernel Tuner☆21Sep 12, 2025Updated 7 months ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆32Apr 2, 2025Updated last year
- ☆12Aug 4, 2025Updated 9 months ago
- Distributed Communication-Optimal LU-factorization Algorithm☆12Aug 1, 2021Updated 4 years ago
- PyTorch block-diagonal ODE CUDA solver, designed for gradient-based optimization☆16Apr 27, 2020Updated 6 years ago
- EPOCH Input System Version 2☆10Jun 5, 2020Updated 5 years ago