A GPU benchmark suite for autotuners
☆19Feb 20, 2024Updated 2 years ago
Alternatives and similar repositories for BAT
Users that are interested in BAT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Using C++ magic to capture CUDA kernels and tune them with Kernel Tuner☆21Sep 12, 2025Updated 6 months ago
- Kernel Tuner☆389Mar 17, 2026Updated last week
- PyTorch block-diagonal ODE CUDA solver, designed for gradient-based optimization☆16Apr 27, 2020Updated 5 years ago
- Instructions and templates for SC authors☆17Aug 22, 2021Updated 4 years ago
- ☆17Dec 8, 2023Updated 2 years ago
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code☆15Mar 19, 2023Updated 3 years ago
- CLTune: An automatic OpenCL & CUDA kernel tuner☆185Dec 12, 2022Updated 3 years ago
- OCCA Python API: JIT Compilation for Multiple Architectures☆11Dec 20, 2019Updated 6 years ago
- RISC-V vector extension ISA simulation☆17Jun 11, 2019Updated 6 years ago
- PIRA - Automatic Instrumentation Refinement☆16Mar 28, 2024Updated last year
- Virtual programming language☆10Dec 5, 2022Updated 3 years ago
- 🔮 High-performance kaleidoscope effects for real-time applications☆15Mar 16, 2026Updated last week
- Distributed machine learning platform☆13Aug 20, 2015Updated 10 years ago
- A series of high-performance GEMM (General Matrix Multiply) implementations Iteratively optimised for H100 GPUs in Pure CUDA.☆73Feb 18, 2026Updated last month
- Base container for developing C++ and Fortran HPC applications☆18Jun 14, 2022Updated 3 years ago
- a lightweight, semi-automated setup guide for HashiStack: Consul + Vault + Nomad, on Footloose powered Docker "container VMs", with Ansib…☆11Jul 2, 2021Updated 4 years ago
- ☆32Nov 13, 2020Updated 5 years ago
- ☆80Jan 6, 2026Updated 2 months ago
- Forslag til norske oversettelser av git-begreper☆37Aug 7, 2019Updated 6 years ago
- Automated bottleneck detection and solution orchestration☆19Feb 24, 2026Updated 3 weeks ago
- A short introduction to modern Fortran☆13Feb 28, 2024Updated 2 years ago
- UCAS网络登录☆13Nov 17, 2018Updated 7 years ago
- ☆13Nov 1, 2021Updated 4 years ago
- Dark channel Haze removal algorithm with CUDA acceleration (typically 10x or more speedup using a Nvidia GPU)☆14Dec 7, 2017Updated 8 years ago
- A quick way of spawning many batch jobs☆14Oct 24, 2022Updated 3 years ago
- Ansible config for Cluster in the Cloud☆11Apr 25, 2024Updated last year
- Terraform project to create a cli for drift detection☆18Jun 19, 2025Updated 9 months ago
- TUI for browsing, canceling, and inspecting SLURM jobs☆13Nov 13, 2023Updated 2 years ago
- cuPC: CUDA-based Parallel PC Algorithm for Causal Structure Learning on GPU☆16Mar 19, 2021Updated 5 years ago
- GPU-powered stochastic MPC for drinking water networks☆16Sep 12, 2022Updated 3 years ago
- ☆12May 18, 2024Updated last year
- CUDA solutions for the lab assignments in the UIUC-ECE408 Applied Parallel Programming course.☆19Apr 18, 2023Updated 2 years ago
- study of cutlass☆22Nov 10, 2024Updated last year
- ☆32Jul 2, 2025Updated 8 months ago
- Scripts for running various benchmarks on Isambard and other systems.☆29May 13, 2021Updated 4 years ago
- PyDTNN - Python Distributed Training of Neural Networks☆14Feb 20, 2026Updated last month
- ☆20Sep 28, 2024Updated last year
- WIP · CUDA compatibility for Blaze · https://bitbucket.org/blaze-lib/blaze☆21Nov 18, 2019Updated 6 years ago
- ☆13Sep 19, 2024Updated last year